CovertComBench: A First Domain-Specific Testbed for LLMs in Wireless Covert Communication
This paper introduces CovertComBench, a specialized benchmark for evaluating large language models (LLMs) in wireless covert communication. The evaluation reveals that current models excel at conceptual understanding and code generation but struggle significantly with the rigorous mathematical derivations required for security-constrained optimization.