A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs

This paper introduces Preguss, a modular framework that combines static analysis with LLM-aided synthesis to automatically generate and refine interprocedural specifications, enabling highly automated verification of large-scale programs (over 1,000 lines of code) while significantly reducing human effort.

Zhongyi Wang, Tengjie Lin, Mingshuai Chen, Haokun Li, Mingqi Yang, Xiao Yi, Shengchao Qin, Yixing Luo, Xiaofeng Li, Bin Gu, Liqiang Lu, Jianwei YinWed, 11 Ma💻 cs

Floating-Point Usage on GitHub: A Large-Scale Study of Statically Typed Languages

This paper presents the first large-scale empirical study of floating-point arithmetic usage in statically typed languages across millions of GitHub repositories, revealing that while existing benchmarks are partially representative, they do not fully capture real-world code patterns, and releasing a dataset of 10 million extracted functions to guide future reasoning techniques.

Andrea Gilot, Tobias Wrigstad, Eva DarulovaWed, 11 Ma💻 cs

Evaluating Large Language Models for Multilingual Vulnerability Detection at Dual Granularities

This paper presents a comprehensive empirical study evaluating state-of-the-art pre-trained and large language models for multilingual vulnerability detection across seven programming languages at both function and line levels, revealing that instruction-tuned GPT-4o significantly outperforms other models, particularly in identifying high-severity and unique multilingual vulnerabilities.

Honglin Shu, Michael Fu, Junji Yu, Dong Wang, Chakkrit Tantithamthavorn, Junjie Chen, Yasutaka KameiWed, 11 Ma💻 cs

An Empirical Study of Interaction Smells in Multi-Turn Human-LLM Collaborative Code Generation

This paper introduces the concept of "Interaction Smells" in multi-turn human-LLM code generation, establishes a taxonomy based on real-world data, analyzes their distribution across leading models, and proposes the Invariant-aware Constraint Evolution (InCE) framework to effectively mitigate these issues and improve task success rates.

Binquan Zhang, Li Zhang, Lin Shi, Song Wang, Yuwei Qian, Linhui Zhao, Fang Liu, An Fu, Yida YeWed, 11 Ma💻 cs

Preparing Students for AI-Driven Agile Development: A Project-Based AI Engineering Curriculum

This paper presents a project-based AI engineering curriculum that integrates agile practices with generative AI tools to prepare students for modern software development, demonstrating through a seven-sprint case study that embedding AI across the engineering lifecycle fosters hands-on competence while necessitating adaptations for tool evolution and foundational learning verification.

Andreas Rausch, Stefan Wittek, Tobias Geger, David InkermannWed, 11 Ma💻 cs

Experience Report on the Adaptable Integration of Requirements Engineering Courses into Curricula for Professionals

This paper reports on the authors' experience developing three professional software engineering curricula and proposes a systematic, content-mapping-based approach with guiding principles for effectively integrating Requirements Engineering courses into these dynamic and modular programs.

Oleksandr Kosenkov, Konstantin Blaschke, Tony Gorschek, Michael Unterkalmsteiner, Oleksandr Adamov, Davide FucciWed, 11 Ma💻 cs

Can ChatGPT Generate Realistic Synthetic System Requirement Specifications? Results of a Case Study

This case study demonstrates that while ChatGPT can generate realistic synthetic system requirement specifications across multiple industries using iterative prompt engineering, the resulting artifacts still contain significant flaws that necessitate thorough expert evaluation rather than relying solely on LLM-based quality assessments.

Alex R. Mattukat, Florian M. Braun, Horst LichterWed, 11 Ma💻 cs

ToolRosetta: Bridging Open-Source Repositories and Large Language Model Agents through Automated Tool Standardization

ToolRosetta is a unified framework that automatically transforms heterogeneous open-source code repositories into standardized, secure, and executable Model Context Protocol (MCP) tools, enabling LLM agents to autonomously plan and invoke specialized software for complex tasks with minimal human intervention.

Shimin Di, Xujie Yuan, Hanghui Guo, Chaoqian Ouyang, Zhangze Chen, Ling Yue, Libin Zheng, Jia Zhu, Shaowu Pan, Jian Yin, Min-Ling Zhang, Yong RuiWed, 11 Ma💻 cs

AgenticCyOps: Securing Multi-Agentic AI Integration in Enterprise Cyber Operations

This paper introduces AgenticCyOps, a security framework for enterprise multi-agent AI systems that mitigates emerging attack surfaces by formalizing tool orchestration and memory management as primary trust boundaries and applying five defensive principles aligned with global compliance standards to significantly reduce exploitable vulnerabilities in SOC workflows.

Shaswata Mitra, Raj Patel, Sudip Mittal, Md Rayhanur Rahman, Shahram RahimiWed, 11 Ma💻 cs