DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

The paper introduces DIVE, an evidence-driven framework that prioritizes executing diverse real-world tools before reverse-deriving tasks to ensure grounding and structural variety, which significantly enhances the out-of-distribution generalization of tool-using LLMs compared to traditional quantity-focused scaling.

Aili Chen, Chi Zhang, Junteng Liu, Jiangjie Chen, Chengyu Du, Yunji Li, Ming Zhong, Qin Wang, Zhengmao Zhu, Jiayuan Song, Ke Ji, Junxian He, Pengyu Zhao, Yanghua XiaoFri, 13 Ma🤖 cs.AI

Exploration of Evolving Quantum Key Distribution Network Architecture Using Model-Based Systems Engineering

This paper proposes a variability-driven systems engineering framework using Orthogonal Variability Modelling and Systems Modelling Language to systematically model, trace, and evolve Quantum Key Distribution network architectures, thereby addressing the challenges of integrating complex quantum devices into existing classical infrastructure to meet future security needs.

Hayato Ishida, Amal Elsokary, Maria Aslam + 3 more2026-03-10⚛️ quant-ph

A framework for assessing the capabilities of code generation of constraint domain-specific languages with large language models

This paper proposes a generic evaluation framework to assess large language models' ability to generate code for constraint-based domain-specific languages like OCL and Alloy, revealing that while performance is generally lower than for general-purpose languages like Python, strategies such as code repair and multiple generation attempts can significantly improve output quality.

David Delgado, Lola Burgueño, Robert Clarisó2026-03-06💻 cs

Why Do You Contribute to Stack Overflow? Understanding Cross-Cultural Motivations and Usage Patterns before the Age of LLMs

This study investigates cross-cultural differences in Stack Overflow contributor motivations across the US, China, and Russia by combining qualitative profile analysis with quantitative linguistic data, revealing distinct regional patterns such as stronger self-promotion among Americans and learning-oriented engagement among Chinese users to inform strategies for sustaining the knowledge-sharing ecosystem in the age of LLMs.

Sherlock A. Licorish, Elijah Zolduoarrati, Tony Savarimuthu + 3 more2026-03-06💻 cs

Public Sector Open Source Program Offices - Archetypes for how to Grow (Common) Institutional Capabilities

This study identifies six distinct archetypes of Open Source Programme Offices (OSPOs) within European public sector organizations through a qualitative analysis of 16 cases, providing strategic guidance and policy recommendations for designing institutional capabilities that foster OSS adoption, digital sovereignty, and improved service interoperability.

Johan Linåker, Astor Nummelin Carlberg, Ciaran O'Riordan2026-03-06💻 cs

Behaviour Driven Development Scenario Generation with Large Language Models

This paper evaluates GPT-4, Claude 3, and Gemini on a proprietary dataset of 500 user stories to generate Behaviour-Driven Development scenarios, finding that while GPT-4 excels in text similarity, Claude 3 produces the highest quality results according to human and LLM-based experts, with optimal performance dependent on model-specific prompting strategies, high-quality input descriptions, and specific generation parameters.

Amila Rathnayake, Mojtaba Shahin, Golnoush Abaei2026-03-06💻 cs