Developing an AI Assistant for Knowledge Management and Workforce Training in State DOTs

This paper proposes a multi-agent Retrieval-Augmented Generation framework that integrates open-weight large language models and vision-language models to enhance knowledge management and workforce training in state Departments of Transportation by enabling context-aware, evidence-grounded responses from both textual and visual technical documentation.

Divija Amaram, Lu Gao, Gowtham Reddy Gudla + 1 more2026-03-05🤖 cs.AI

Token-Oriented Object Notation vs JSON: A Benchmark of Plain and Constrained Decoding Generation

This benchmark study evaluates Token-Oriented Object Notation (TOON) against JSON for LLM data serialization, finding that while TOON offers promising token efficiency for complex structures via in-context learning, its advantage is often negated by prompt overhead in short contexts and it currently underperforms constrained decoding for simple structures, suggesting its true potential follows a non-linear scaling curve dependent on task complexity.

Ivan Matveev2026-03-05🤖 cs.AI

Can Large Language Models Derive New Knowledge? A Dynamic Benchmark for Biological Knowledge Discovery

To address the limitations of static benchmarks and data contamination in evaluating AI's capacity for knowledge discovery, this paper introduces DBench-Bio, a dynamic, fully automated, and monthly-updated benchmark covering 12 biomedical sub-domains that rigorously assesses the ability of Large Language Models to derive new biological knowledge.

Chaoqun Yang, Xinyu Lin, Shulin Li + 4 more2026-03-05🤖 cs.AI