Floating-Point Usage on GitHub: A Large-Scale Study of Statically Typed Languages

This paper presents the first large-scale empirical study of floating-point arithmetic usage in statically typed languages across millions of GitHub repositories, revealing that while existing benchmarks are partially representative, they do not fully capture real-world code patterns, and releasing a dataset of 10 million extracted functions to guide future reasoning techniques.

Andrea Gilot, Tobias Wrigstad, Eva DarulovaWed, 11 Ma💻 cs

Idempotent Slices with Applications to Code-Size Reduction

This paper formalizes the concept of idempotent backward slices and presents a sound, efficient algorithm for extracting them from Gated Static Single Assignment (GSA) form to enable a novel sparse code-size reduction optimization that merges non-contiguous instructions, achieving up to 7.24% size reduction in specific benchmarks.

Rafael Alvarenga de Azevedo, Daniel Augusto Costa de Sa, Rodrigo Caetano Rocha, Fernando Magno Quintão PereiraWed, 11 Ma💻 cs

From State Changes to Creative Decisions: Documenting and Interpreting Traces Across Creative Domains

This paper addresses the limitation of existing creative activity tracing methods that capture state changes without preserving intent or higher-level structure by proposing three complementary domain-specific approaches: a node-based interface for GenAI, a vocabulary of visual cues for visualization authoring, and a semantic history-embedded programming model.

Xiaohan Peng, Sotiris Piliouras, Carl Abou Saada NujaimTue, 10 Ma💻 cs

Mining Beyond the Bools: Learning Data Transformations and Temporal Specifications

This paper proposes a novel approach to mining data-aware temporal specifications from execution traces by combining Syntax Guided Synthesis with a finite-prefix interpretation of Temporal Stream Logic (TSLf_f), enabling the robust and sample-efficient synthesis of reactive programs that capture both data transformations and temporal behaviors.

Sam Nicholas Kouteili, William Fishell, Christian Scaff, Mark Santolucito, Ruzica PiskacTue, 10 Ma💻 cs

Linear Layouts: Robust Code Generation of Efficient Tensor Computation Using F2\mathbb{F}_2

This paper introduces "Linear Layouts," a novel framework that models tensor layouts as linear algebra operations over F2\mathbb{F}_2 to enable generic, efficient, and bug-free layout definitions and conversions for deep learning workloads, successfully integrating with the Triton compiler to overcome the limitations of existing case-by-case approaches.

Keren Zhou, Mario Lezcano, Adam Goucher, Akhmed Rakhmati, Jeff Niu, Justin Lebar, Pawel Szczerbuk, Peter Bell, Phil Tillet, Thomas Raoux, Zahi MoudallalMon, 09 Ma💻 cs

Efficient Selection of Type Annotations for Performance Improvement in Gradual Typing

This paper proposes a lightweight, amortized technique for selecting a subset of type annotations based on data flow to mitigate performance degradation in gradually typed programs, achieving comparable execution speed to existing methods while significantly reducing compilation time.

Senxi Li (University of Tokyo, Japan), Feng Dai (University of Tokyo, Japan), Tetsuro Yamazaki (University of Tokyo, Japan), Shigeru Chiba (University of Tokyo, Japan)Mon, 09 Ma💻 cs

JoinActors: A Modular Library for Actors with Join Patterns

This paper presents an improved, modular version of the JoinActors library for Scala 3 that leverages metaprogramming to provide a developer-friendly API and an extensible architecture for integrating, comparing, and optimizing various join pattern matching algorithms, demonstrating significant performance gains over previous implementations while maintaining semantic correctness.

Ayman Hussein (Technical University of Denmark, Denmark), Philipp Haller (KTH Royal Institute of Technology, Sweden), Ioannis Karras (Technical University of Denmark, Denmark), Hernán Melgratti (University of Buenos Aires, Argentina / CONICET, Argentina), Alceste Scalas (Technical University of Denmark, Denmark), Emilio Tuosto (Gran Sasso Science Institute, Italy)Mon, 09 Ma💻 cs

Evaluating LLMs in the Context of a Functional Programming Course: A Comprehensive Study

This paper evaluates nine state-of-the-art Large Language Models on three new benchmarks (λ\lambdaCodeGen, λ\lambdaRepair, and λ\lambdaExplain) within an OCaml functional programming course, revealing that while top models effectively handle syntax/type corrections and conceptual questions, they solve significantly fewer homework problems in this low-resource language compared to high-resource languages like Python and Java.

Yihan Zhang (McGill University, Canada), Brigitte Pientka (McGill University, Canada), Xujie Si (University of Toronto, USA)Mon, 09 Ma💻 cs

Pitfalls in VM Implementation on CHERI: Lessons from Porting CRuby

This paper identifies and categorizes the specific pitfalls encountered when porting virtual machines to the CHERI architecture, highlighting how common C language assumptions and VM implementation idioms conflict with CHERI's strict memory safety model, and proposes verified workarounds through a case study of porting CRuby.

Hanhaotian Liu (University of Tokyo, Japan), Tetsuro Yamazaki (University of Tokyo, Japan), Tomoharu Ugawa (University of Tokyo, Japan)Mon, 09 Ma💻 cs

Hybrid Structured Editing: Structures for Tools, Text for Users

This paper proposes "Hybrid Structured Editing," a novel approach that bridges the gap between tool builders and users by enforcing structural constraints on code to ensure reliable tool integration while simultaneously providing programmers with a familiar and consistent text-based editing interface.

Tom Beckmann (Hasso Plattner Institute, Germany / University of Potsdam, Germany), Christoph Thiede (Hasso Plattner Institute, Germany / University of Potsdam, Germany), Jens Lincke (Hasso Plattner Institute, Germany / University of Potsdam, Germany), Robert Hirschfeld (Hasso Plattner Institute, Germany / University of Potsdam, Germany)Mon, 09 Ma💻 cs