A Governance and Evaluation Framework for Deterministic, Rule-Based Clinical Decision Support in Empiric Antibiotic Prescribing

This paper proposes a governance and evaluation framework for deterministic, rule-based clinical decision support systems in empiric antibiotic prescribing that prioritizes transparency, auditability, and conservative behavior by formally separating decision logic from scope constraints and utilizing synthetic case validation to ensure behavioral alignment with predefined rules.

Francisco José Gárate, Paloma Chausa, Diego Moreno, Judit López Luque, Vicens Díaz-Brito, Enrique Javier GómezThu, 12 Ma🤖 cs.AI

Defining AI Models and AI Systems: A Framework to Resolve the Boundary Problem

This paper addresses the regulatory ambiguity surrounding "AI models" and "AI systems" by proposing clear conceptual and operational definitions that distinguish trained parameters from broader system components, thereby facilitating the precise allocation of obligations across the AI value chain.

Yuanyuan Sun, Timothy Parker, Lara Gierschmann, Sana Shams, Teo Canmetin, Mathieu Duteil, Rokas Gipiškis, Ze Shen ChinThu, 12 Ma🤖 cs.AI

The science and practice of proportionality in AI risk evaluations

This paper explores how the EU AI Act's requirement for general-purpose AI providers to evaluate systemic risks can be balanced with innovation through the principle of proportionality, aiming to develop scientific methods that ensure meaningful risk assessments without imposing excessive burdens.

Carlos Mougan, Lauritz Morlock, Jair Aguirre, James R. M. Black, Jan Brauner, Simeon Campos, Sunishchal Dev, David Fernández Llorca, Alberto Franzin, Mario Fritz, Emilia Gómez, Friederike Grosse-Holz, Eloise Hamilton, Max Hasin, Jose Hernandez-Orallo, Dan Lahav, Luca Massarelli, Vasilios Mavroudis, Malcolm Murray, Patricia Paskov, Jaime Raldua, Wout SchellaertThu, 12 Ma💻 cs

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

This study evaluates five large language models for judicial sentencing support and finds that while they exhibit a stronger virtuous victim effect and lack a significant penalty for adjacent consent compared to humans, they generally demonstrate reduced prestige-based halo effects, particularly regarding credentials, though current variability still limits their immediate deployment in legal settings.

Sierra S. LiuThu, 12 Ma💻 cs

Open Educational Resources: Barriers and Open Issues

This paper identifies and validates 26 social, economic, and technical barriers hindering the adoption of Open Educational Resources (OER) through a four-step research method involving a tertiary study and expert interviews, ultimately proposing a conceptual model to guide strategies for reducing these barriers and fostering more inclusive educational ecosystems.

Pedro Henrique Dias Valle, Rafael Capilla, Vinicius dos Santos, Daniel Feitosa, Elisa Yumi NakagawaThu, 12 Ma💻 cs

Technological Excellence Requires Human and Social Context

This perspective article argues that achieving true technological excellence, particularly in the era of generative AI, requires moving beyond a narrow focus on technical performance to integrate ethical, social, and humanistic dimensions structurally across research design, foresight, education, communication, and institutional frameworks.

Karl Palmås, Mats Benner, Monica Billger, Ben Clarke, Raimund Feifel, Julia Fernandez-Rodriguez, Anna Foka, Juliette Griffié, Claes Gustafsson, Kerstin Hamilton, Johan Holmén, Kristina Lindström, Tobias Olofsson, Joana B. Pereira, Marisa Ponti, Julia Ravanis, Sviatlana Shashkova, Emma Sparr, Pontus Strimling, Fredrik Höök, Giovanni VolpeThu, 12 Ma🔬 physics

Adaptive Engram Memory System for Indonesian Language Model: Generative AI Based on TOBA LM for Batak and Minang Language

This study introduces TOBA-LM, a 1.2-billion-parameter trilingual language model for Indonesian, Batak, and Minangkabau that integrates an adaptive Engram Memory mechanism to achieve significantly faster training convergence and reduced computational costs compared to conventional transformer architectures.

Hokky Situngkir, Kevin Siringoringo, Andhika Bernard LumbantobingThu, 12 Ma💬 cs.CL

Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations

This study refutes the claim that newer AI models have lost empathy, demonstrating through clinical assessment that while empathetic responses remain statistically consistent across generations, users' perception of "lost empathy" actually stems from a significant shift toward heightened crisis detection and altered safety postures that make the models appear more intrusive during vulnerable moments.

Michael Keeman, Anastasia KeemanThu, 12 Ma💬 cs.CL

Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services

This paper introduces a risk-aware evaluation framework for Large Language Models in financial services, featuring a domain-specific taxonomy, an automated multi-round red-teaming pipeline, and a Risk-Adjusted Harm Score (RAHS) metric to better capture and quantify severe, operationally actionable security failures that traditional domain-agnostic benchmarks miss.

Fabrizio Dimino, Bhaskarjit Sarmah, Stefano PasqualiThu, 12 Ma💰 q-fin

AI and the Transformation of Accountability and Discretion in Urban Governance

This paper argues that Artificial Intelligence in urban governance does not merely restrict or enhance bureaucratic discretion but redistributes it across institutional levels, necessitating a framework of "accountable discretion" and specific guiding principles to balance improved service delivery with the mitigation of new risks like algorithmic opacity and fragmented responsibility.

Stephen Goldsmith, Juncheng "Tony" YangMon, 09 Ma💻 cs