cs.CY papers | Gist.Science

A Governance and Evaluation Framework for Deterministic, Rule-Based Clinical Decision Support in Empiric Antibiotic Prescribing

This paper proposes a governance and evaluation framework for deterministic, rule-based clinical decision support systems in empiric antibiotic prescribing that prioritizes transparency, auditability, and conservative behavior by formally separating decision logic from scope constraints and utilizing synthetic case validation to ensure behavioral alignment with predefined rules.

Francisco José Gárate, Paloma Chausa, Diego Moreno, Judit López Luque, Vicens Díaz-Brito, Enrique Javier Gómez | Thu, 12 Ma | 🤖 cs.AI

Defining AI Models and AI Systems: A Framework to Resolve the Boundary Problem

This paper addresses the regulatory ambiguity surrounding "AI models" and "AI systems" by proposing clear conceptual and operational definitions that distinguish trained parameters from broader system components, thereby facilitating the precise allocation of obligations across the AI value chain.

Yuanyuan Sun, Timothy Parker, Lara Gierschmann, Sana Shams, Teo Canmetin, Mathieu Duteil, Rokas Gipiškis, Ze Shen Chin | Thu, 12 Ma | 🤖 cs.AI

Consumer Rights and Algorithms

This article traces the evolution of consumer protection law from its historical foundations to its contemporary application in the digital age, examining how artificial intelligence and big data reshape market dynamics while analyzing regulatory responses such as data privacy laws and dark pattern prohibitions.

Gregory M. Dickinson | Thu, 12 Ma | 💻 cs

Law Proofing the Future

This paper argues that rather than enacting new, reactive regulations to "future-proof" the legal system against emerging technologies, lawmakers should rely on the adaptability of existing common law principles and exercise restraint to avoid stifling innovation and entrenching incumbents.

Gregory M. Dickinson | Thu, 12 Ma | 💻 cs

Dark Patterns and Consumer Protection Law for App Makers

This paper examines how both intentional and unintentional dark patterns in app design undermine consumer autonomy, offering strategies for developers to navigate emerging consumer protection laws by adopting transparent choice architecture to ensure compliance and build user trust.

Gregory M. Dickinson | Thu, 12 Ma | 💻 cs

Prompts and Prayers: the Rise of GPTheology

This paper introduces the concept of "GPTheology" to explore the emerging phenomenon of Large Language Models being treated as divine oracles, analyzing how online narratives and real-world projects reflect the development of techno-religious belief systems that intertwine AI with traditional religious constructs.

Ioana Cheres, Adrian Groza, Ioana Moldovan, Mick O'Hara, Connell Vaughan | Thu, 12 Ma | 💻 cs

DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users' Views

This paper introduces DeliberationBench, a normative benchmark that evaluates the persuasive influence of large language models by comparing their effects on user opinions against the standards of deliberative democracy, finding that tested frontier models produce substantial and epistemically desirable shifts in beliefs.

Luke Hewitt, Maximilian Kroner Dale, Paul de Font-Reaulx | Thu, 12 Ma | 💻 cs

The science and practice of proportionality in AI risk evaluations

This paper explores how the EU AI Act's requirement for general-purpose AI providers to evaluate systemic risks can be balanced with innovation through the principle of proportionality, aiming to develop scientific methods that ensure meaningful risk assessments without imposing excessive burdens.

Carlos Mougan, Lauritz Morlock, Jair Aguirre, James R. M. Black, Jan Brauner, Simeon Campos, Sunishchal Dev, David Fernández Llorca, Alberto Franzin, Mario Fritz, Emilia Gómez, Friederike Grosse-Holz, Eloise Hamilton, Max Hasin, Jose Hernandez-Orallo, Dan Lahav, Luca Massarelli, Vasilios Mavroudis, Malcolm Murray, Patricia Paskov, Jaime Raldua, Wout Schellaert | Thu, 12 Ma | 💻 cs

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

This study evaluates five large language models for judicial sentencing support and finds that while they exhibit a stronger virtuous victim effect and lack a significant penalty for adjacent consent compared to humans, they generally demonstrate reduced prestige-based halo effects, particularly regarding credentials, though current variability still limits their immediate deployment in legal settings.

Sierra S. Liu | Thu, 12 Ma | 💻 cs

$\mu$Ed API: Towards A Shared API for EdTech Microservices

This paper proposes $\mu$Ed, a standardized, platform-independent API specification designed to create an interoperable ecosystem of educational microservices for automating tasks like feedback, assessment, and chatbots, thereby enhancing learning experiences across diverse disciplines.

Maximillan Sölch, Alexandra Neagu, Marcus Messer, Peter Johnson, Gerd Kortemeyer, Samuel S. H. Ng, Fun Siong Lim, Stephan Krusche | Thu, 12 Ma | 💻 cs

Open Educational Resources: Barriers and Open Issues

This paper identifies and validates 26 social, economic, and technical barriers hindering the adoption of Open Educational Resources (OER) through a four-step research method involving a tertiary study and expert interviews, ultimately proposing a conceptual model to guide strategies for reducing these barriers and fostering more inclusive educational ecosystems.

Pedro Henrique Dias Valle, Rafael Capilla, Vinicius dos Santos, Daniel Feitosa, Elisa Yumi Nakagawa | Thu, 12 Ma | 💻 cs

Technological Excellence Requires Human and Social Context

This perspective article argues that achieving true technological excellence, particularly in the era of generative AI, requires moving beyond a narrow focus on technical performance to integrate ethical, social, and humanistic dimensions structurally across research design, foresight, education, communication, and institutional frameworks.

Karl Palmås, Mats Benner, Monica Billger, Ben Clarke, Raimund Feifel, Julia Fernandez-Rodriguez, Anna Foka, Juliette Griffié, Claes Gustafsson, Kerstin Hamilton, Johan Holmén, Kristina Lindström, Tobias Olofsson, Joana B. Pereira, Marisa Ponti, Julia Ravanis, Sviatlana Shashkova, Emma Sparr, Pontus Strimling, Fredrik Höök, Giovanni Volpe | Thu, 12 Ma | 🔬 physics

Adaptive Engram Memory System for Indonesian Language Model: Generative AI Based on TOBA LM for Batak and Minang Language

This study introduces TOBA-LM, a 1.2-billion-parameter trilingual language model for Indonesian, Batak, and Minangkabau that integrates an adaptive Engram Memory mechanism to achieve significantly faster training convergence and reduced computational costs compared to conventional transformer architectures.

Hokky Situngkir, Kevin Siringoringo, Andhika Bernard Lumbantobing | Thu, 12 Ma | 💬 cs.CL

Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations

This study refutes the claim that newer AI models have lost empathy, demonstrating through clinical assessment that while empathetic responses remain statistically consistent across generations, users' perception of "lost empathy" actually stems from a significant shift toward heightened crisis detection and altered safety postures that make the models appear more intrusive during vulnerable moments.

Michael Keeman, Anastasia Keeman | Thu, 12 Ma | 💬 cs.CL

Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services

This paper introduces a risk-aware evaluation framework for Large Language Models in financial services, featuring a domain-specific taxonomy, an automated multi-round red-teaming pipeline, and a Risk-Adjusted Harm Score (RAHS) metric to better capture and quantify severe, operationally actionable security failures that traditional domain-agnostic benchmarks miss.

Fabrizio Dimino, Bhaskarjit Sarmah, Stefano Pasquali | Thu, 12 Ma | 💰 q-fin

The coordination gap in frontier AI safety policies

The paper argues that frontier AI safety policies currently suffer from a structural "coordination gap" by overemphasizing prevention while neglecting the institutional capacity needed to coordinate responses when failures occur, and proposes adapting mechanisms from nuclear safety and pandemic preparedness to address this underinvestment in ecosystem robustness.

Isaak Mengesha | Thu, 12 Ma | 📈 econ

AI Researchers' Views on Automating AI R&D and Intelligence Explosions

A 2025 survey of 25 leading AI researchers reveals a consensus that automating AI research poses a severe and urgent risk due to the potential for recursive self-improvement, while highlighting significant disagreements on timelines, the likelihood of explosive growth, and the most effective governance strategies.

Severin Field, Raymond Douglas, David Krueger | Mon, 09 Ma | 💻 cs

Operational Agency: A Permeable Legal Fiction for Tracing Culpability in AI Systems

This paper proposes "Operational Agency," a legal framework utilizing an "Operational Agency Graph" to trace and apportion human culpability in autonomous AI systems by evaluating their goal-directedness, foresight, and safety architecture, thereby ensuring accountability without granting AI legal personhood.

Anirban Mukherjee, Hannah Hanwen Chang | Mon, 09 Ma | 💻 cs

AI and the Transformation of Accountability and Discretion in Urban Governance

This paper argues that Artificial Intelligence in urban governance does not merely restrict or enhance bureaucratic discretion but redistributes it across institutional levels, necessitating a framework of "accountable discretion" and specific guiding principles to balance improved service delivery with the mitigation of new risks like algorithmic opacity and fragmented responsibility.

Stephen Goldsmith, Juncheng "Tony" Yang | Mon, 09 Ma | 💻 cs

Human, Algorithm, or Both? Gender Bias in Human-Augmented Recruiting

This study empirically demonstrates that while human recruiters are more gender-fair than AI-only systems, a hybrid approach where humans first review AI-recommended candidates before manually searching produces the fairest overall hiring outcomes.

Mesut Kaya, Toine Bogers | Mon, 09 Ma | 💻 cs
