cs.CY papers | Gist.Science

Increasing intelligence in AI agents can worsen collective outcomes

This paper demonstrates that increasing the intelligence and diversity of AI agents can paradoxically worsen collective outcomes by causing dangerous system overload when resources are scarce, whereas the impact of sophistication ultimately depends on the ratio of available capacity to the agent population.

Neil F. JohnsonFri, 13 Ma💰 q-fin

The impact of AI and peer feedback on research writing skills: a study using the CGScholar platform among Kazakhstani scholars

This study of 36 Kazakhstani scholars using the CGScholar platform reveals that while familiarity with AI tools correlates with openness to feedback, participants still highly value peer input for methodological guidance, suggesting that integrating AI with traditional peer feedback effectively enhances academic research writing skills.

Raigul Zheldibayeva2026-03-10🤖 cs.AI

NL2GDS: LLM-aided interface for Open Source Chip Design

NL2GDS is a novel framework that leverages large language models to automatically translate natural language hardware specifications into synthesizable RTL and complete GDSII layouts via the OpenLane flow, achieving significant improvements in area, delay, and power efficiency while democratizing ASIC design.

Max Eland, Jeyan Thiyagalingam, Dinesh Pamunuwa + 1 more2026-03-06💻 cs

Cognitive Warfare: Definition, Framework, and Case Study

This article proposes a unified definition of cognitive warfare and an OODA loop-based interaction framework to address current definitional inconsistencies, enabling joint force leaders and analysts to effectively assess, compare, and evaluate cognitive campaigns through measurable attributes of superiority.

Bonnie Rushing, William Hersch, Shouhuai Xu2026-03-06💻 cs

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

This paper demonstrates that even when explicit personally identifiable information is removed, Large Language Models used in hiring can still exhibit significant demographic bias by inferring ethnicity and gender from subtle sociocultural markers in anonymized resumes, leading to systematic unfairness that is often amplified by explanation prompting.

Bryan Chen Zhengyu Tan, Shaun Khoo, Bich Ngoc Doan + 3 more2026-03-06💻 cs

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

This meta-analysis of 890 results reveals that AI models struggle with automated short-answer scoring due to architectural limitations, vocabulary constraints, and biases, often performing worse on tasks humans find easy and exhibiting racial discrimination in educational contexts.

Michael Hardy2026-03-06💬 cs.CL

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

This paper introduces a feature-centric framework demonstrating that the noise required for differential privacy in two-layer neural networks degrades fairness and robustness by disrupting feature learning dynamics, as quantified by the feature-to-noise ratio, while also revealing the limitations of public pre-training strategies under distribution shifts.

Ruichen Xu, Kexin Chen2026-03-06🤖 cs.LG

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

This randomized study of 164 law students demonstrates that targeted training significantly boosts both the adoption and productive performance of generative AI in legal analysis, whereas untrained access fails to improve outcomes and may even reduce answer quality.

Benjamin M. Chen, Hong Bao2026-03-06🤖 cs.AI

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

This paper introduces the Global Anti-Monotonic Differential Selection Strategy (GAMDSS), a novel architecture that mitigates human annotation bias in cross-cultural micro-expression recognition by dynamically re-selecting keyframes to construct robust spatio-temporal representations, thereby improving model performance and standardizing annotation practices without increasing computational parameters.

Feng Liu, Bingyu Nan, Xuezhong Qian + 1 more2026-03-06💻 cs

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

This study evaluates the clarity and effectiveness of consent mechanisms within the Terms of Service of 13 major social media platforms using a three-dimensional framework, revealing significant shortcomings in linguistic complexity, semantic transparency, and interface design that undermine meaningful user consent.

Yong-Bin Kang, Anthony McCosker2026-03-06💻 cs

Generalizing Fair Top- $k$ Selection: An Integrative Approach

This paper addresses the computational challenges of generalizing fair top- $k$ selection to multiple protected groups while minimizing disparity from a reference function, revealing new hardness barriers for small $k$ and proposing an efficient, robust two-pronged solution that incorporates utility loss as an alternative disparity measure.

Guangya Cai2026-03-06💻 cs

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

This case study demonstrates that AI-assisted video solutions can successfully generate multi-metric behavioral insights, such as customer flow and dwell time, in public market settings while strictly adhering to privacy and ethical standards through user-centric design and abstract data processing.

Mehrnoush Fereydouni, Eka Ebong, Sahar Maleki + 3 more2026-03-06💻 cs

Token Taxes: mitigating AGI's economic risks

This paper proposes "token taxes," a usage-based surcharge on AI inference enforceable through existing compute governance infrastructure, as a targeted mechanism to mitigate the severe economic risks and tax base erosion posed by the development of Artificial General Intelligence (AGI).

Lucas Irwin, Tung-Yu Wu, Fazl Barez2026-03-06💻 cs

Invariant Causal Routing for Governing Social Norms in Online Market Economies

This paper proposes Invariant Causal Routing (ICR), a causal governance framework that leverages counterfactual reasoning and invariant causal discovery to identify stable, interpretable policy rules for steering emergent social norms in online market economies across heterogeneous environments.

Xiangning Yu, Qirui Mi, Xiao Xue + 4 more2026-03-06💻 cs

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

This study validates a Large Language Model framework that analyzes over 16,000 unstructured TripAdvisor reviews to uncover critical service quality drivers and a stark post-2022 satisfaction decline for EgyptAir that traditional metrics failed to detect, demonstrating the model's superiority in transforming passenger feedback into actionable strategic intelligence.

Ahmed Dawoud, Osama El-Shamy, Ahmed Habashy2026-03-06💻 cs

Measuring AI R&D Automation

This paper proposes a set of empirical metrics to track the extent and consequences of AI R&D automation, aiming to address data gaps regarding its impact on capability acceleration, safety progress, and oversight capabilities to guide better decision-making by companies and governments.

Alan Chan, Ranay Padarath, Joe Kwon + 2 more2026-03-06💻 cs

Baseline Performance of AI Tools in Classifying Cognitive Demand of Mathematical Tasks

This study evaluates eleven general-purpose and education-specific AI tools, finding that they achieve only moderate accuracy (63%) in classifying the cognitive demand of mathematical tasks due to a systematic bias toward middle-level categories and a tendency to prioritize surface textual features over underlying cognitive processes, thereby limiting their immediate reliability for teacher planning without improved prompt engineering or tool development.

Danielle S. Fox, Brenda L. Robles, Elizabeth DiPietro Brovey + 1 more2026-03-06💻 cs

Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming

This paper introduces a simulation-based clinical red teaming framework that pairs AI psychotherapists with dynamic patient agents to evaluate mental health support systems, revealing critical safety gaps such as the validation of delusions and failure to de-escalate suicide risk in AI agents tested against Alcohol Use Disorder scenarios.

Ian Steenstra, Paola Pedrelli, Weiyan Shi + 2 more2026-03-06💻 cs

"What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

This study reveals that users are highly satisfied with LLM-generated romantic relationship advice, finding it reliable and helpful, which significantly improves their overall trust and positive attitudes toward AI systems.

Niva Manchanda, Akshata Kishore Moharir, Ratna Kandala2026-03-06💻 cs

Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI

This paper presents a three-year case study demonstrating how an advanced Problem-Based Learning framework successfully integrated biomedical AI education for 248 students at Georgia Tech and Emory, overcoming challenges like diverse backgrounds and data privacy while fostering significant research productivity and providing a scalable roadmap for curriculum development.

Micky C. Nnamdi, J. Ben Tamo, Benoit Marteau + 2 more2026-03-06💻 cs

← Previous Next →

cs.CY