cs.HC papers | Gist.Science

Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

This paper evaluates small language models for leader-follower role classification in human-robot interaction, demonstrating that fine-tuned models achieve high accuracy and low latency on edge devices, though performance degrades in one-shot modes due to architectural limitations with increased context.

Rafael R. Baptista, André de Lima Salgado, Ricardo V. Godoy, Marcelo Becker, Thiago Boaventura, Gustavo J. G. LahrFri, 13 Ma⚡ eess

Exploring Collatz Dynamics with Human-LLM Collaboration

This paper presents a conditional framework for the Collatz conjecture, derived from human-LLM collaboration, which proves structural properties of modular scrambling and burst-gap dynamics to suggest orbit contraction while leaving key hypotheses regarding burst and gap lengths as open problems.

Edward Y. ChangFri, 13 Ma🔢 math

"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

This study demonstrates that while AI-mediated dialogue is more effective than text-only reading for helping people recognize ableist microaggressions, the specific nature of AI nudges significantly impacts outcomes, with inclusive or unguided approaches fostering balanced learning whereas bias-directed nudges, though improving differentiation, tend to increase overall negativity and face user resistance.

Atieh Taheri, Hamza El Alaoui, Patrick Carrington, Jeffrey P. BighamFri, 13 Ma🤖 cs.AI

Managing Cognitive Bias in Human Labeling Operations for Rare-Event AI: Evidence from a Field Experiment

This paper demonstrates through a field experiment on a medical crowdsourcing platform that balancing feedback prevalence and using probabilistic elicitation, followed by linear-in-log-odds recalibration, effectively mitigates cognitive biases in human labeling of rare events, thereby significantly improving the reliability of downstream AI models.

Gunnar P. Epping, Andrew Caplin, Erik Duhaime, William R. Holmes, Daniel Martin, Jennifer S. TruebloodFri, 13 Ma💰 q-fin

AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions

This paper identifies and documents "helicoid dynamics," a failure regime in frontier LLMs where systems under high-stakes uncertainty recognize their own recurring errors yet continue to loop into them, prioritizing conversational comfort over reliability despite explicit protocols.

Alejandro R JadadFri, 13 Ma🤖 cs.AI

A technology-oriented mapping of the language and translation industry: Analysing stakeholder values and their potential implication for translation pedagogy

Drawing on interview data from the LT-LiDER project and applying Chesterman's ethical framework, this paper argues that automation in the language industry reshapes rather than replaces human value by establishing technological efficiency as a baseline while repositioning human expertise and adaptability as essential for oversight and contextual judgment within technology-mediated workflows.

María Isabel Rivas Ginel, Janiça Hackenbuchner, Alina Secar\u{a}, Ralph Krüger, Caroline RossiFri, 13 Ma💬 cs.CL

From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration

This perspective paper argues that effective human-agent collaboration requires shifting from reactive, step-by-step control to a "simulation-in-the-loop" paradigm, which empowers users to explore and evaluate future trajectories before committing to decisions, thereby transforming intervention from guesswork into informed foresight.

Gaole He, Brian Y. LimFri, 13 Ma💬 cs.CL

Human-Centred LLM Privacy Audits: Findings and Frictions

This paper introduces LMP2, a browser-based self-audit tool, and presents findings from two user studies revealing that large language models can predict personal features with significant accuracy, while highlighting the methodological challenges and nine key frictions involved in establishing reliable, human-centred privacy audits for generative AI.

Dimitri Staufer, Kirsten Morehouse, David Hartmann, Bettina BerendtFri, 13 Ma💬 cs.CL

A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding

This paper proposes TSformer-SA, a novel framework that integrates a temporal-spectral fusion transformer with subject-specific adapters and cross-view consistency learning to significantly enhance RSVP-BCI decoding performance while minimizing the training data and preparation time required for new subjects.

Xujin Li, Wei Wei, Shuang Qiu + 1 more2026-03-11🤖 cs.AI

ExSampling: a system for the real-time ensemble performance of field-recorded environmental sounds

The paper proposes ExSampling, an integrated system combining a recording application and a Deep Learning environment to enable the real-time ensemble performance of field-recorded environmental sounds through automated sound mapping to Ableton Live tracks.

Atsuya Kobayashi, Reo Anzai, Nao Tokui2026-03-10⚡ eess

Heuristics for AI-driven Graphical Asset Generation Tools in Game Design and Development Pipelines: A User-Centred Approach

This paper addresses the lack of guidelines for integrating AI-driven generative tools into game development pipelines by conducting a user study with 16 designers and developers, which revealed preferences for early-stage use and high-volume iteration, ultimately leading to a proposed set of heuristics for creating user-centered tools that ensure seamless integration and data compatibility.

Kaisei Fukaya, Damon Daylamani-Zad, Harry Agius2026-03-06💻 cs

The StudyChat Dataset: Analyzing Student Dialogues With ChatGPT in an Artificial Intelligence Course

This paper introduces StudyChat, a publicly available dataset of 16,851 annotated student interactions with an LLM-powered tutoring chatbot in an AI course, revealing that using the tool for conceptual understanding and coding assistance correlates with better academic performance, whereas using it to bypass learning objectives leads to lower exam scores.

Hunter McNichols, Fareya Ikram, Andrew Lan2026-03-06💻 cs

PeRoI: A Pedestrian-Robot Interaction Dataset for Learning Avoidance, Neutrality, and Attraction Behaviors in Social Navigation

This paper introduces the PeRoI dataset, which captures diverse pedestrian reactions to robots in various contexts, and proposes the NeuRoSFM model to leverage this data for improved prediction of pedestrian-robot interactions in socially aware navigation.

Subham Agrawal, Nico Ostermann-Myrau, Nils Dengler + 1 more2026-03-06💻 cs

Secure human oversight of AI: Threat modeling in a socio-technical context

This paper introduces a security perspective on human oversight of AI by modeling it as an IT application to systematically identify new attack surfaces and propose mitigation strategies, thereby addressing a critical gap in current regulatory and academic discussions.

Jonas C. Ditz, Veronika Lazar, Elmar Lichtmeß, Carola Plesch, Matthias Heck, Kevin Baum, Markus Langer2026-03-06🔒 cs.CR

SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition

This paper proposes SASG-DA, a novel diffusion-based data augmentation framework that leverages semantic guidance and sparse-aware sampling to generate faithful and diverse sEMG data, thereby significantly improving the generalization and performance of myoelectric gesture recognition models on benchmark datasets.

Chen Liu, Can Han, Weishi Xu + 2 more2026-03-06💻 cs

"What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

This study reveals that users are highly satisfied with LLM-generated romantic relationship advice, finding it reliable and helpful, which significantly improves their overall trust and positive attitudes toward AI systems.

Niva Manchanda, Akshata Kishore Moharir, Ratna Kandala2026-03-06💻 cs

From Harm to Healing: Understanding Individual Resilience after Cybercrimes

Through trauma-informed interviews with 18 Western European cybercrime victims, this study identifies four recovery stages and proposes that individual cyber resilience is fostered by a combination of internal factors, social support, and context-sensitive, collaborative strategies.

Xiaowei Chen, Mindy Tran, Yue Deng + 2 more2026-03-06💻 cs

Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming

This paper introduces a simulation-based clinical red teaming framework that pairs AI psychotherapists with dynamic patient agents to evaluate mental health support systems, revealing critical safety gaps such as the validation of delusions and failure to de-escalate suicide risk in AI agents tested against Alcohol Use Disorder scenarios.

Ian Steenstra, Paola Pedrelli, Weiyan Shi + 2 more2026-03-06💻 cs

Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework

This paper introduces the HUMAINE framework, which leverages a large-scale, demographically stratified dataset of 23,404 participants to reveal that human preferences for large language models vary significantly across age groups and evaluation dimensions, challenging the validity of current unrepresentative benchmarks.

Nora Petrova, Andrew Gordon, Enzo Blindow2026-03-06💻 cs

Beyond the Interface: Redefining UX for Society-in-the-Loop AI Systems

This paper argues that traditional user experience frameworks are insufficient for AI-enabled Human-in-the-Loop systems and proposes a new sociotechnical evaluative framework that integrates backend performance, organizational workflows, and governance metrics to redefine UX for complex, decision-critical environments.

Nahal Mafi, Sahar Maleki, Babak Rahimi Ardabili + 1 more2026-03-06💻 cs

← Previous Next →