Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

This paper evaluates small language models for leader-follower role classification in human-robot interaction, demonstrating that fine-tuned models achieve high accuracy and low latency on edge devices, though performance degrades in one-shot modes due to architectural limitations with increased context.

Rafael R. Baptista, André de Lima Salgado, Ricardo V. Godoy, Marcelo Becker, Thiago Boaventura, Gustavo J. G. LahrFri, 13 Ma⚡ eess

"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

This study demonstrates that while AI-mediated dialogue is more effective than text-only reading for helping people recognize ableist microaggressions, the specific nature of AI nudges significantly impacts outcomes, with inclusive or unguided approaches fostering balanced learning whereas bias-directed nudges, though improving differentiation, tend to increase overall negativity and face user resistance.

Atieh Taheri, Hamza El Alaoui, Patrick Carrington, Jeffrey P. BighamFri, 13 Ma🤖 cs.AI

Managing Cognitive Bias in Human Labeling Operations for Rare-Event AI: Evidence from a Field Experiment

This paper demonstrates through a field experiment on a medical crowdsourcing platform that balancing feedback prevalence and using probabilistic elicitation, followed by linear-in-log-odds recalibration, effectively mitigates cognitive biases in human labeling of rare events, thereby significantly improving the reliability of downstream AI models.

Gunnar P. Epping, Andrew Caplin, Erik Duhaime, William R. Holmes, Daniel Martin, Jennifer S. TruebloodFri, 13 Ma💰 q-fin

A technology-oriented mapping of the language and translation industry: Analysing stakeholder values and their potential implication for translation pedagogy

Drawing on interview data from the LT-LiDER project and applying Chesterman's ethical framework, this paper argues that automation in the language industry reshapes rather than replaces human value by establishing technological efficiency as a baseline while repositioning human expertise and adaptability as essential for oversight and contextual judgment within technology-mediated workflows.

María Isabel Rivas Ginel, Janiça Hackenbuchner, Alina Secar\u{a}, Ralph Krüger, Caroline RossiFri, 13 Ma💬 cs.CL

Heuristics for AI-driven Graphical Asset Generation Tools in Game Design and Development Pipelines: A User-Centred Approach

This paper addresses the lack of guidelines for integrating AI-driven generative tools into game development pipelines by conducting a user study with 16 designers and developers, which revealed preferences for early-stage use and high-volume iteration, ultimately leading to a proposed set of heuristics for creating user-centered tools that ensure seamless integration and data compatibility.

Kaisei Fukaya, Damon Daylamani-Zad, Harry Agius2026-03-06💻 cs

The StudyChat Dataset: Analyzing Student Dialogues With ChatGPT in an Artificial Intelligence Course

This paper introduces StudyChat, a publicly available dataset of 16,851 annotated student interactions with an LLM-powered tutoring chatbot in an AI course, revealing that using the tool for conceptual understanding and coding assistance correlates with better academic performance, whereas using it to bypass learning objectives leads to lower exam scores.

Hunter McNichols, Fareya Ikram, Andrew Lan2026-03-06💻 cs

Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming

This paper introduces a simulation-based clinical red teaming framework that pairs AI psychotherapists with dynamic patient agents to evaluate mental health support systems, revealing critical safety gaps such as the validation of delusions and failure to de-escalate suicide risk in AI agents tested against Alcohol Use Disorder scenarios.

Ian Steenstra, Paola Pedrelli, Weiyan Shi + 2 more2026-03-06💻 cs