The impact of AI and peer feedback on research writing skills: a study using the CGScholar platform among Kazakhstani scholars

This study of 36 Kazakhstani scholars using the CGScholar platform reveals that while familiarity with AI tools correlates with openness to feedback, participants still highly value peer input for methodological guidance, suggesting that integrating AI with traditional peer feedback effectively enhances academic research writing skills.

Raigul Zheldibayeva2026-03-10🤖 cs.AI

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

This paper demonstrates that even when explicit personally identifiable information is removed, Large Language Models used in hiring can still exhibit significant demographic bias by inferring ethnicity and gender from subtle sociocultural markers in anonymized resumes, leading to systematic unfairness that is often amplified by explanation prompting.

Bryan Chen Zhengyu Tan, Shaun Khoo, Bich Ngoc Doan + 3 more2026-03-06💻 cs

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

This paper introduces the Global Anti-Monotonic Differential Selection Strategy (GAMDSS), a novel architecture that mitigates human annotation bias in cross-cultural micro-expression recognition by dynamically re-selecting keyframes to construct robust spatio-temporal representations, thereby improving model performance and standardizing annotation practices without increasing computational parameters.

Feng Liu, Bingyu Nan, Xuezhong Qian + 1 more2026-03-06💻 cs

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

This study validates a Large Language Model framework that analyzes over 16,000 unstructured TripAdvisor reviews to uncover critical service quality drivers and a stark post-2022 satisfaction decline for EgyptAir that traditional metrics failed to detect, demonstrating the model's superiority in transforming passenger feedback into actionable strategic intelligence.

Ahmed Dawoud, Osama El-Shamy, Ahmed Habashy2026-03-06💻 cs

Baseline Performance of AI Tools in Classifying Cognitive Demand of Mathematical Tasks

This study evaluates eleven general-purpose and education-specific AI tools, finding that they achieve only moderate accuracy (63%) in classifying the cognitive demand of mathematical tasks due to a systematic bias toward middle-level categories and a tendency to prioritize surface textual features over underlying cognitive processes, thereby limiting their immediate reliability for teacher planning without improved prompt engineering or tool development.

Danielle S. Fox, Brenda L. Robles, Elizabeth DiPietro Brovey + 1 more2026-03-06💻 cs

Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming

This paper introduces a simulation-based clinical red teaming framework that pairs AI psychotherapists with dynamic patient agents to evaluate mental health support systems, revealing critical safety gaps such as the validation of delusions and failure to de-escalate suicide risk in AI agents tested against Alcohol Use Disorder scenarios.

Ian Steenstra, Paola Pedrelli, Weiyan Shi + 2 more2026-03-06💻 cs

Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI

This paper presents a three-year case study demonstrating how an advanced Problem-Based Learning framework successfully integrated biomedical AI education for 248 students at Georgia Tech and Emory, overcoming challenges like diverse backgrounds and data privacy while fostering significant research productivity and providing a scalable roadmap for curriculum development.

Micky C. Nnamdi, J. Ben Tamo, Benoit Marteau + 2 more2026-03-06💻 cs