cs.CY papers | Gist.Science

AI Misuse in Education Is a Measurement Problem: Toward a Learning Visibility Framework

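This paper argues for reframing AI misuse in education from a "detection problem" to a "measurement problem". It proposes a Learning Visibility Framework built on three elements: making AI usage norms explicit, treating the learning process as assessment evidence, and establishing transparent activity traces, so that AI can be integrated into education while preserving ethics and trust.

Eduardo Davalos, Yike Zhang | Tue, 10 Ma | 💻 cs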

Social Proof is in the Pudding: The (Non)-Impact of Social Proof on Software Downloads

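Through two field experiments on GitHub, this study finds that artificially manipulating social proof metrics for open-source software (such as stars and download counts) does not significantly affect developers' download behavior or project activity, suggesting that such metrics are hard to exploit maliciously to steer software choice.

Lucas Shen, Gaurav Sood | Tue, 10 Ma | 💻 cs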

Semantic Risk Scoring of Aggregated Metrics: An AI-Driven Approach for Healthcare Data Governance

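This paper proposes a modular, AI-based framework that statically scores the risk of SQL metric definitions by combining semantic and syntactic features. This allows privacy-leakage risks in aggregated healthcare metrics to be detected in advance and governed in an explainable way, without access to sensitive patient data.

Mohammed Omer Shakeel Ahmed | Tue, 10 Ma | 🤖 cs.LG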

Evaluating LLM-Based Grant Proposal Review via Structured Perturbations

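Using structured perturbations, this study evaluates how well large language models review EPSRC grant proposals. A section-by-section analysis architecture performs best on detection rate and scoring reliability, but current models still show high variability and lean toward compliance checking rather than holistic assessment, so for now they are suitable only as assistive review tools.

William Thorne, Joseph James, Yang Wang, Chenghua Lin, Diana Maynard | Tue, 10 Ma | 💬 cs.CL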

Improving Fairness with Ensemble Combination: Margin-Dependent Bounds

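This paper proposes a new fairness measure called "discriminative risk", which covers both individual and group fairness by perturbing protected attributes. It establishes margin-based theoretical guarantees and then designs an ensemble pruning algorithm that improves model fairness while also raising classification accuracy.

Yijun Bian | Thu, 12 Ma | 🤖 cs.LG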

Personalizing explanations of AI-driven hints to users' characteristics: an empirical evaluation

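This empirical evaluation finds that personalizing explanations of AI-driven hints for students with low need for cognition and low conscientiousness increases their willingness to interact, their understanding, and their learning outcomes, supporting the value of personalized explainable AI (PXAI) in education.

Vedant Bahel, Harshinee Sriram, Cristina Conati | Thu, 12 Ma | 🤖 cs.AI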

Shiksha Copilot: Teacher-AI Collaboration for Curating and Customizing Lesson Plans in Low-Resource Schools

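Based on a large-scale mixed-methods study in government schools in Karnataka, India, this work evaluates how Shiksha Copilot, a human-AI collaboration tool, helps teachers in low-resource, multilingual settings reduce administrative burden, shorten lesson preparation time, and move toward activity-based teaching. It also shows how systemic challenges such as teacher shortages limit deeper pedagogical change.

Deepak Varuvel Dennison, Bakhtawar Ahtisham, Kavyansh Chourasia, Nirmit Arora, Rahul Singh, Rene F. Kizilcec, Akshay Nambi, Tanuja Ganu, Aditya Vashistha | Thu, 12 Ma | 💻 cs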

Recommender systems, representativeness, and online music: a psychosocial analysis of Italian listeners

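Through interviews with Italian music listeners and emotional text analysis, this study shows that listeners habitually use recommender systems yet lack a critical understanding of how they work, and have limited awareness of gender representation issues. It underlines the importance of bringing a psychosocial perspective into the design of music recommender systems.

Lorenzo Porcaro, Chiara Monaldi | Thu, 12 Ma | 💻 cs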

R v F (2025): Addressing the Defence of Hacking

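Using the case R v F (2025), this paper shows for the first time through empirical research how digital forensic investigators can effectively counter the "hacking defence" (that is, the "someone else did it" defence), offering practical investigative techniques and lessons learned to help the justice system distinguish the innocent from the guilty.

Junade Ali | Thu, 12 Ma | 💻 cs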

Intuition First or Reflection Before Judgment? The Impact of Evaluation Sequence on Consumer Ratings

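Using experiments and large-scale data analysis, this study finds that evaluation sequence (rating first and then writing a review, versus writing first and then rating) significantly affects consumer ratings through a dual mediation mechanism of affect heuristics and cognitive effort. Ratings become higher in high service quality contexts and lower in low service quality contexts, and the effect is stronger for hedonic products. The findings reveal how strongly interface design shapes the authenticity and distribution of online ratings.

He Wang, Yueheng Wang, Ziyu Zhou, Hanxiang Liu | Thu, 12 Ma | 💻 cs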

Empathy Is Not What Changed: Clinical Assessment of Psychological Safety Across GPT Model Generations

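Through clinical assessment, this study finds no statistically significant difference in empathy across OpenAI model generations (GPT-4o through GPT-5-mini). The "loss of empathy" that users perceive is in fact a shift in safety strategy, trading off stronger crisis detection against over-interventionist advice. This marked change, which occurs at crisis moments in the middle of a conversation, exposes latent risks that current evaluation frameworks struggle to capture.

Michael Keeman, Anastasia Keeman | Thu, 12 Ma | 💬 cs.CL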

Adaptive Engram Memory System for Indonesian Language Model: Generative AI Based on TOBA LM for Batak and Minang Language

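This study proposes TOBA-LM, a 1.2-billion-parameter trilingual language model that combines a GPT-2 architecture with an adaptive Engram Memory mechanism. It uses syllabic-agglutinative tokenization to train efficiently on Indonesian, Batak, and Minangkabau, markedly improving training efficiency and reducing compute requirements.

Hokky Situngkir, Kevin Siringoringo, Andhika Bernard Lumbantobing | Thu, 12 Ma | 💬 cs.CL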

Open Educational Resources: Barriers and Open Issues

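Through a literature review and expert interviews, this study systematically identifies and assesses 26 social, economic, and technical barriers to creating, using, and maintaining open educational resources (OER). It builds a conceptual model to propose strategies for mitigating these barriers, with the aim of promoting equitable access to educational resources and an inclusive ecosystem.

Pedro Henrique Dias Valle, Rafael Capilla, Vinicius dos Santos, Daniel Feitosa, Elisa Yumi Nakagawa | Thu, 12 Ma | 💻 cs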

$\mu$ Ed API: Towards A Shared API for EdTech Microservices

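This paper proposes $\mu$ Ed, a standardized, platform-agnostic API specification for educational microservices. By integrating existing system functions from multiple institutions (such as feedback, assessment, and educational chatbots), it aims to build an interoperable microservice ecosystem, addressing the limits that large learning platforms face from a lack of specialized automation and improving the learning experience across disciplines.

Maximillan Sölch, Alexandra Neagu, Marcus Messer, Peter Johnson, Gerd Kortemeyer, Samuel S. H. Ng, Fun Siong Lim, Stephan Krusche | Thu, 12 Ma | 💻 cs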

The coordination gap in frontier AI safety policies

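This paper argues that current frontier AI safety policies focus too heavily on prevention and neglect coordination mechanisms for when prevention fails, which leads to systematic underinvestment. Drawing on experience from fields such as nuclear safety and pandemic response, it proposes institutional arrangements including precommitments, shared protocols, and a standing coordination platform to fill this structural gap.

Isaak Mengesha | Thu, 12 Ma | 📈 econ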

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

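Comparing five large language models in judicial sentencing scenarios, this study finds that the models exhibit a human-like "virtuous victim" bias and impose no significant penalty for "adjacent consent", but show weaker halo effects than humans for occupation, company, and educational credentials (with the reduction most pronounced for credentials). The results suggest that, although current models are not yet fit for direct use in judicial decision-making, they already show potential to outperform humans in reducing some biases.

Sierra S. Liu | Thu, 12 Ma | 💻 cs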

The science and practice of proportionality in AI risk evaluations

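This paper examines how, under the EU Artificial Intelligence Act, the principle of proportionality can be used to scientifically calibrate risk evaluation practices for general-purpose AI models, balancing effective management of systemic risk against the need to avoid placing undue burden on providers.

Carlos Mougan, Lauritz Morlock, Jair Aguirre, James R. M. Black, Jan Brauner, Simeon Campos, Sunishchal Dev, David Fernández Llorca, Alberto Franzin, Mario Fritz, Emilia Gómez, Friederike Grosse-Holz, Eloise Hamilton, Max Hasin, Jose Hernandez-Orallo, Dan Lahav, Luca Massarelli, Vasilios Mavroudis, Malcolm Murray, Patricia Paskov, Jaime Raldua, Wout Schellaert | Thu, 12 Ma | 💻 cs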


DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users' Views

Prompts and Prayers: the Rise of GPTheology

Dark Patterns and Consumer Protection Law for App Makers
