cs.AI 篇论文 | Gist.Science

Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning

本文提出了一种名为 Ptychi-Evolve 的自主框架，该框架利用大语言模型结合进化机制（如语义引导的交叉与变异）自动发现并演化新的正则化算法，在多种挑战性成像数据集中显著提升了相干衍射成像的重建质量并实现了可解释的算法演化记录。

Xiangyu Yin, Ming Du, Junjing Deng, Zhi Yang, Yimo Han, Yi Jiang2026-03-09🤖 cs.AI

Reasoning Models Struggle to Control their Chains of Thought

该论文通过引入 CoT-Control 评估套件发现，尽管推理模型在控制最终输出方面表现较强，但其控制思维链（CoT）内容的能力显著较弱，且随着模型规模增大、强化学习训练、测试时计算增加或问题难度提升而进一步降低，这表明目前 CoT 监控机制不太可能因模型主动操控思维链而失效。

Chen Yueh-Han, Robert McCarthy, Bruce W. Lee, He He, Ian Kivlichan, Bowen Baker, Micah Carroll, Tomek Korbak2026-03-09🤖 cs.AI

The Rise of AI in Weather and Climate Information and its Impact on Global Inequality

该论文指出，人工智能在地球系统科学中的快速应用若缺乏干预，将因算力与数据基础设施的全球南北差异而加剧气候信息领域的不平等，因此呼吁通过转向以数据为中心的开发模式、建立气候数字公共基础设施以及推动知识共同生产，来确保 AI 革命真正促进全球系统韧性而非加剧不公。

Amirpasha Mozaffari, Amanda Duarte, Lina Teckentrup, Stefano Materia, Gina E. C. Charnley, Lluis Palma, Eulalia Baulenas Serra, Dragana Bojovic, Paula Checchia, Aude Carreric, Francisco Doblas-Reyes2026-03-09🤖 cs.AI

Cultural Perspectives and Expectations for Generative AI: A Global Survey Approach

该论文通过一项涵盖全球多地区的大规模调查，从不同社群中提炼出文化的操作性定义，以评估人们对生成式 AI 如何呈现文化 artifacts、概念及价值观的看法与期望，并最终提出了包括参与式方法、超越地理维度的文化考量以及文化“红线”敏感性框架在内的开发建议。

Erin van Liemt, Renee Shelby, Andrew Smart, Sinchana Kumbale, Richard Zhang, Neha Dixit, Qazi Mamunur Rashid, Jamila Smith-Loud2026-03-09🤖 cs.AI

LTLGuard: Formalizing LTL Specifications with Compact Language Models and Lightweight Symbolic Reasoning

本文提出了 LTLGuard，一种结合约束生成与轻量级形式化一致性检查的模块化工具链，旨在利用资源高效的小型语言模型将非正式需求准确转化为无冲突的线性时序逻辑（LTL）规范。

Medina Andresel, Cristinel Mateis, Dejan Nickovic, Spyridon Kounoupidis, Panagiotis Katsaros, Stavros Tripakis2026-03-09🤖 cs.AI

Revisiting the (Sub)Optimality of Best-of-N for Inference-Time Alignment

该论文通过引入更贴近实际的胜率指标重新审视 Best-of-N（BoN）采样，证明在最小假设下其具有统计最优性，并提出一种能消除奖励黑客攻击且保持最优性能的改进变体。

Ved Sriraman, Adam Block2026-03-09🤖 cs.AI

TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks

本文介绍了 TML-Bench，这是一个针对 Kaggle 风格表格机器学习任务的自主数据科学智能体基准，通过评估 10 个开源大语言模型在不同时间预算下的端到端表现，发现 MiniMax-M2.1 模型综合性能最佳且性能随时间预算增加而提升。

Mykola Pinchuk2026-03-09🤖 cs.AI

Bridging Domains through Subspace-Aware Model Merging

该论文提出了一种名为 SCORE 的新方法，通过计算各模型主奇异向量的共享正交基并剪枝冲突分量，有效解决了多领域微调模型合并时的子空间冲突问题，从而显著提升了模型在未见领域上的泛化性能。

Levy Chaves, Chao Zhou, Rebekka Burkholz, Eduardo Valle, Sandra Avila2026-03-09🤖 cs.AI

Depth Charge: Jailbreak Large Language Models from Deep Safety Attention Heads

该论文提出了名为 SAHA 的新型越狱框架，通过识别深层注意力机制中的脆弱性并采用消融影响排序与分层扰动策略，成功突破了现有大语言模型的安全对齐，显著提升了攻击成功率。

Jinman Wu, Yi Xie, Shiqian Zhao, Xiaofeng Chen2026-03-09🤖 cs.AI

Knowing without Acting: The Disentangled Geometry of Safety Mechanisms in Large Language Models

该论文提出了“解耦安全假设”（DSH），通过几何分析揭示大语言模型中“识别有害性”与“执行拒绝”机制在深层解耦的现象，并据此开发了能实现“只知不行”状态的双差分提取与自适应因果引导方法，进而提出了具有 SOTA 攻击成功率的“拒绝擦除攻击”（REA）。

Jinman Wu, Yi Xie, Shen Lin, Shiqian Zhao, Xiaofeng Chen2026-03-09🤖 cs.AI

PVminerLLM: Structured Extraction of Patient Voice from Patient-Generated Text using Large Language Models

该论文提出了 PVminer 基准及经过监督微调的大语言模型 PVminerLLM，旨在从患者生成的文本中高效提取结构化患者声音信息，实验表明该方法在多种任务上显著优于提示基线，且无需超大模型规模即可实现可扩展的社会与体验信号分析。

Samah Fodeh, Linhai Ma, Ganesh Puthiaraju, Srivani Talakokkul, Afshan Khan, Ashley Hagaman, Sarah Lowe, Aimee Roundtree2026-03-09🤖 cs.AI

Balancing Domestic and Global Perspectives: Evaluating Dual-Calibration and LLM-Generated Nudges for Diverse News Recommendation

该研究通过在 POPROX 平台上对 120 名美国用户进行为期 5 周的实地实验，验证了结合“主题 - 地域双重校准”算法与基于大语言模型的个性化呈现“助推”策略能有效提升新闻推荐的多样性，并促使读者逐渐养成兼顾国内与国际新闻的阅读习惯。

Ruixuan Sun, Matthew Zent, Minzhu Zhao, Thanmayee Boyapati, Xinyi Li, Joseph A. Konstan2026-03-09🤖 cs.AI

cs.AI

Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning

Reasoning Models Struggle to Control their Chains of Thought

The Rise of AI in Weather and Climate Information and its Impact on Global Inequality

Cultural Perspectives and Expectations for Generative AI: A Global Survey Approach

LTLGuard: Formalizing LTL Specifications with Compact Language Models and Lightweight Symbolic Reasoning

Revisiting the (Sub)Optimality of Best-of-N for Inference-Time Alignment

TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks

Bridging Domains through Subspace-Aware Model Merging

Depth Charge: Jailbreak Large Language Models from Deep Safety Attention Heads

Knowing without Acting: The Disentangled Geometry of Safety Mechanisms in Large Language Models

PVminerLLM: Structured Extraction of Patient Voice from Patient-Generated Text using Large Language Models

Balancing Domestic and Global Perspectives: Evaluating Dual-Calibration and LLM-Generated Nudges for Diverse News Recommendation

Visual Words Meet BM25: Sparse Auto-Encoder Visual Word Scoring for Image Retrieval

Proof-of-Guardrail in AI Agents and What (Not) to Trust from It

StreamWise: Serving Multi-Modal Generation in Real-Time at Scale

Ambiguity Collapse by LLMs: A Taxonomy of Epistemic Risks

Margin and Consistency Supervision for Calibrated and Robust Vision Models

Lexara: A User-Centered Toolkit for Evaluating Large Language Models for Conversational Visual Analytics

Evaluating LLM Alignment With Human Trust Models

Remote Sensing Image Classification Using Deep Ensemble Learning