cs.AI Papers | Gist.Science

Enhancing Instruction Following of LLMs via Activation Steering with Dynamic Rejection

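This paper proposes DIRECTER, a new method that dynamically modulates activation steering strength by combining attention sensitivity analysis with a plausibility-based decoding loop. It mitigates oversteering in large language models without requiring additional datasets, and markedly improves instruction following without sacrificing generation quality.

Minjae Kang, Jaehyung Kim | 2026-03-10 | 🤖 cs.LG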

ButterflyViT: 354 $\times$ Expert Compression for Edge Vision Transformers

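This paper presents ButterflyViT, which addresses the linear memory scaling bottleneck by treating experts as geometric reorientations of a shared quantized substrate and by introducing spatial smoothness regularization. On tasks such as CIFAR-100 it achieves 354-fold memory compression in a 64-expert configuration with negligible accuracy loss, enabling sparse mixture-of-experts vision Transformers to be deployed on edge devices.

Aryan Karmore | 2026-03-10 | 💻 cs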

Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment

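This paper proposes the ProtAlign framework, which fine-tunes pretrained inverse folding models with a multi-objective preference alignment strategy. It balances multiple developability properties in protein sequence design, such as solubility and thermostability, while preserving structural designability.

Xiaoyang Hou, Junqi Liu, Chence Shi, Xin Liu, Zhi Yang, Jian Tang | 2026-03-10 | 🤖 cs.LG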

Robotic Foundation Models for Industrial Control: A Comprehensive Survey and Readiness Assessment Framework

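This paper comprehensively surveys the industrial applicability of robotic foundation models (RFMs) and proposes an assessment framework comprising 149 concrete criteria. A large-scale evaluation finds that the maturity of current RFMs in the industrial domain is limited and uneven. The paper stresses that future progress should depend on systematically incorporating safety, real-time capability, robust perception, and system integration into auditable deployment stacks.

David Kube, Simon Hadwiger, Tobias Meisen | 2026-03-10 | 💻 cs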

XMACNet: An Explainable Lightweight Attention based CNN with Multi Modal Fusion for Chili Disease Classification

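This paper proposes XMACNet, an explainable lightweight attention-based CNN. By fusing visible-light images with vegetation indices and introducing StyleGAN data augmentation, it achieves high accuracy, strong interpretability, and edge deployability on the chili disease classification task.

Tapon Kumer Ray, Rajkumar Y, Shalini R, Srigayathri K, Jayashree S, Lokeswari P | 2026-03-10 | 💻 cs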

Learning Unbiased Cluster Descriptors for Interpretable Imbalanced Concept Drift Detection

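This paper proposes ICD3, an unbiased cluster descriptor method. It identifies imbalanced concepts through a multi-distribution-granularity search and trains a separate one-class classifier for each, overcoming the "masking effect" that dominant large clusters exert on drift in small minority concepts. The result is interpretable and robust detection of imbalanced concept drift.

Yiqun Zhang, Zhanpei Huang, Mingjie Zhao, Chuyao Zhang, Yang Lu, Yuzhu Ji, Fangqing Gu, An Zeng | 2026-03-10 | 🤖 cs.LG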

Enhancing SHAP Explainability for Diagnostic and Prognostic ML Models in Alzheimer Disease

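This paper proposes a multi-level explainability framework. By integrating multiple metrics on the NACC dataset, it verifies that SHAP explanations for Alzheimer's disease diagnostic and prognostic models are highly consistent and stable across tasks, disease stages, and model architectures, which strengthens their reliability for clinical use.

Pablo Guillén, Enrique Frias-Martinez | 2026-03-10 | 🤖 cs.LG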

Gradient-based Nested Co-Design of Aerodynamic Shape and Control for Winged Robots

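This paper proposes a gradient-based nested co-design framework that combines an optimal control planner with a neural surrogate aerodynamic model to jointly optimize the aerodynamic shape and control strategy of a fixed-wing glider. It improves performance on complex dynamic tasks such as perching and short landing while significantly reducing computation time.

Daniele Affinita, Mingda Xu, Benoît Valentin Gherardi, Pascal Fua | 2026-03-10 | 💻 cs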

Diversity-Aware Adaptive Collocation for Physics-Informed Neural Networks via Sparse QUBO Optimization and Hybrid Coresets

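This paper proposes a diversity-aware adaptive collocation method based on sparse QUBO optimization and hybrid coreset construction. By selecting from a candidate pool a set of points that is both highly informative and low in redundancy, it addresses the training efficiency and accuracy bottlenecks of physics-informed neural networks (PINNs).

Hadi Salloum, Maximilian Mifsud Bonici, Sinan Ibrahim, Pavel Osinenko, Alexei Kornaev | 2026-03-10 | 🤖 cs.LG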

Failure Detection in Chemical Processes using Symbolic Machine Learning: A Case Study on Ethylene Oxidation

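This paper proposes a failure prediction method based on symbolic machine learning. Using data generated by a chemical process simulator, it shows in an ethylene oxidation case study that the method outperforms baselines such as random forests and multilayer perceptrons while keeping the model interpretable, and it discusses the method's potential for supporting the decisions of chemical plant operators.

Julien Amblard, Niklas Groll, Matthew Tait, Mark Law, Gürkan Sin, Alessandra Russo | 2026-03-10 | 🤖 cs.LG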

HGT-Scheduler: Deep Reinforcement Learning for the Job Shop Scheduling Problem via Heterogeneous Graph Transformers

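This paper proposes HGT-Scheduler, a reinforcement learning scheduling framework based on heterogeneous graph Transformers. It models the job shop scheduling problem as a heterogeneous graph and uses an edge-type-aware attention mechanism to capture distinct relational semantics, significantly improving scheduling policy performance on the Fisher-Thompson benchmarks.

Bulent Soykan | 2026-03-10 | 🤖 cs.LG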

SpatialMAGIC: A Hybrid Framework Integrating Graph Diffusion and Spatial Attention for Spatial Transcriptomics Imputation

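SpatialMAGIC is a hybrid framework that combines graph diffusion with spatial self-attention to address the high sparsity and technical noise of spatial transcriptomics data. By recovering missing expression values while preserving spatial consistency, it significantly outperforms existing baseline methods in clustering accuracy and downstream biological analysis.

Sayeem Bin Zaman, Fahim Hafiz, Riasat Azim | 2026-03-10 | 🤖 cs.LG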

xaitimesynth: A Python Package for Evaluating Attribution Methods for Time Series with Synthetic Ground Truth

Physics-Informed Diffusion Model for Generating Synthetic Extreme Rare Weather Events Data

Optimistic Policy Regularization

Best-of-Tails: Bridging Optimism and Pessimism in Inference-Time Alignment

Breaking the Martingale Curse: Multi-Agent Debate via Asymmetric Cognitive Potential Energy

A Hybrid Machine Learning Model for Cerebral Palsy Detection

Making AI Evaluation Deployment Relevant Through Context Specification

Reinforcing the World's Edge: A Continual Learning Problem in the Multi-Agent-World Boundary
