cs.LG Papers | Gist.Science

Are Expressive Encoders Necessary for Discrete Graph Generation?

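This paper proposes GenGNN, a modular message-passing framework, and shows that discrete graph generation does not require highly expressive architectures such as Transformers. Using GenGNN alone as the diffusion-model backbone matches the validity of graph Transformers (above 90% on the tree and planar graph datasets, and 99.49% for molecule generation) while delivering a 2x to 5x inference speedup.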

Jay Revolinsky, Harry Shomer, Jiliang Tang | Wed, 11 Ma | cs.AI

MASEval: Extending Multi-Agent Evaluation from Models to Systems

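This paper proposes the MASEval framework to close a gap in existing benchmarks, which evaluate only models and ignore system implementation (such as topology and orchestration logic). Its system-level evaluation shows that the choice of framework affects multi-agent system performance as much as the choice of model.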

Cornelius Emde, Alexander Rubinstein, Anmol Goel, Ahmed Heakl, Sangdoo Yun, Seong Joon Oh, Martin Gubri | Wed, 11 Ma | cs.AI

Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models

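Through theoretical proofs and experimental validation, this paper shows that hybrid sequence models (combining Transformers with state space models) can solve certain synthetic tasks at the same performance with far fewer parameters and far less memory than pure Transformers or pure state space models. They also exhibit better length generalization and out-of-distribution robustness.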

John Cooper, Ilias Diakonikolas, Mingchen Ma, Frederic Sala | Wed, 11 Ma | cs.LG

APPLV: Adaptive Planner Parameter Learning from Vision-Language-Action Model

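This paper proposes APPLV, which uses a pretrained vision-language model to predict the parameters of a classical planner and combines supervised and reinforcement learning strategies. The method addresses the navigation safety, precise control, and generalization challenges that mobile robots face in highly constrained environments.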

Yuanjie Lu, Beichen Wang, Zhengqi Wu, Yang Li, Xiaomin Lin, Chengzhi Mao, Xuesu Xiao | Wed, 11 Ma | cs.LG

Why Channel-Centric Models are not Enough to Predict End-to-End Performance in Private 5G: A Measurement Campaign and Case Study

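Through measurements in a private 5G environment, this study shows that models relying only on channel-level metrics (such as signal strength) systematically overestimate end-to-end throughput because they overestimate the number of MIMO spatial layers, whereas a Gaussian process model learned directly from measured data significantly reduces prediction error. The result indicates that communication-aware planning needs either a data-driven approach or a finely calibrated link-layer model to predict system performance accurately.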

Nils Jörgensen | Wed, 11 Ma | cs.LG

A New Modeling to Feature Selection Based on the Fuzzy Rough Set Theory in Normal and Optimistic States on Hybrid Information Systems

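To address the low computational efficiency and susceptibility to noise of fuzzy rough set theory in high-dimensional spaces on hybrid information systems, this paper proposes a new feature selection model named FSbuHD. The model builds a fuzzy equivalence relation by computing a combined distance between objects, recasts feature selection as an optimization problem, and is shown experimentally to be efficient and superior in both the normal and optimistic modes.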

Mohammad Hossein Safarpour, Seyed Mohammad Alavi, Mohammad Izadikhah, Hossein Dibachi | Wed, 11 Ma | cs.AI

Cross-Domain Uncertainty Quantification for Selective Prediction: A Comprehensive Bound Ablation with Transfer-Informed Betting

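This paper proposes a new method called Transfer-Informed Betting (TIB), which combines a warm start from cross-domain risk distributions with betting-based confidence sequences to significantly improve selective-prediction coverage in data-scarce settings. It also systematically evaluates nine classes of finite-sample bounds across multiple benchmarks.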

Abhinaba Basu | Wed, 11 Ma | cs.AI

FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID Data

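This paper proposes FedLECC, a client selection strategy for federated learning under non-IID data that combines label-distribution clustering with local-loss guidance, improving convergence speed and test accuracy while significantly reducing communication overhead.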

Daniel M. Jimenez-Gutierrez, Giovanni Giunta, Mehrdad Hassanzadeh, Aris Anagnostopoulos, Ioannis Chatzigiannakis, Andrea Vitaletti | Wed, 11 Ma | cs.AI

Quantifying Memorization and Privacy Risks in Genomic Language Models

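This paper proposes a multi-vector privacy evaluation framework that integrates perplexity-based detection, canary sequence extraction, and membership inference. It systematically quantifies memorization risk in genomic language models across architectures and training conditions, exposes the limitations of any single attack, and stresses the need for multi-vector auditing.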

Alexander Nemecek, Wenbiao Li, Xiaoqian Jiang, Jaideep Vaidya, Erman Ayday | Wed, 11 Ma | cs.LG

Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates

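This paper proposes a fully differentiable optimization method based on continuously relaxed Bernoulli gates for efficiently discovering strong lottery ticket subnetworks while keeping the network weights frozen. Across multiple architectures it achieves higher sparsity than existing methods with almost no loss in accuracy.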

Itamar Tsayag, Ofir Lindenbaum | Wed, 11 Ma | cs.AI

Vision-Language Models Encode Clinical Guidelines for Concept-Based Medical Reasoning

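This paper proposes the MedCBR framework, which incorporates clinical guidelines into vision-language models and concept-based reasoning, improving end-to-end interpretability from medical image analysis to guideline-conformant, expert-level diagnostic reasoning.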

Mohamed Harmanani, Bining Long, Zhuoxin Guo, Paul F. R. Wilson, Amirhossein Sabour, Minh Nguyen Nhat To, Gabor Fichtinger, Purang Abolmaesumi, Parvin Mousavi | Wed, 11 Ma | cs.LG

Optimizing Reinforcement Learning Training over Digital Twin Enabled Multi-fidelity Networks

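This paper proposes a hierarchical reinforcement learning framework built on digital-twin-enabled multi-fidelity networks. By jointly optimizing the antenna tilt adjustment policy and the ratio of data collected from the physical and virtual networks, it maximizes user data rate under a latency constraint and significantly reduces the data collection delay on the physical network.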

Hanzhi Yu, Hasan Farooq, Julien Forgeat, Shruti Bothe, Kristijonas Cyras, Md Moin Uddin Chowdhury, Mingzhe Chen | Wed, 11 Ma | cs.LG

Interpretable Markov-Based Spatiotemporal Risk Surfaces for Missing-Child Search Planning with Reinforcement Learning and LLM-Based Quality Assurance

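This paper presents Guardian, an end-to-end decision support system with a three-layer architecture combining interpretable Markov chains, reinforcement learning, and large language model quality validation. It turns unstructured case data into spatiotemporal risk predictions and optimized plans for missing-child searches.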

Joshua Castillo, Ravi Mukkamala | Wed, 11 Ma | cs.AI

cs.LG

BiCLIP: Domain Canonicalization via Structured Geometric Transformation

Kernel Debiased Plug-in Estimation based on the Universal Least Favorable Submodel

Towards Reliable Simulation-based Inference

A Consensus-Driven Multi-LLM Pipeline for Missing-Person Investigations

A Survey of Reinforcement Learning For Economics

The $qs$ Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference

Semantic Level of Detail: Multi-Scale Knowledge Representation via Heat Kernel Diffusion on Hyperbolic Manifolds
