cs.LG 件の論文 | Gist.Science

In-Run Data Shapley for Adam Optimizer

この論文は、従来の SGD ベースの手法では Adam 最適化器の複雑な動的挙動を捉えられないという課題を解決するため、固定状態仮説に基づく閉形式近似と「線形化ゴースト近似」を導入し、Adam 最適化器に対応した高速かつ高精度なデータ寄与度評価手法「Adam-Aware In-Run Data Shapley」を提案するものである。

Meng Ding, Zeqing Zhang, Di Wang, Lijie Hu2026-03-10🤖 cs.LG

Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration

シュワルツの高次価値カテゴリーは、単一の文から人間の価値を検出するタスクにおいて、厳密な階層的ゲートリングやスタンドアロンのコンパクト LLM としてよりも、閾値調整やアンサンブルによる校正、あるいは帰納的バイアスとして活用する方が、限られた計算資源下でより効果的であることが示されました。

Víctor Yeste, Paolo Rosso2026-03-10🤖 cs.LG

← 前へ次へ →

cs.LG

In-Run Data Shapley for Adam Optimizer

Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration

LatentMem: Customizing Latent Memory for Multi-Agent Systems

Thickening-to-Thinning: Reward Shaping via Human-Inspired Learning Dynamics for LLM Reasoning

Inference-Time Backdoors via Hidden Instructions in LLM Chat Templates

Hinge Regression Tree: A Newton Method for Oblique Regression Tree Splitting

Radial Müntz-Szász Networks: Neural Architectures with Learnable Power Bases for Multidimensional Singularities

SDFed: Bridging Local Global Discrepancy via Subspace Refinement and Divergence Control in Federated Prompt Learning

Retrieval Pivot Attacks in Hybrid RAG: Measuring and Mitigating Amplified Leakage from Vector Seeds to Graph Expansion

Diffusion-Guided Pretraining for Brain Graph Foundation Models

Learning Page Order in Shuffled WOO Releases

Discovering Semantic Latent Structures in Psychological Scales: A Response-Free Pathway to Efficient Simplification

TrasMuon: Trust-Region Adaptive Scaling for Orthogonalized Momentum Optimizers

Benchmark Leakage Trap: Can We Trust LLM-based Recommendation?

Mean Flow Policy with Instantaneous Velocity Constraint for One-step Action Generation

Pawsterior: Variational Flow Matching for Structured Simulation-Based Inference

Why Code, Why Now: Learnability, Computability, and the Real Limits of Machine Learning

LongAudio-RAG: Event-Grounded Question Answering over Multi-Hour Long Audio

Accelerated Predictive Coding Networks via Direct Kolen-Pollack Feedback Alignment

On the Power of Source Screening for Learning Shared Feature Extractors