cs.LG 件の論文 | Gist.Science

Entropy-Preserving Reinforcement Learning

この論文は、方策勾配アルゴリズムが訓練中にエントロピー（探索の多様性）を自然に減少させる問題を指摘し、REPO や ADAPO といったエントロピー制御メカニズムを導入することで、多様性を維持しつつ性能と継続学習能力を向上させる手法を提案しています。

Aleksei Petrenko, Ben Lipkin, Kevin Chen, Erik Wijmans, Marco Cusumano-Towner, Raja Giryes, Philipp Krähenbühl2026-03-13🤖 cs.LG

Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2

この論文は、FastSpeech2 構造に明示的な感情条件付けと因果推論に基づく対照的訓練（間接経路制約と対照的プロソディ制約）を導入し、言語内容から感情プロソディを分離することで、自然さを損なわずに制御可能な感情表現とプロソディ編集を実現する新しい音声合成フレームワークを提案しています。

Suvendu Sekhar Mohanty2026-03-13🤖 cs.AI

Decomposing Observational Multiplicity in Decision Trees: Leaf and Structural Regret

本論文は、決定木分類器における観測的多様性を「葉の後悔」と「構造の後悔」に分解する理論的枠組みを提示し、構造の後悔が主要な要因であることを実証するとともに、これらを棄却メカニズムとして活用することでモデルの安全性を向上させる手法を提案しています。

Mustafa Cavus2026-03-13📊 stat

EvoFlows: Evolutionary Edit-Based Flow-Matching for Protein Engineering

この論文は、テンプレート配列に対して挿入・削除・置換を制御可能な回数で予測する「EvoFlows」という変異フローに基づく手法を提案し、既存の言語モデルと同等の分布再現性を保ちつつ、より自然で非自明なタンパク質変異体を生成できることを示しています。

Nicolas Deutschmann, Constance Ferragu, Jonathan D. Ziegler, Shayan Aziznejad, Eli Bixby2026-03-13🤖 cs.LG

Cross-Resolution Attention Network for High-Resolution PM2.5 Prediction

この論文は、超高分解能かつ広域な大気汚染予測の課題を解決するため、メタデータ入力に代わって物理的制約を組み込んだクロス解像度アテンション機構を備えた双枝ビジョントランスフォーマー「CRAN-PM」を提案し、単一 GPU で 1.8 秒以内にヨーロッパ全域の PM2.5 分布を生成しながら、従来の手法よりも予測精度を大幅に向上させることを示しています。

Ammar Kheder, Helmi Toropainen, Wenqing Peng, Samuel Antão, Zhi-Song Liu, Michael Boy2026-03-13🤖 cs.LG

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

本論文は、電力網などの複雑な時系列データにおけるノイズや異常値を、異常検知と欠損値補完を単一の条件付き正規化フローに基づく確率的枠組み「CINDI」で統合的に処理し、データの物理的・統計的性質を維持しながら下流タスクの信頼性を向上させる手法を提案しています。

David Baumgartner, Helge Langseth, Heri Ramampiaro2026-03-13🤖 cs.AI

Mitigating the Multiplicity Burden: The Role of Calibration in Reducing Predictive Multiplicity of Classifiers

この論文は、クレジットリスク予測における予測多重性の負担を軽減するため、特にプラットスケーリングやアイソトニック回帰などの事後校正手法が、ラシュモネ集合内のモデル間不一致を低減し手続き的公平性を支援し得ることを示しています。

Mustafa Cavus2026-03-13🤖 cs.LG

Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows

この論文は、観測空間の尤度ではなく、条件付き正規化フローの潜在空間に時系列の構造的な帰納的バイアスを導入し、規定された時間的ダイナミクスからの逸脱を統計的適合度検定で評価することで、高尤度領域でも有効な異常検出を実現する手法を提案しています。

David Baumgartner, Eliezer de Souza da Silva, Iñigo Urteaga2026-03-13🤖 cs.AI

Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach

この論文は、報酬の知識がない他者の行動のみを観測する社会型バンディット学習において、オラクルや社会的規範に依存せずに他者の専門性を推定し、直接経験と他者の推定ポリシーを統合する自由エネルギーベースのアルゴリズムを提案し、その最適ポリシーへの収束性と、非専門家を含む多様なエージェント集団における学習性能の向上を実証的に示したものである。

Erfan Mirzaei, Seyed Pooya Shariatpanahi, Alireza Tavakoli, Reshad Hosseini, Majid Nili Ahmadabadi2026-03-13📊 stat

Modeling Trial-and-Error Navigation With a Sequential Decision Model of Information Scent

この論文は、ユーザーが記憶制約と時間制約の下で戦略的にリンクを探索する逐次意思決定プロセスとして情報香りを再定義し、モデルが実際のナビゲーションにおける誤選択やバックトラックを含む試行錯誤行動を再現できることを示しています。

Xiaofu Jin, Yunpeng Bai, Antti Oulasvirta2026-03-13🤖 cs.LG

A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for $m$ -Set Semi-Bandit Problem

本論文は、 $m$ -集合半バンドット問題において、フリーチェ分布やパレート分布を用いたフォロウ・ザ・パーターブド・リーダー（FTPL）アルゴリズムが、敵対的設定で最適な regret 境界 $O(\sqrt{mdT})$ を達成し、確率的設定でも対数 regret を達成する「両方の世界における最適性」を有することを示すと同時に、条件付き幾何学的リサンプリングを拡張することで計算複雑性を $O(md(\log(d/m)+1))$ まで削減する効率的なアルゴリズムを提案しています。

Botao Chen, Jongyeong Lee, Chansoo Kim, Junya Honda2026-03-13📊 stat

Language Generation with Replay: A Learning-Theoretic View of Model Collapse

本論文は、学習理論の枠組みを用いて、生成モデルの出力が学習データに再流入する「リプレイ」が、生成の強弱の定義によってモデル崩壊を誘発する条件を理論的に解明し、実務で用いられるデータクリーニングなどの手法の有効性と限界を明らかにしたものである。

Giorgio Racca, Michal Valko, Amartya Sanyal2026-03-13📊 stat

Disentangled Representation Learning through Unsupervised Symmetry Group Discovery

この論文は、事前の対称性群の構造に関する知識を必要とせず、環境との相互作用を通じて自律的に群構造を発見し、その識別可能性を証明するとともに、既存手法を上回る線形対称性に基づく解離表現学習アルゴリズムを提案するものである。

Dang-Nhu Barthélémy, Annabi Louis, Argentieri Sylvain2026-03-13🤖 cs.LG

Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA

この論文は、LiRA、RMIA、BASE といった既存のメンバーシップ推論攻撃を指数分布族の対数尤度比フレームワークで統一的に解釈し、分散推定を改善したベイズ推論攻撃 BaVarIA を提案することで、特にシャドウモデル数が限られる実用的な環境において、より安定した攻撃性能を実現することを示しています。

Rickard Brännvall2026-03-13🤖 cs.LG

OSM-based Domain Adaptation for Remote Sensing VLMs

この論文は、大規模な教師モデルや手動アノテーションに依存せず、OpenStreetMap のメタデータと OCR 機能を活用して自己完結型のドメイン適応フレームワーク「OSMDA」を提案し、リモートセンシング用 VLM の性能向上とスケーラビリティを実現したことを述べています。

Stefan Maria Ailuro (INSAIT, Sofia University "St. Kliment Ohridski"), Mario Markov (INSAIT, Sofia University "St. Kliment Ohridski"), Mohammad Mahdi (INSAIT, Sofia University "St. Kliment Ohridski"), Delyan Boychev (INSAIT, Sofia University "St. Kliment Ohridski"), Luc Van Gool (INSAIT, Sofia University "St. Kliment Ohridski"), Danda Pani Paudel (INSAIT, Sofia University "St. Kliment Ohridski")2026-03-13🤖 cs.LG

← 前へ次へ →

cs.LG

Entropy-Preserving Reinforcement Learning

Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2

Decomposing Observational Multiplicity in Decision Trees: Leaf and Structural Regret

EvoFlows: Evolutionary Edit-Based Flow-Matching for Protein Engineering

Cross-Resolution Attention Network for High-Resolution PM2.5 Prediction

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

Mitigating the Multiplicity Burden: The Role of Calibration in Reducing Predictive Multiplicity of Classifiers

Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows

Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach

Modeling Trial-and-Error Navigation With a Sequential Decision Model of Information Scent

A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for $m$ -Set Semi-Bandit Problem

Language Generation with Replay: A Learning-Theoretic View of Model Collapse

Disentangled Representation Learning through Unsupervised Symmetry Group Discovery

Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA

OSM-based Domain Adaptation for Remote Sensing VLMs

Hypercomplex Widely Linear Processing: Fundamentals for Quaternion Machine Learning

Inverse Neural Operator for ODE Parameter Optimization

Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data

On the Role of Reversible Instance Normalization

FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning

cs.LG

Entropy-Preserving Reinforcement Learning

Causal Prosody Mediation for Text-to-Speech:Counterfactual Training of Duration, Pitch, and Energy in FastSpeech2

Decomposing Observational Multiplicity in Decision Trees: Leaf and Structural Regret

EvoFlows: Evolutionary Edit-Based Flow-Matching for Protein Engineering

Cross-Resolution Attention Network for High-Resolution PM2.5 Prediction

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

Mitigating the Multiplicity Burden: The Role of Calibration in Reducing Predictive Multiplicity of Classifiers

Anomaly detection in time-series via inductive biases in the latent space of conditional normalizing flows

Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach

Modeling Trial-and-Error Navigation With a Sequential Decision Model of Information Scent

A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for mmm-Set Semi-Bandit Problem

Language Generation with Replay: A Learning-Theoretic View of Model Collapse

Disentangled Representation Learning through Unsupervised Symmetry Group Discovery

Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA

OSM-based Domain Adaptation for Remote Sensing VLMs

Hypercomplex Widely Linear Processing: Fundamentals for Quaternion Machine Learning

Inverse Neural Operator for ODE Parameter Optimization

Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data

On the Role of Reversible Instance Normalization

FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning

A Further Efficient Algorithm with Best-of-Both-Worlds Guarantees for $m$ -Set Semi-Bandit Problem