cs.LG papers | Gist.Science

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

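This paper uses the CityLearn environment to evaluate multi-agent reinforcement learning (MARL) for urban energy control across multiple KPIs. It shows that decentralized training with decentralized execution (DTDE) outperforms centralized training with decentralized execution (CTDE), and that learning temporal dependencies helps improve sustainability metrics such as battery lifespan.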

Aymen Khouja, Imen Jendoubi, Oumayma Mahjoub, Oussama Mahfoudhi, Ruan De Kock, Siddarth Singh, Claude Formanek | 2026-03-10 | cs.LG

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

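This paper formulates an optimal impulse control problem in mean-reverting markets so that liquidity providers on decentralized exchanges can optimize the trade-off between fee revenue and rebalancing costs. It proposes "RAmmStein", a method based on deep reinforcement learning, and shows that it substantially improves capital efficiency while suppressing excessive rebalancing.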

Pranay Anchuri | 2026-03-10 | cs.LG


cs.LG

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

MrBERT: Modern Multilingual Encoders via Vocabulary, Domain, and Dimensional Adaptation

Autoregressive Visual Decoding from EEG Signals

CeRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion

Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

End-to-end Differentiable Calibration and Reconstruction for Optical Particle Detectors

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

The Partition Principle Revisited: Non-Equal Volume Designs Achieve Minimal Expected Star Discrepancy

How Well Do Multimodal Models Reason on ECG Signals?

Opponent State Inference Under Partial Observability: An HMM-POMDP Framework for 2026 Formula 1 Energy Strategy

TCG CREST System Description for the DISPLACE-M Challenge

A Detection-Gated Pipeline for Robust Glottal Area Waveform Extraction and Clinical Pathology Assessment

Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta

Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

Embedding interpretable $\ell_1$-regression into neural networks for uncovering temporal structure in cell imaging

Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails
