cs.LG 편의 논문 | Gist.Science

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

이 논문은 CityLearn 환경을 활용하여 도시 에너지 관리에 대한 다중 에이전트 강화학습 (MARL) 알고리즘을 다양한 핵심 성과 지표 (KPI) 로 평가하고, 분산 훈련이 중앙 집중식 훈련보다 우수하며 시간적 의존성 학습이 배터리 수명 등 지속 가능성 지표 향상에 기여함을 입증했습니다.

Aymen Khouja, Imen Jendoubi, Oumayma Mahjoub, Oussama Mahfoudhi, Ruan De Kock, Siddarth Singh, Claude Formanek2026-03-10🤖 cs.LG

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

이 논문은 분산형 거래소의 집중 유동성 공급 문제를 최적 제어 문제로 정식화하고, Ornstein-Uhlenbeck 과정의 평균 회귀 속도를 활용한 딥 강화 학습 기법인 RAmmStein 을 제안하여, 불필요한 재조정 비용을 줄이면서도 자본 효율성을 극대화하는 지능형 유동성 관리 전략을 입증했습니다.

Pranay Anchuri2026-03-10🤖 cs.LG

← 이전 다음 →

cs.LG

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

MrBERT: Modern Multilingual Encoders via Vocabulary, Domain, and Dimensional Adaptation

Autoregressive Visual Decoding from EEG Signals

CeRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion

Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

End-to-end Differentiable Calibration and Reconstruction for Optical Particle Detectors

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

The Partition Principle Revisited: Non-Equal Volume Designs Achieve Minimal Expected Star Discrepancy

How Well Do Multimodal Models Reason on ECG Signals?

Opponent State Inference Under Partial Observability: An HMM-POMDP Framework for 2026 Formula 1 Energy Strategy

TCG CREST System Description for the DISPLACE-M Challenge

A Detection-Gated Pipeline for Robust Glottal Area Waveform Extraction and Clinical Pathology Assessment

Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta

Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

Embedding interpretable $\ell_1$ -regression into neural networks for uncovering temporal structure in cell imaging

Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails

cs.LG

Characterizing MARL for Energy Control: A Multi-KPI Benchmark on the CityLearn Environment

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

MrBERT: Modern Multilingual Encoders via Vocabulary, Domain, and Dimensional Adaptation

Autoregressive Visual Decoding from EEG Signals

CeRA: Breaking the Linear Ceiling of Low-Rank Adaptation via Manifold Expansion

Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

End-to-end Differentiable Calibration and Reconstruction for Optical Particle Detectors

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

The Partition Principle Revisited: Non-Equal Volume Designs Achieve Minimal Expected Star Discrepancy

How Well Do Multimodal Models Reason on ECG Signals?

Opponent State Inference Under Partial Observability: An HMM-POMDP Framework for 2026 Formula 1 Energy Strategy

TCG CREST System Description for the DISPLACE-M Challenge

A Detection-Gated Pipeline for Robust Glottal Area Waveform Extraction and Clinical Pathology Assessment

Leveraging Model Soups to Classify Intangible Cultural Heritage Images from the Mekong Delta

Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

Embedding interpretable ℓ1\ell_1ℓ1​-regression into neural networks for uncovering temporal structure in cell imaging

Interpretable Motion-Attentive Maps: Spatio-Temporally Localizing Concepts in Video Diffusion Transformers

CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails

Embedding interpretable $\ell_1$ -regression into neural networks for uncovering temporal structure in cell imaging