cs.LG Papers | Gist.Science

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

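This paper proposes a novel algorithm based on offline data that constructs a high-order Markov decision process and applies survival analysis techniques to address dynamic inventory and pricing problems in which demand is both dependent and censored, yielding an estimate of the optimal policy that maximizes long-run profit.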

Korel Gundem, Zhengling Qi | 2026-03-12 | 📊 stat

Score Matching Diffusion Based Feedback Control and Planning of Nonlinear Systems

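This paper proposes a deterministic feedback control framework for nonlinear systems based on score-matching diffusion: a forward diffusion explores the state space, and a reverse denoising law is designed to drive the system's probability density toward a target distribution, providing a reliable density control and planning method for driftless and linear time-invariant systems.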

Karthik Elamvazhuthi, Darshan Gadginmath, Fabio Pasqualetti | 2026-03-12 | ⚡ eess

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

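This paper proposes SwitchMT, a new method that combines a deep spiking Q-network featuring active dendrites and a dueling structure with an adaptive task-switching policy based on rewards and the network's internal dynamics. It effectively addresses task interference in multi-task reinforcement learning for resource-constrained autonomous agents, achieving scalable and efficient multi-task learning without increasing network complexity.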

Rachmad Vidya Wicaksana Putra, Avaneesh Devkota, Muhammad Shafique | 2026-03-12 | 🤖 cs.AI

cs.LG

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

Score Matching Diffusion Based Feedback Control and Planning of Nonlinear Systems

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

Panda: A pretrained forecast model for chaotic dynamics

LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models

Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments

CARTGen-IR: Synthetic Tabular Data Generation for Imbalanced Regression

Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions

Sequential-Parallel Duality in Prefix Scannable Models

Differential Privacy in Machine Learning: A Survey from Symbolic AI to LLMs

Silhouette-Driven Instance-Weighted $k$-means

The Yokai Learning Environment: Tracking Beliefs Over Space and Time

Order Optimal Regret Bounds for Sharpe Ratio Optimization under Thompson Sampling

Universal Dynamics with Globally Controlled Analog Quantum Simulators

Tensor Train Completion from Fiberwise Observations Along a Single Mode

Zero-Shot Transferable Solution Method for Parametric Optimal Control Problems

Global Minimizers of Sigmoid Contrastive Loss

Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging Spectroscopy

Multi-modal Data Spectrum: Multi-modal Datasets are Multi-dimensional
