cs.LG papers | Gist.Science

Generative Models in Decision Making: A Survey

This survey proposes a principled, function-centric taxonomy grounded in Control as Inference to unify generative models in decision making into four distinct roles—Controllers, Modelers, Optimizers, and Evaluators—while critically analyzing their applications in high-stakes domains and outlining challenges for developing Generalist Physical Intelligence.

Xinyu Shao, Jianping Zhang, Haozhi Wang + 9 more2026-03-06💻 cs

BACE-RUL: A Bi-directional Adversarial Network with Covariate Encoding for Machine Remaining Useful Life Prediction

This paper introduces BACE-RUL, a bi-directional adversarial network with covariate encoding that predicts machine Remaining Useful Life using only current sensor measurements to overcome the limitations of prior knowledge and temporal mining, demonstrating superior performance over state-of-the-art methods on real-world datasets.

Zekai Zhang, Dan Li, Shunyu Wu + 4 more2026-03-06💻 cs

Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

This paper identifies the "safety mirage" in Vision-Language Models, where supervised fine-tuning creates spurious correlations that leave models vulnerable to simple attacks and prone to over-refusal, and proposes machine unlearning as a superior alignment strategy that significantly reduces attack success rates and unnecessary rejections while preserving general capabilities.

Yiwei Chen, Yuguang Yao, Yihua Zhang + 3 more2026-03-06💻 cs

Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

This paper introduces a large-scale, mutation-based evaluation framework to assess the robustness of Large Language Models in fault localization, revealing that their reasoning is often brittle and reliant on syntactic cues rather than deep semantic understanding, as evidenced by a 78% failure rate when subjected to semantic-preserving code changes.

Sabaat Haroon, Ahmad Faraz Khan, Ahmad Humayun + 5 more2026-03-06💻 cs

ms-Mamba: Multi-scale Mamba for Time-Series Forecasting

This paper introduces ms-Mamba, a novel multi-scale architecture that employs Mamba blocks with varying sampling rates to capture temporal information at different scales, achieving state-of-the-art forecasting performance with greater efficiency than existing Transformer and Mamba-based models.

Yusuf Meric Karadag, Ismail Talaz, Ipek Gursel Dino + 1 more2026-03-06💻 cs

TianQuan-S2S: A Subseasonal-to-Seasonal Global Weather Model via Incorporate Climatology State

The paper introduces TianQuan-S2S, a novel global subseasonal-to-seasonal weather forecasting model that integrates climatological states into patch embeddings and utilizes an uncertainty-augmented Transformer to overcome the limitations of over-smoothing and inadequate climate representation, thereby outperforming both traditional numerical methods and advanced data-driven models in deterministic and ensemble forecasting.

Guowen Li, Xintong Liu, Yang Liu + 11 more2026-03-06💻 cs

Noise2Ghost: Self-supervised deep convolutional reconstruction for ghost imaging

The paper introduces Noise2Ghost, a self-supervised deep learning method that achieves superior noise reduction and reconstruction quality in ghost imaging without requiring clean reference data, thereby enabling high-quality imaging in low-light scenarios such as dose-sensitive x-ray fluorescence and biological studies.

Mathieu Manni, Dmitry Karpov, K. Joost Batenburg + 2 more2026-03-06🔬 physics

Differentially Private and Scalable Estimation of the Network Principal Component

This paper proposes a novel, instance-specific Differentially Private framework based on the Propose-Test-Release mechanism that enables scalable and accurate estimation of network principal components on large real-world graphs, achieving a 180-fold runtime improvement over existing baselines while also providing the first DP solution for the Densest- $k$ -subgraph problem.

Alireza Khayatian, Anil Vullikanti, Aritra Konar2026-03-06💻 cs

Variational Formulation of Particle Flow

This paper presents a variational inference formulation of log-homotopy particle flow as a Fisher-Rao gradient flow, deriving Gaussian and Gaussian mixture approximations that recover the Exact Daum and Huang flow under linear Gaussian assumptions while enhancing expressiveness for multi-modal estimation.

Yinzhuang Yi, Jorge Cortés, Nikolay Atanasov2026-03-06💻 cs

ReactDance: Hierarchical Representation for High-Fidelity and Coherent Long-Form Reactive Dance Generation

ReactDance is a novel diffusion framework that achieves high-fidelity, coherent long-form reactive dance generation by employing Hierarchical Finite Scalar Quantization for fine-grained spatial control and a Blockwise Local Context strategy for efficient, temporally consistent sequence synthesis.

Jingzhong Lin, Xinru Li, Yuanyuan Qi + 8 more2026-03-06💻 cs

Learning Virtual Machine Scheduling in Cloud Computing through Language Agents

This paper proposes MiCo, a hierarchical language agent framework that leverages large language models to design adaptive heuristics for solving the complex Online Dynamic Multidimensional Bin Packing problem in cloud VM scheduling, achieving a 96.9% competitive ratio in large-scale, real-world scenarios.

JieHao Wu, Ziwei Wang, Junjie Sheng + 3 more2026-03-06💻 cs

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference

This paper introduces CausalPitfalls, a comprehensive benchmark designed to rigorously evaluate and expose the significant limitations of large language models in handling statistical causal inference pitfalls, such as Simpson's paradox, through both direct and code-assisted prompting protocols.

Jin Du, Li Chen, Xun Xian + 6 more2026-03-06💻 cs

ShIOEnv: A Command Evaluation Environment for Grammar-Constrained Synthesis and Execution Behavior Modeling

This paper introduces ShIOEnv, a grammar-constrained, self-supervised Bash environment that generates 2.1 million system-grounded input-output pairs to significantly improve the accuracy of modeling complex command-line execution behaviors compared to prior execution-free approaches.

Jarrod Ragsdale, Rajendra Boppana2026-03-06💻 cs

VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

VTool-R1 is a novel framework that leverages reinforcement learning to train vision-language models to generate multimodal chains of thought by strategically interleaving text with intermediate visual reasoning steps using Python-based editing tools, thereby enhancing performance on structured visual tasks without requiring process-based supervision.

Mingyuan Wu, Jingcheng Yang, Jize Jiang + 6 more2026-03-06💻 cs

Attribute-Efficient PAC Learning of Sparse Halfspaces with Constant Malicious Noise Rate

This paper presents an attribute-efficient PAC learning algorithm for sparse halfspaces that achieves robustness against a constant malicious noise rate using $poly(s, \log d)$ samples by applying simple variants to hinge loss minimization under specific concentration and margin conditions.

Shiwei Zeng, Jie Shen2026-03-06💻 cs

Highly Efficient and Effective LLMs with Multi-Boolean Architectures

This paper introduces a novel framework that enables direct finetuning of large language models using multi-kernel Boolean parameters without latent weights, significantly reducing complexity while outperforming existing ultra low-bit quantization and binarization techniques.

Ba-Hien Tran, Van Minh Nguyen2026-03-06💻 cs

Continuous Chain of Thought Enables Parallel Exploration and Reasoning

This paper introduces Continuous Chain of Thought (CoT2), a framework that replaces discrete token sampling with continuously-valued tokens to enable parallel exploration of multiple reasoning traces, offering theoretical guarantees for solving combinatorial problems and demonstrating improved performance through novel supervision and policy optimization strategies.

Halil Alperen Gozeten, M. Emrullah Ildiz, Xuechen Zhang + 3 more2026-03-06💻 cs

SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

The paper introduces SealQA, a new benchmark comprising three challenging flavors (Seal-0, Seal-Hard, and LongSeal) designed to evaluate search-augmented language models on fact-seeking tasks with noisy or conflicting web results, revealing that even frontier models struggle significantly with reasoning accuracy, robustness to noise, and long-context document retrieval.

Thinh Pham, Nguyen Nguyen, Pratibha Zunjare + 3 more2026-03-06💻 cs

FPGA-Enabled Machine Learning Applications in Earth Observation: A Systematic Review

This systematic review analyzes 68 experiments deploying machine learning models on FPGAs for Earth Observation, introducing dual taxonomies for model architectures and implementation strategies to address the challenges of onboard processing in the NewSpace era.

Cédric Léonard, Dirk Stober, Martin Schulz2026-03-06💻 cs

HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

This paper introduces Poly2Graph, an automated pipeline for generating HSG-12M, a pioneering 16.7-million-scale dataset of spatial multigraphs derived from non-Hermitian crystal energy spectra, which bridges condensed matter physics and geometry-aware graph learning by preserving vital geometric information often discarded in existing benchmarks.

Xianquan Yan, Hakan Akgün, Kenji Kawaguchi + 2 more2026-03-06🔬 cond-mat.mes-hall

← Previous Next →