cs.LG papers | Gist.Science

Analysis-Driven Procedural Generation of an Engine Sound Dataset with Embedded Control Annotations

This paper introduces an analysis-driven framework that generates a publicly available, 19-hour procedural engine sound dataset with sample-accurate RPM and torque annotations by extracting harmonic structures from real recordings to drive a parametric synthesizer, thereby addressing the scarcity of clean, standardized audio data for automotive sound design and machine learning applications.

Robin Doerfler, Lonce Wyse2026-03-10🤖 cs.LG

Models as Lego Builders: Assembling Malice from Benign Blocks via Semantic Blueprints

This paper introduces StructAttack, a black-box jailbreak framework that exploits the semantic slot-filling vulnerability of Large Vision-Language Models by embedding benign-looking visual structures to covertly assemble and generate harmful content.

Chenxi Li, Xianggan Liu, Dake Shen, Yaosong Du, Zhibo Yao, Hao Jiang, Linyi Jiang, Chengwei Cao, Jingzhe Zhang, RanYi Peng, Peiling Bai, Xiande Huang2026-03-10🤖 cs.LG

Shorter Thoughts, Same Answers: Difficulty-Scaled Segment-Wise RL for CoT Compression

The paper proposes Difficulty-Scaled Segment-Wise GRPO (DSS-GRPO), a reinforcement learning method that decomposes training signals into separate "think" and "answer" segments with difficulty-aware scaling to compress reasoning traces without compromising answer quality.

Ye Tian, Aijun Liu2026-03-10🤖 cs.LG

MetaSort: An Accelerated Approach for Non-uniform Compression and Few-shot Classification of Neural Spike Waveforms

MetaSort is a novel algorithm that integrates an adaptive level crossing compression technique with meta-transfer learning-based feature representation to simultaneously achieve high-fidelity neural spike compression and robust few-shot classification, demonstrating strong potential for ultra-low-power on-chip implementation.

Luca M. Meyer, Majid Zamani2026-03-10🤖 cs.LG

TT-Sparse: Learning Sparse Rule Models with Differentiable Truth Tables

The paper introduces TT-Sparse, a differentiable neural framework utilizing truth tables and a novel soft TopK operator to learn sparse, high-performance rule-based models that enable exact extraction of globally interpretable Boolean formulas while outperforming existing state-of-the-art methods in both accuracy and complexity.

Hans Farrell Soegeng, Sarthak Ketanbhai Modi, Thomas Peyrin2026-03-10🤖 cs.LG

MAS-H2: A Hierarchical Multi-Agent System for Holistic Cloud-Native Autoscaling

This paper introduces MAS-H2, a hierarchical multi-agent system for Kubernetes that bridges the gap between business policies and resource provisioning through strategic, planning, and execution agents, demonstrating significant reductions in CPU stress and peak load while enabling zero-downtime infrastructure migrations compared to native autoscalers.

Hamed Hamzeh, Parisa Vahdatian2026-03-10🤖 cs.LG

Compression as Adaptation: Implicit Visual Representation with Diffusion Foundation Models

This paper proposes a novel visual representation framework that encodes signals as functions parametrized by low-rank adaptations on frozen diffusion models, enabling compact storage via single-vector hashing and bridging visual compression with generation through inference-time scaling and control.

Jiajun He, Zongyu Guo, Zhaoyang Jia, Xiaoyi Zhang, Jiahao Li, Xiao Li, Bin Li, José Miguel Hernández-Lobato, Yan Lu2026-03-10🤖 cs.LG

SMAT: Staged Multi-Agent Training for Co-Adaptive Exoskeleton Control

The paper proposes Staged Multi-Agent Training (SMAT), a four-stage curriculum that progressively trains a human-exoskeleton system to achieve stable co-adaptation, resulting in a control policy that significantly reduces hip muscle activation and delivers consistent, positive mechanical power across diverse users without requiring subject-specific retraining.

Yifei Yuan, Ghaith Androwis, Xianlian Zhou2026-03-10🤖 cs.LG

Accelerating Diffusion Models for Generative AI Applications with Silicon Photonics

This paper introduces a novel silicon photonics-based accelerator that significantly enhances the energy efficiency and throughput of diffusion models, addressing the high computational costs and energy consumption associated with their iterative denoising processes on conventional electronic platforms.

Tharini Suresh, Salma Afifi, Sudeep Pasricha2026-03-10🤖 cs.LG

Exoskeleton Control through Learning to Reduce Biological Joint Moments in Simulations

This paper presents a reinforcement learning framework for training exoskeleton controllers to reduce biological joint moments and establishes a quantitative validation pipeline that demonstrates strong simulation-to-data consistency in torque predictions, particularly at the hip, while identifying specific challenges in timing and power injection for sim-to-real transfer.

Zihang You, Xianlian Zhou2026-03-10🤖 cs.LG

Helix: Evolutionary Reinforcement Learning for Open-Ended Scientific Problem Solving

The paper introduces HELIX, a Hierarchical Evolutionary Reinforcement Learning framework that combines in-context learning with iterative policy refinement to achieve state-of-the-art results in open-ended scientific problem solving, outperforming existing methods and GPT-4o on tasks like circle packing and machine learning benchmarks.

Chang Su, Zhongkai Hao, Zhizhou Zhang, Zeyu Xia, Youjia Wu, Hang Su, Jun Zhu2026-03-10🤖 cs.LG

Evaluating Synthetic Data for Baggage Trolley Detection in Airport Logistics

This paper proposes a high-fidelity synthetic data generation pipeline using NVIDIA Omniverse to address data scarcity and privacy constraints in airport logistics, demonstrating that mixed training with synthetic data and only 40% of real annotations achieves performance comparable to full real-data baselines while reducing annotation effort by 25–35%.

Abdeldjalil Taibi, Mohmoud Badlis, Amina Bensalem, Belkacem Zouilekh, Mohammed Brahimi2026-03-10🤖 cs.LG

Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data

This paper proposes FedCEF, a novel federated learning algorithm that combines a decoupled proximal update scheme with error feedback and control variates to achieve communication-efficient, sublinear convergence for non-convex composite optimization on heterogeneous data under extreme compression.

Pu Qiu, Chen Ouyang, Yongyang Xiong, Keyou You, Wanquan Liu, Yang Shi2026-03-10🤖 cs.LG

Partial Differential Equations in the Age of Machine Learning: A Critical Synthesis of Classical, Machine Learning, and Hybrid Methods

This critical review synthesizes classical and machine learning approaches for solving partial differential equations by contrasting their deductive and inductive epistemologies, identifying three genuine complementarities, and establishing principles for hybrid methods that rigorously address error budgets and structural guarantees across emerging computational frontiers.

Mohammad Nooraiepour, Jakub Wiktor Both, Teeratorn Kadeethum, Saeid Sadeghnejad2026-03-10🤖 cs.LG

Beyond Surrogates: A Quantitative Analysis for Inter-Metric Relationships

This paper proposes a unified theoretical framework that quantifies the relationships between different evaluation metrics using Bayes-Optimal Set and Regret Transfer to bridge the gap between offline validation and online performance by addressing the structural asymmetry in metric mismatch.

Yuanhao Pu, Defu Lian, Enhong Chen2026-03-10🤖 cs.LG

Mitigating the Memory Bottleneck with Machine Learning-Driven and Data-Aware Microarchitectural Techniques

This dissertation addresses the memory bottleneck in modern computing by advocating a shift from data-agnostic to data-informed microarchitectural designs, proposing four machine learning-driven and data-aware mechanisms that significantly enhance performance and energy efficiency.

Rahul Bera2026-03-10🤖 cs.LG

Scalable Training of Mixture-of-Experts Models with Megatron Core

This paper presents Megatron Core, a scalable and production-ready open-source framework that addresses the coupled memory, communication, and computation challenges of Mixture-of-Experts (MoE) training through integrated system-level optimizations, enabling high-performance training of models ranging from billions to trillions of parameters on large-scale GPU clusters.

Zijie Yan (NVIDIA), Hongxiao Bai (NVIDIA), Xin Yao (NVIDIA), Dennis Liu (NVIDIA), Tong Liu (NVIDIA), Hongbin Liu (NVIDIA), Pingtian Li (NVIDIA), Evan Wu (NVIDIA), Shiqing Fan (NVIDIA), Li Tao (NVIDIA), Robin Zhang (NVIDIA), Yuzhong Wang (NVIDIA), Shifang Xu (NVIDIA), Jack Chang (NVIDIA), Xuwen Chen (NVIDIA), Kunlun Li (NVIDIA), Yan Bai (NVIDIA), Gao Deng (NVIDIA), Nan Zheng (NVIDIA), Vijay Anand Korthikanti (NVIDIA), Abhinav Khattar (NVIDIA), Ethan He (NVIDIA), Soham Govande (NVIDIA), Sangkug Lym (NVIDIA), Zhongbo Zhu (NVIDIA), Qi Zhang (NVIDIA), Haochen Yuan (NVIDIA), Xiaowei Ren (NVIDIA), Deyu Fu (NVIDIA), Tailai Ma (NVIDIA), Shunkang Zhang (NVIDIA), Jiang Shao (NVIDIA), Ray Wang (NVIDIA), Santosh Bhavani (NVIDIA), Xipeng Li (NVIDIA), Chandler Zhou (NVIDIA), David Wu (NVIDIA), Yingcan Wei (NVIDIA), Ashwath Aithal (NVIDIA), Michael Andersch (NVIDIA), Mohammad Shoeybi (NVIDIA), Jiajie Yao (NVIDIA), June Yang (NVIDIA)2026-03-10🤖 cs.LG

Global Convergence of Average Reward Constrained MDPs with Neural Critic and General Policy Parameterization

This paper proposes a primal-dual natural actor-critic algorithm using multi-layer neural network critics and Neural Tangent Kernel theory to establish the first global convergence and cumulative constraint violation guarantees for infinite-horizon Constrained MDPs with general policy parameterizations, overcoming the limitations of previous tabular or linear-critic approaches.

Anirudh Satheesh, Pankaj Kumar Barman, Washim Uddin Mondal, Vaneet Aggarwal2026-03-10🤖 cs.LG

Step-Size Decay and Structural Stagnation in Greedy Sparse Learning

This paper demonstrates that over-decaying step-size schedules in greedy sparse learning algorithms induce structural stagnation and prevent convergence in realizable regression problems, even in low-dimensional settings with controlled feature coherence.

Pablo M. Berná2026-03-10🤖 cs.LG

Deep Incentive Design with Differentiable Equilibrium Blocks

This paper introduces Deep Incentive Design (DID), a novel framework that employs game-agnostic differentiable equilibrium blocks (DEBs) to enable the automated, neural-network-based design of multi-agent interactions across diverse economic and computer science tasks, effectively solving complex incentive problems by handling a wide range of game scales and contexts.

Vinzenz Thoma, Georgios Piliouras, Luke Marris2026-03-10🤖 cs.LG

← Previous Next →