cs.LG papers | Gist.Science

MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems

This paper introduces an MCMC-informed neural emulator framework that decouples uncertainty quantification from network architecture by incorporating model-parameter distributions as training inputs, thereby enabling computationally efficient and accurate surrogate modeling for dynamical systems while avoiding exhaustive sampling and unphysical parameter evaluations.

Heikki Haario, Zhi-Song Liu, Martin Simon, Hendrik Weichel2026-03-12🤖 cs.LG

ForwardFlow: Simulation only statistical inference using deep learning

This paper proposes "ForwardFlow," a frequentist deep learning framework that utilizes a branched neural network trained on simulated data to directly estimate statistical parameters, demonstrating advantages such as finite sample exactness, robustness to contamination, and the ability to automatically approximate complex algorithms like the EM-algorithm.

Stefan Böhringer2026-03-12📊 stat

Bayesian Optimization with Gaussian Processes to Accelerate Stationary Point Searches

This paper presents a unified Bayesian optimization framework using Gaussian processes with derivative observations and advanced extensions like Optimal Transport and random Fourier features to efficiently accelerate the search for minima and saddle points on potential energy surfaces, bridging theoretical formulation with practical implementation through accompanying Rust code.

Rohit Goswami (Institute IMX and Lab-COSMO, École polytechnique fédérale de Lausanne)2026-03-12📊 stat

Factorized Neural Implicit DMD for Parametric Dynamics

This paper proposes Factorized Neural Implicit DMD, a data-driven method that parameterizes the Koopman operator's spectral decomposition via a physics-coded neural field to decouple spatial modes and temporal evolution, thereby enabling stable long-term rollouts, parameter generalization, and spectral analysis for high-dimensional nonlinear dynamical systems.

Siyuan Chen, Zhecheng Wang, Yixin Chen, Yue Chang, Peter Yichen Chen, Eitan Grinspun, Jonathan Panuelos2026-03-12🤖 cs.LG

Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons

This study demonstrates that a cross-species transfer learning approach, utilizing an attention-based BiLSTM model pretrained on mouse Patch-seq data and fine-tuned on human data, successfully improves the prediction of conserved GABAergic interneuron subclasses from electrophysiological recordings to transcriptomic identities.

Theo Schwider, Ramin Ramezani2026-03-12🧬 q-bio

Leech Lattice Vector Quantization for Efficient LLM Compression

This paper introduces Leech Lattice Vector Quantization (LLVQ), a practical algorithm that leverages the optimal 24-dimensional Leech lattice and an extended Golay code-based search to achieve state-of-the-art LLM compression performance without requiring explicit codebook storage.

Tycho F. A. van der Ouderaa, Mart van Baalen, Paul Whatmough, Markus Nagel2026-03-12🤖 cs.LG

V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation

V2M-Zero introduces a zero-pair video-to-music generation framework that achieves superior temporal synchronization and semantic alignment by leveraging shared intra-modal temporal structures via event curves, eliminating the need for paired training data or cross-modal supervision.

Yan-Bo Lin, Jonah Casebeer, Long Mai, Aniruddha Mahapatra, Gedas Bertasius, Nicholas J. Bryan2026-03-12🤖 cs.AI

Neural Field Thermal Tomography: A Differentiable Physics Framework for Non-Destructive Evaluation

The paper introduces Neural Field Thermal Tomography (NeFTY), a differentiable physics framework that parameterizes 3D material diffusivity as a continuous neural field optimized via a rigorous numerical solver to achieve high-resolution, quantitative reconstruction of subsurface defects from transient surface temperature measurements, overcoming the limitations of traditional 1D approximations and soft-constrained PINNs.

Tao Zhong, Yixun Hu, Dongzhe Zheng, Aditya Sood, Christine Allen-Blanchette2026-03-12🔬 cond-mat.mtrl-sci

XConv: Low-memory stochastic backpropagation for convolutional layers

XConv is a drop-in replacement for standard convolutional layers that significantly reduces memory usage during training by storing compressed activations and approximating weight gradients via randomized trace estimation, while maintaining performance comparable to exact gradient methods without imposing architectural constraints or requiring codebase modifications.

Anirudh Thatipelli, Jeffrey Sam, Mathias Louboutin, Ali Siahkoohi, Rongrong Wang, Felix J. Herrmann2026-03-11🤖 cs.LG

A Survey on Decentralized Federated Learning

This survey systematically reviews decentralized federated learning methods from 2018 to early 2026, categorizing them into traditional distributed and blockchain-based architectures, proposing a unified challenge-driven taxonomy, and outlining future research directions to address security, privacy, and system-level trade-offs in coordinator-free settings.

Edoardo Gabrielli, Anthony Di Pietro, Dario Fenoglio, Giovanni Pica, Gabriele Tolomei2026-03-11🤖 cs.LG

Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets

This paper proves that randomly initialized, polynomially over-parameterized convolutional neural networks contain structured subnetworks capable of approximating smaller networks without training, by developing new mathematical tools to overcome previous limitations in analyzing the Strong Lottery Ticket Hypothesis for structured pruning.

Arthur da Cunha, Francesco d'Amore, Emanuele Natale2026-03-11🤖 cs.LG

Provable Filter for Real-world Graph Clustering

This paper proposes a novel, theoretically grounded graph clustering method that constructs homophilic and heterophilic graphs to build low-pass and high-pass filters enhanced by a squeeze-and-excitation block, effectively addressing the limitations of existing approaches in handling the structural disparities of real-world graphs.

Xuanting Xie, Erlin Pan, Zhao Kang, Wenyu Chen, Bingheng Li2026-03-11🤖 cs.LG

Enhancing Computational Efficiency in Multiscale Systems Using Deep Learning of Coordinates and Flow Maps

This paper proposes a deep learning framework that jointly discovers optimal coordinates and flow maps to enable precise, computationally efficient time-stepping for multiscale systems, achieving state-of-the-art predictive accuracy with reduced costs on complex models like the Fitzhugh-Nagumo neuron and Kuramoto-Sivashinsky equations.

Asif Hamid, Danish Rafiq, Shahkar Ahmad Nahvi, Mohammad Abid Bazaz2026-03-11🤖 cs.LG

Fairness-Aware Multi-Group Target Detection in Online Discussion

This paper proposes a fairness-aware multi-group target detection approach for online discussions that effectively reduces bias across demographic groups while maintaining strong predictive performance, particularly in the context of toxicity detection where harm is highly dependent on the targeted group.

Soumyajit Gupta, Maria De-Arteaga, Matthew Lease2026-03-11🤖 cs.LG

Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network

This paper proposes and evaluates a distributed multi-agent Q-learning solution for HD map updates in vehicular networks that reduces computational burdens and compatibility issues while significantly improving time latencies across various traffic scenarios compared to single-agent approaches.

Jeffrey Redondo, Nauman Aslam, Juan Zhang + 1 more2026-03-11🤖 cs.AI

Sparse Variational Student-t Processes for Heavy-tailed Modeling

This paper introduces Sparse Variational Student-t Processes (SVTP), a scalable framework that extends sparse inducing point methods to Student-t processes via novel inference algorithms and natural gradient optimization, achieving superior robustness to outliers and heavy-tailed data with significantly faster convergence and lower prediction error compared to sparse Gaussian processes on large datasets.

Jian Xu, Delu Zeng, John Paisley2026-03-11🤖 cs.AI

HYGENE: A Diffusion-based Hypergraph Generation Method

This paper introduces HYGENE, the first deep learning-based diffusion method that generates realistic and diverse hypergraphs by iteratively expanding a bipartite representation from a single connected node pair through a progressive local expansion process.

Dorian Gailhard, Enzo Tartaglione, Lirida Naviner, Jhony H. Giraldo2026-03-11🤖 cs.LG

Robust Training of Neural Networks at Arbitrary Precision and Sparsity

This paper introduces a unified framework that models quantization and sparsification as additive noise to derive a principled, noise-corrective gradient path, enabling the stable training of neural networks at arbitrary low precisions and sparsity levels without relying on heuristic estimators like the Straight-Through Estimator.

Chengxi Ye, Grace Chu, Yanfeng Liu, Yichi Zhang, Lukasz Lew, Li Zhang, Mark Sandler, Andrew Howard2026-03-11🤖 cs.AI

ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning

The paper introduces ARLBench, a flexible and efficient benchmark for hyperparameter optimization in reinforcement learning that utilizes a representative subset of tasks to enable cost-effective comparisons of diverse AutoRL methods and lower the barrier to entry for researchers with limited compute resources.

Jannis Becktepe, Julian Dierkes, Carolin Benjamins, Aditya Mohan, David Salinas, Raghu Rajan, Frank Hutter, Holger Hoos, Marius Lindauer, Theresa Eimer2026-03-11🤖 cs.LG

DRUPI: Dataset Reduction Using Privileged Information

The paper introduces DRUPI (Dataset Condensation using Privileged Information), a framework that enhances dataset condensation by synthesizing auxiliary privileged information, such as feature or attention labels, alongside reduced data to significantly improve model training performance across various benchmarks.

Shaobo Wang, Youxin Jiang, Tianle Niu, Yantai Yang, Ruiji Zhang, Shuhao Hu, Shuaiyu Zhang, Chenghao Sun, Weiya Li, Conghui He, Xuming Hu, Linfeng Zhang2026-03-11🤖 cs.AI

← Previous Next →