cs.LG papers | Gist.Science

HAPEns: Hardware-Aware Post-Hoc Ensembling for Tabular Data

The paper introduces HAPEns, a novel post-hoc ensembling method for tabular data that constructs diverse ensembles along the Pareto front of predictive performance and hardware efficiency, significantly outperforming existing baselines across 83 datasets by explicitly balancing accuracy with resource constraints.

Jannis Maier, Lennart Purucker | 2026-03-12 | 🤖 cs.LG
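
The Pareto front mentioned in the HAPEns summary above can be illustrated with a minimal sketch. This is not the HAPEns algorithm itself; it only shows how non-dominated candidates are selected when both predictive error and hardware cost should be low. The model names, error values, and inference times are hypothetical.

```python
def pareto_front(candidates):
    """Return the candidates that no other candidate dominates.

    Each candidate is a (name, error, cost) tuple; lower is better for both.
    A candidate is dominated if another one is at least as good on both
    objectives and strictly better on at least one.
    """
    front = []
    for name, err, cost in candidates:
        dominated = any(
            (e <= err and c <= cost) and (e < err or c < cost)
            for _, e, c in candidates
        )
        if not dominated:
            front.append((name, err, cost))
    # Sort by cost so the trade-off curve reads from cheapest to most expensive.
    return sorted(front, key=lambda t: t[2])


# Hypothetical models: (name, validation error, inference time in ms).
models = [
    ("gbm_large", 0.10, 50.0),
    ("gbm_small", 0.12, 10.0),
    ("mlp", 0.15, 20.0),
    ("linear", 0.20, 1.0),
]
front = pareto_front(models)
print([name for name, _, _ in front])
```

Here `mlp` is dropped because `gbm_small` is both more accurate and cheaper; the remaining three models each represent a different accuracy/cost trade-off.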

Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning

This paper empirically demonstrates that, contrary to the hypothesis that moral reasoning alignment requires diversity-seeking algorithms, standard reward-maximizing RLVR methods are equally or more effective, because high-reward moral responses exhibit a concentrated distribution in semantic space similar to that of logical reasoning tasks.

Zhaowei Zhang, Xiaohan Liu, Xuekai Zhu, Junchao Huang, Ceyao Zhang, Zhiyuan Feng, Yaodong Yang, Xiaoyuan Yi, Xing Xie | 2026-03-12 | 🤖 cs.AI

Gradient Flow Drifting: Generative Modeling via Wasserstein Gradient Flows of KDE-Approximated Divergences

This paper establishes a mathematical framework called Gradient Flow Drifting that proves the equivalence between the recently proposed Drifting Model and the Wasserstein gradient flow of the forward KL divergence under KDE approximation, while extending the approach to a mixed-divergence strategy on Riemannian manifolds to simultaneously mitigate mode collapse and blurring.

Jiarui Cao, Zixuan Wei, Yuxin Liu | 2026-03-12 | 🤖 cs.LG

Self-Scaled Broyden Family of Quasi-Newton Methods in JAX

This technical note presents a JAX-compatible implementation of the Self-Scaled Broyden family of quasi-Newton methods, including BFGS, DFP, and Broyden variants with Zoom line search, built upon the Optimistix library to facilitate their adoption within the JAX community.

Ivan Bioli, Mikel Mendibe Abarrategi | 2026-03-12 | 🤖 cs.LG

Geo-ATBench: A Benchmark for Geospatial Audio Tagging with Geospatial Semantic Context

This paper introduces Geo-ATBench, a new benchmark and the Geo-AT task that leverage geospatial semantic context to resolve acoustic ambiguities in multi-label audio tagging, demonstrating through the GeoFusion-AT framework that incorporating location-based priors significantly improves recognition performance and aligns with human judgment.

Yuanbo Hou, Yanru Wu, Qiaoqiao Ren, Shengchen Li, Stephen Roberts, Dick Botteldooren | 2026-03-12 | ⚡ eess

Reinforcement Learning with Conditional Expectation Reward

This paper proposes Conditional Expectation Reward (CER), a novel reinforcement learning method that utilizes the large language model itself as an implicit verifier to provide soft, graded reward signals, thereby overcoming the limitations of rule-based verification and enabling effective reasoning training across both mathematical and general free-form answer domains.

Changyi Xiao, Caijun Xu, Yixin Cao | 2026-03-12 | 🤖 cs.LG

Detecting and Eliminating Neural Network Backdoors Through Active Paths with Application to Intrusion Detection

This paper proposes a novel, explainable approach to detect and eliminate neural network backdoors by analyzing active paths within the model, demonstrating its effectiveness through experiments on intrusion detection systems.

Eirik Høyheim, Magnus Wiik Eckhoff, Gudmund Grov, Robert Flood, David Aspinall | 2026-03-12 | 🤖 cs.AI

FAME: Formal Abstract Minimal Explanation for Neural Networks

The paper introduces FAME, a novel abductive explanation method for large neural networks that utilizes dedicated perturbation domains and LiRPA-based bounds to efficiently generate formal abstract minimal explanations, demonstrating superior performance in explanation size and runtime compared to VERIX+.

Ryma Boumazouza, Raya Elsaleh, Melanie Ducoffe, Shahaf Bassan, Guy Katz | 2026-03-12 | 🤖 cs.AI

Spatio-Temporal Attention Graph Neural Network: Explaining Causalities With Attention

This paper proposes a Spatio-Temporal Attention Graph Neural Network (STA-GNN) that integrates conformal prediction to provide unsupervised, explainable, and drift-aware anomaly detection for Industrial Control Systems by modeling dynamic inter-dependencies across physical and network entities.

Kosti Koistinen, Kirsi Hellsten, Joni Herttuainen, Kimmo K. Kaski | 2026-03-12 | 🤖 cs.LG

Surrogate models for nuclear fusion with parametric Shallow Recurrent Decoder Networks: applications to magnetohydrodynamics

This paper demonstrates that a data-driven framework combining Singular Value Decomposition with Shallow Recurrent Decoder (SHRED) networks can accurately and efficiently reconstruct full spatio-temporal magnetohydrodynamic states from sparse temperature sensor measurements, offering a robust surrogate model for real-time monitoring and control in nuclear fusion applications.

M. Lo Verso, C. Introini, E. Cervi, L. Savoldi, J. N. Kutz, A. Cammi | 2026-03-12 | 🤖 cs.LG

Contract And Conquer: How to Provably Compute Adversarial Examples for a Black-Box Model?

This paper introduces Contract And Conquer (CAC), a black-box adversarial attack method that combines knowledge distillation on an expanding dataset with precise search space contraction to provably compute adversarial examples within a fixed number of iterations, outperforming existing state-of-the-art approaches on ImageNet.

Anna Chistyakova, Mikhail Pautov | 2026-03-12 | 🤖 cs.LG

EvoSchema: Towards Text-to-SQL Robustness Against Schema Evolution

This paper introduces EvoSchema, a comprehensive benchmark featuring a novel taxonomy of ten schema perturbation types to evaluate and enhance the robustness of text-to-SQL models against real-world database schema evolution, revealing that table-level changes significantly impact performance and demonstrating that training on diverse schema designs improves model resilience.

Tianshu Zhang, Kun Qian, Siddhartha Sahai, Yuan Tian, Shaddy Garg, Huan Sun, Yunyao Li | 2026-03-12 | 💬 cs.CL

Riemannian MeanFlow for One-Step Generation on Manifolds

This paper introduces Riemannian MeanFlow (RMF), a novel framework that enables efficient one-step generative modeling on manifolds by defining an average-velocity field via parallel transport and utilizing a log-map tangent representation to avoid costly numerical integration while maintaining high sample quality.

Zichen Zhong, Haoliang Sun, Yukun Zhao, Yongshun Gong, Yilong Yin | 2026-03-12 | 🤖 cs.LG

Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions

This paper introduces "Sample-and-Search," a learning-augmented algorithm for high-dimensional $k$-median clustering that utilizes a predictor to preprocess data, thereby significantly reducing both computational complexity and exponential dimensionality dependency while achieving lower clustering costs compared to state-of-the-art methods.

Kangke Cheng, Shihong Song, Guanlin Mo, Hu Ding | 2026-03-12 | 🤖 cs.LG

CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems

CacheSolidarity is a lightweight system that secures multi-tenant LLM serving against Automatic Prefix Caching side-channel attacks by selectively isolating suspicious cache reuse, thereby achieving significantly higher cache efficiency and lower latency compared to existing all-or-nothing isolation defenses.

Panagiotis Georgios Pennas, Konstantinos Papaioannou, Marco Guarnieri, Thaleia Dimitra Doudali | 2026-03-12 | 🤖 cs.LG

Beyond Accuracy: Reliability and Uncertainty Estimation in Convolutional Neural Networks

This paper compares Monte Carlo Dropout and Conformal Prediction for uncertainty estimation in CNNs trained on Fashion-MNIST, revealing that while H-CNN VGG16 achieves higher accuracy, GoogLeNet offers better calibration and Conformal Prediction provides statistically guaranteed reliability for high-stakes applications.

Sanne Ruijs, Alina Kosiakova, Farrukh Javed | 2026-03-12 | 📊 stat
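
The "statistically guaranteed reliability" attributed to Conformal Prediction in the summary above can be made concrete with a minimal sketch of split conformal classification. This is a generic textbook construction, not necessarily the exact procedure used in the paper. It assumes the nonconformity score is one minus the predicted probability of the true class; the calibration scores and softmax outputs below are hypothetical.

```python
import math


def conformal_threshold(cal_scores, alpha):
    """Threshold from held-out calibration nonconformity scores.

    Uses the ceil((n + 1) * (1 - alpha))-th smallest score. If calibration
    and test points are exchangeable, prediction sets built with this
    threshold contain the true label with probability at least 1 - alpha.
    """
    n = len(cal_scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: include every label.
        return math.inf
    return sorted(cal_scores)[k - 1]


def prediction_set(probs, qhat):
    """All labels whose score (1 - predicted probability) is within qhat."""
    return [label for label, p in enumerate(probs) if 1.0 - p <= qhat]


# Hypothetical calibration scores: 1 - p(true class) on 7 held-out images.
cal_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7]
qhat = conformal_threshold(cal_scores, alpha=0.25)

# Hypothetical softmax outputs for two test images over three classes.
uncertain = prediction_set([0.50, 0.45, 0.05], qhat)
confident = prediction_set([0.90, 0.06, 0.04], qhat)
print(qhat, uncertain, confident)
```

An ambiguous input yields a larger prediction set, while a confident one yields a single label; the set size is what carries the uncertainty information.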

A Grammar of Machine Learning Workflows

This paper proposes a structural remedy for data leakage in machine learning: a grammar of seven kernel primitives connected by a typed directed acyclic graph with four hard constraints. One of these, a runtime-enforced terminal assess constraint, prevents selection and memorization leakage. The claim is validated across 2,047 experimental instances, and the grammar is implemented in Python, R, and Julia.

Simon Roth | 2026-03-12 | 🤖 cs.LG

CUPID: A Plug-in Framework for Joint Aleatoric and Epistemic Uncertainty Estimation with a Single Model

CUPID is a novel, plug-in framework that enables joint estimation of aleatoric and epistemic uncertainty in pretrained deep learning models without requiring retraining or architectural modifications, thereby enhancing interpretability and trust in high-stakes AI applications.

Xinran Xu, Xiuyi Fan | 2026-03-12 | 🤖 cs.LG

Deep Randomized Distributed Function Computation (DeepRDFC): Neural Distributed Channel Simulation

This paper proposes a deep learning-based autoencoder architecture for the Randomized Distributed Function Computation (RDFC) framework that minimizes the total variation distance to an unknown target distribution using only data samples, demonstrating superior communication efficiency compared to traditional data compression methods, particularly under limited common randomness.

Didrik Bergström, Onur Günlü | 2026-03-12 | 🔢 math
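
For readers unfamiliar with the total variation distance named in the DeepRDFC summary above, a minimal sketch for discrete distributions follows. It only illustrates the metric on empirical samples and is not the paper's training objective or architecture; the target distribution and samples are hypothetical.

```python
from collections import Counter


def empirical_distribution(samples):
    """Relative frequency of each distinct outcome in a list of samples."""
    counts = Counter(samples)
    total = len(samples)
    return {outcome: c / total for outcome, c in counts.items()}


def total_variation(p, q):
    """Total variation distance: half the L1 distance between p and q.

    p and q map outcomes to probabilities; missing outcomes count as 0.
    """
    support = set(p) | set(q)
    return 0.5 * sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in support)


# Hypothetical target distribution and samples from a simulated channel.
target = {"a": 0.5, "b": 0.5}
simulated = empirical_distribution(["a", "a", "a", "b"])
tv = total_variation(target, simulated)
print(tv)
```

The distance is 0 when the two distributions coincide and 1 when their supports are disjoint.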

A PUF-Based Approach for Copy Protection of Intellectual Property in Neural Network Models

This paper proposes a method to protect intellectual property in neural network models by binding their weights to unique hardware properties using Physically Unclonable Functions (PUFs), thereby preventing accurate execution on cloned hardware.

Daniel Dorfmeister, Flavio Ferrarotti, Bernhard Fischer, Martin Schwandtner, Hannes Sochor | 2026-03-12 | 🤖 cs.LG
