cs.LG papers | Gist.Science

Regularized Online RLHF with Generalized Bilinear Preferences

This paper proposes a regularized online RLHF framework using Generalized Bilinear Preference Models to identify Nash Equilibria, establishing the first statistically efficient, dimension-free regret bounds for high-dimensional settings through two simple algorithms that leverage strong convexity and low-rank structures.

Junghyun Lee, Minju Hong, Kwang-Sung Jun + 2 more2026-03-06💻 cs

Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory

This paper introduces Lap2, a novel framework that enables L2-norm clipping for Laplace DP-SGD in high-dimensional models by leveraging majorization theory and Schur-convexity to overcome dimensionality barriers, thereby achieving privacy-utility performance comparable to or exceeding Gaussian DP-SGD.

Meisam Mohammady, Qin Yang, Nicholas Stout, Ayesha Samreen, Han Wang, Christopher J Quinn, Yuan Hong2026-03-06🔒 cs.CR

Inference-time optimization for experiment-grounded protein ensemble generation

This paper introduces a general inference-time optimization framework that generates experiment-grounded protein ensembles by optimizing latent representations and employing novel sampling schemes, thereby overcoming the limitations of current diffusion-based methods to produce thermodynamically plausible structures with improved agreement to experimental data while exposing vulnerabilities in existing confidence metrics.

Advaith Maddipatla, Anar Rzayev, Marco Pegoraro + 5 more2026-03-06💻 cs

Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

The paper introduces Jailbreak Foundry (JBF), a multi-agent system that automatically translates jailbreak research papers into executable modules within a unified harness, enabling rapid, reproducible, and standardized benchmarking of large language model security against rapidly evolving attack techniques.

Zhicheng Fang, Jingjie Zheng, Chenxu Fu, Wei Xu2026-03-06🔒 cs.CR

DiffusionHarmonizer: Bridging Neural Reconstruction and Photorealistic Simulation with Online Diffusion Enhancer

DiffusionHarmonizer is an online, single-step generative framework that leverages a custom data curation pipeline to transform imperfect neural reconstruction renderings into temporally consistent, photorealistic simulations, effectively resolving artifacts and harmonizing inserted dynamic objects for autonomous robot development.

Yuxuan Zhang, Katarína Tóthová, Zian Wang + 7 more2026-03-06💻 cs

Fine-grained Soundscape Control for Augmented Hearing

This paper introduces Aurchestra, a novel system for resource-constrained hearables that enables real-time, fine-grained control over up to five overlapping sound sources by combining a dynamic interface with an optimized on-device multi-output extraction network, effectively transforming the acoustic environment into a programmable mix.

Seunghyun Oh, Malek Itani, Aseem Gauri + 1 more2026-03-06💻 cs

Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

This paper demonstrates that interpreter persistence is a critical training-time semantic that significantly impacts agent efficiency and stability, revealing that misalignment between training data and deployment runtime causes substantial token waste or error rates despite achieving comparable solution quality.

Victor May, Aaditya Salgarkar, Yishan Wang + 2 more2026-03-06💻 cs

Learn Hard Problems During RL with Reference Guided Fine-tuning

This paper introduces Reference-Guided Fine-Tuning (ReGFT), a method that synthesizes model-aligned positive trajectories using partial human reference solutions to overcome reward sparsity and significantly enhance the performance and training efficiency of reinforcement learning for mathematical reasoning.

Yangzhen Wu, Shanda Li, Zixin Wen + 5 more2026-03-06💻 cs

VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling

This paper introduces VoxKnesset, a large-scale open-access dataset of 2,300 hours of longitudinal Hebrew parliamentary speech spanning 2009–2025, which is used to benchmark and demonstrate the challenges of speaker verification and age prediction over time, revealing significant performance degradation in standard models as speakers age.

Yanir Marmor, Arad Zulti, David Krongauz + 4 more2026-03-06💻 cs

MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interatomic Potentials

MatRIS is a novel, computationally efficient invariant machine learning interatomic potential that utilizes a linear-complexity separable attention mechanism for three-body interactions to achieve accuracy comparable to state-of-the-art equivariant models at a significantly lower training cost.

Yuanchang Zhou, Siyu Hu, Xiangyu Zhang + 3 more2026-03-06💻 cs

Conformal Graph Prediction with Z-Gromov Wasserstein Distances

This paper proposes a conformal prediction framework for graph-valued outputs that ensures distribution-free coverage guarantees by utilizing Z-Gromov-Wasserstein distances for nonconformity scoring and introducing Score Conformalized Quantile Regression (SCQR) to generate adaptive prediction sets.

Gabriel Melo, Thibaut de Saivre, Anna Calissano + 1 more2026-03-06💻 cs

IoUCert: Robustness Verification for Anchor-based Object Detectors

The paper introduces IoUCert, a novel formal verification framework that overcomes the challenges of non-linear coordinate transformations and IoU metrics to enable the first robustness verification of realistic, anchor-based object detection models like SSD and YOLO.

Benedikt Brückner, Alejandro J. Mercado, Yanghao Zhang, Panagiotis Kouvaros, Alessio Lomuscio2026-03-06🔒 cs.CR

Incremental Graph Construction Enables Robust Spectral Clustering of Texts

This paper introduces an incremental $k$ -NN graph construction method that guarantees connectivity by design, thereby enabling robust spectral clustering of text embeddings that outperforms standard approaches in low- $k$ regimes where disconnected components typically degrade performance.

Marko Pranjić, Boshko Koloski, Nada Lavrač + 2 more2026-03-06💻 cs

Inverse Reconstruction of Shock Time Series from Shock Response Spectrum Curves using Machine Learning

This paper proposes a conditional variational autoencoder (CVAE) that efficiently and accurately reconstructs shock acceleration time series from shock response spectrum curves, overcoming the computational limitations and basis function constraints of traditional iterative optimization methods.

Adam Watts, Andrew Jeon, Destry Newton + 1 more2026-03-06💻 cs

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis

AOI is a secure, trainable multi-agent framework that automates Site Reliability Engineering by leveraging Group Relative Policy Optimization and a read-write separated architecture to distill expert knowledge into local models and convert failed trajectories into corrective signals, achieving state-of-the-art performance on the AIOpsLab benchmark while ensuring data privacy and safe execution.

Pei Yang, Wanyi Chen, Asuka Yuxi Zheng + 11 more2026-03-06💻 cs

RADAR: Learning to Route with Asymmetry-aware DistAnce Representations

RADAR is a scalable neural framework that enhances vehicle routing problem solvers for asymmetric scenarios by leveraging Singular Value Decomposition to encode static distance asymmetry and Sinkhorn normalization to model dynamic interaction asymmetry, thereby achieving superior generalization and performance on both synthetic and real-world benchmarks.

Hang Yi, Ziwei Huang, Yining Ma + 1 more2026-03-06💻 cs

stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

The paper proposes Stratum, a unified system infrastructure that decouples pipeline execution from agent reasoning to efficiently scale agentic ML pipeline search by integrating with existing Python libraries and utilizing an optimized Rust-based runtime, achieving up to 16.6x speedup.

Arnab Phani, Elias Strauss, Sebastian Schelter2026-03-06💻 cs

Why Are Linear RNNs More Parallelizable?

This paper establishes a theoretical foundation for the superior parallelizability of linear RNNs by demonstrating that they correspond to log-depth arithmetic circuits ( $\mathsf{NC}^1$ -complete), whereas nonlinear RNNs are fundamentally limited by their ability to solve $\mathsf{L}$ - and $\mathsf{P}$ -complete problems, thereby explaining why linear variants can be efficiently parallelized like transformers while traditional nonlinear RNNs cannot.

William Merrill, Hongjian Jiang, Yanhong Li + 2 more2026-03-06💻 cs

DMD-augmented Unpaired Neural Schrödinger Bridge for Ultra-Low Field MRI Enhancement

This paper proposes a DMD-augmented Unpaired Neural Schrödinger Bridge framework that enhances Ultra-Low Field (64 mT) MRI image quality by leveraging diffusion-guided distribution matching and anatomical structure preservation to achieve superior realism and structural fidelity in translating unpaired 64 mT scans to 3 T quality.

Youngmin Kim, Jaeyun Shin, Jeongchan Kim + 5 more2026-03-06💻 cs

LoRA-MME: Multi-Model Ensemble of LoRA-Tuned Encoders for Code Comment Classification

LoRA-MME is a parameter-efficient multi-model ensemble that combines LoRA-tuned UniXcoder, CodeBERT, GraphCodeBERT, and CodeBERTa encoders to achieve strong code comment classification performance, though its high computational cost ultimately limited its final competition score.

Md Akib Haider, Ahsan Bulbul, Nafis Fuad Shahid + 2 more2026-03-06💻 cs

← Previous Next →