Crowdsourcing the Frontier: Advancing Hybrid Physics-ML Climate Simulation via a $50,000 Kaggle Competition

This paper demonstrates that a $50,000 Kaggle competition successfully crowdsourced diverse machine learning architectures for subgrid parameterization, which, when coupled with a full-physics climate model, achieved reproducible online stability and state-of-the-art performance, marking a significant milestone in advancing hybrid physics-ML climate simulations.

Jerry Lin, Zeyuan Hu, Tom Beucler, Katherine Frields, Hannah Christensen, Walter Hannah, Helge Heuer, Peter Ukkonnen, Laura A. Mansfield, Tian Zheng, Liran Peng, Ritwik Gupta, Pierre Gentine, Yusef Al-Naher, Mingjiang Duan, Kyo Hattori, Weiliang Ji, Chunhan Li, Kippei Matsuda, Naoki Murakami, Shlomo Ron, Marec Serlin, Hongjian Song, Yuma Tanabe, Daisuke Yamamoto, Jianyao Zhou, Mike Pritchard · Tue, 10 Ma · cs.LG

ForamDeepSlice: A High-Accuracy Deep Learning Framework for Foraminifera Species Classification from 2D Micro-CT Slices

This study introduces ForamDeepSlice, a high-accuracy deep learning framework that combines an ensemble of ConvNeXt-Large and EfficientNetV2-Small models with a rigorous specimen-level split dataset to achieve 95.64% accuracy in classifying foraminifera species from 2D micro-CT slices, while also providing an interactive dashboard for real-time identification and 3D matching.

Abdelghafour Halimi, Ali Alibrahim, Didier Barradas-Bautista, Ronell Sicat, Abdulkader M. Afifi · Tue, 10 Ma · cs.LG
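The ensembling named in the summary — combining ConvNeXt-Large and EfficientNetV2-Small predictions — typically amounts to averaging per-class probabilities from the two backbones. A minimal sketch of that idea, with mock logits standing in for the two models' outputs and an assumed equal weighting (the paper's actual weighting scheme is not specified here):

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def ensemble_predict(logits_a, logits_b, weight_a=0.5):
    """Average class probabilities from two backbones and pick the
    argmax. `logits_a`/`logits_b` stand in for the ConvNeXt-Large and
    EfficientNetV2-Small branches."""
    probs = weight_a * softmax(logits_a) + (1.0 - weight_a) * softmax(logits_b)
    return probs.argmax(axis=-1), probs

# Two mock "models": they disagree mildly on sample 0, agree on sample 1;
# the averaged probabilities arbitrate.
la = np.array([[2.0, 0.1, 0.1], [0.1, 3.0, 0.1]])
lb = np.array([[0.1, 1.0, 0.1], [0.1, 2.5, 0.1]])
pred, probs = ensemble_predict(la, lb)
```

Probability-level averaging like this tends to be more robust than majority voting when the backbones produce calibrated confidences.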

Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts

This paper introduces DROCO, a novel dual-robust cross-domain offline reinforcement learning algorithm that addresses both train-time and test-time dynamics shifts by employing a robust cross-domain Bellman operator alongside dynamic value penalty and Huber loss to enhance policy robustness and prevent value estimation errors.

Zhongjian Qiao, Rui Yang, Jiafei Lyu, Xiu Li, Zhongxiang Dai, Zhuoran Yang, Siyang Gao, Shuang Qiu · Tue, 10 Ma · cs.LG
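Two of the robustification ingredients named in the summary — a value penalty on the Bellman target and Huber loss on TD errors — can be sketched in isolation. This is a simplified stand-in, not the authors' cross-domain Bellman operator; the per-sample `uncertainty` weighting is an assumption for illustration:

```python
import numpy as np

def huber(td_error, delta=1.0):
    """Huber loss: quadratic near zero, linear in the tails, so a few
    large (possibly cross-domain) TD errors cannot dominate the update."""
    a = np.abs(td_error)
    return np.where(a <= delta, 0.5 * a**2, delta * (a - 0.5 * delta))

def penalized_target(r, q_next, gamma=0.99, penalty=0.1, uncertainty=None):
    """Bellman target with the next-state value discounted by a penalty
    scaled by an (assumed) dynamics-uncertainty estimate, discouraging
    value overestimation on shifted transitions."""
    u = np.zeros_like(q_next) if uncertainty is None else uncertainty
    return r + gamma * (q_next - penalty * u)

# A 10x outlier TD error contributes linearly, not quadratically.
errors = np.array([0.5, 10.0])
loss = huber(errors)

target = penalized_target(r=1.0, q_next=np.array([5.0]),
                          uncertainty=np.array([2.0]))
```

With squared loss the outlier above would cost 50.0; under Huber it costs only 9.5, which is the robustness the summary refers to.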

Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning

The paper introduces GoRL, an algorithm-agnostic framework that stabilizes online reinforcement learning with expressive generative policies by decoupling optimization in a tractable latent space from action synthesis via a conditional generative decoder, achieving superior performance on challenging continuous-control tasks.

Chubin Zhang, Zhenglin Wan, Feng Chen, Fuchao Yang, Lang Feng, Yaxin Zhou, Xingrui Yu, Yang You, Ivor Tsang, Bo An · Tue, 10 Ma · cs.LG
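The decoupling described in the summary — run the RL update on a tractable latent variable, then synthesize actions with a conditional generative decoder — can be sketched as follows. The decoder here is a frozen random linear map, an assumed stand-in for a trained diffusion or flow-matching model; the point is only that the latent policy has a closed-form log-density while the action distribution need not:

```python
import numpy as np

rng = np.random.default_rng(0)

class LatentGaussianPolicy:
    """Gaussian policy over a latent z (tractable log-prob, usable by
    any standard RL algorithm) composed with a fixed conditional
    decoder that maps (obs, z) to an action."""

    def __init__(self, obs_dim, latent_dim, act_dim):
        self.mu_w = np.zeros((obs_dim, latent_dim))   # policy head (trainable)
        self.log_std = np.zeros(latent_dim)
        # Frozen decoder weights; in the paper's setting this would be
        # a pretrained generative model.
        self.dec_w = rng.normal(size=(obs_dim + latent_dim, act_dim)) * 0.1

    def sample_latent(self, obs):
        mu = obs @ self.mu_w
        z = mu + np.exp(self.log_std) * rng.normal(size=mu.shape)
        # Closed-form Gaussian log-density -- unavailable for the raw
        # action under a diffusion policy, which is the motivation for
        # optimizing in latent space.
        logp = -0.5 * np.sum(((z - mu) / np.exp(self.log_std)) ** 2
                             + 2 * self.log_std + np.log(2 * np.pi), axis=-1)
        return z, logp

    def decode(self, obs, z):
        return np.tanh(np.concatenate([obs, z], axis=-1) @ self.dec_w)

policy = LatentGaussianPolicy(obs_dim=4, latent_dim=2, act_dim=3)
obs = rng.normal(size=(5, 4))
z, logp = policy.sample_latent(obs)
act = policy.decode(obs, z)
```

Because gradients and log-probabilities live entirely in the latent space, the same scheme plugs into on- or off-policy algorithms without modifying the generative decoder.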

Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real

This paper presents a two-step generative data augmentation framework that combines rule-based mask warping with unpaired image-to-image translation to address the scarcity of masked face datasets, achieving performance improvements with minimal training data, while noting that, as a resource-constrained coursework project, it lacks downstream quantitative evaluation.

Yan Yang, George Bebis, Mircea Nicolescu · Tue, 10 Ma · cs.LG
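The first step of the pipeline, rule-based mask placement, can be sketched in a few lines. The landmark format `(nose_y, chin_y, left_x, right_x)` and the flat-color "mask" are assumptions for illustration; the paper's warping aligns a mask template to facial landmarks, and step 2 (unpaired image-to-image translation to make the pasted mask photorealistic) would be a GAN-style model and is not shown:

```python
import numpy as np

def paste_synthetic_mask(face, landmarks):
    """Paste a crude synthetic mask over the lower face, producing a
    'fake-masked' training image. `face` is an HxWx3 uint8 array;
    `landmarks` = (nose_y, chin_y, left_x, right_x)."""
    out = face.copy()
    nose_y, chin_y, left_x, right_x = landmarks
    out[nose_y:chin_y, left_x:right_x] = (200, 220, 255)  # pale-blue "surgical mask"
    return out

face = np.zeros((128, 128, 3), dtype=np.uint8)
masked = paste_synthetic_mask(face, landmarks=(70, 120, 30, 98))
```

The appeal of this two-step split is that the rule-based step guarantees correct mask geometry per face, while the translation step only has to fix appearance, a much easier unpaired learning problem.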

Latent Sculpting for Zero-Shot Generalization: A Manifold Learning Approach to Out-of-Distribution Anomaly Detection

The paper proposes "Latent Sculpting," a hierarchical two-stage architecture that combines a Transformer-based encoder with a Binary Latent Sculpting loss and a Masked Autoregressive Flow to enforce explicit geometric boundaries on benign data, achieving robust zero-shot generalization and high detection rates for out-of-distribution cyberattacks on the CIC-IDS-2017 benchmark.

Rajeeb Thapa Chhetri, Saurab Thapa, Avinash Kumar, Zhixiong Chen · Tue, 10 Ma · cs.LG
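The geometric intuition behind a binary "sculpting" loss — pull each latent coordinate of benign samples toward {0, 1}, so out-of-distribution inputs land in the unsculpted interior of the latent cube — can be sketched with a simple quadratic-well penalty. This is an assumed stand-in for the paper's Binary Latent Sculpting loss, not its exact form:

```python
import numpy as np

def binary_sculpting_loss(z):
    """Penalize distance to the nearest of {0, 1} per coordinate:
    benign latents trained under this loss cluster at hypercube
    corners, leaving mid-cube codes as an OOD signature."""
    return np.minimum(z**2, (z - 1.0)**2).mean()

z_benign = np.array([0.02, 0.97, 0.01, 0.99])   # near corners: tiny loss
z_ood    = np.array([0.50, 0.45, 0.55, 0.50])   # mid-cube: large loss
```

At inference the same quantity doubles as an anomaly score: codes far from every corner are flagged, which is what enables detection of attack classes never seen in training.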

Certifying the Right to Be Forgotten: Primal-Dual Optimization for Sample and Label Unlearning in Vertical Federated Learning

This paper proposes FedORA, a primal-dual optimization framework that enables efficient and theoretically certified sample and label unlearning in Vertical Federated Learning by introducing a novel uncertainty-promoting loss function and adaptive strategies to minimize computational overhead while preserving model utility.

Yu Jiang, Xindi Tong, Ziyao Liu, Xiaoxi Zhang, Kwok-Yan Lam, Chee Wei Tan · Tue, 10 Ma · cs.LG
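An "uncertainty-promoting" unlearning loss generally drives the model's predictions on to-be-forgotten samples toward the uniform distribution, erasing any usable signal about their labels. A minimal sketch of that idea as cross-entropy against a uniform target — an assumed stand-in for FedORA's actual loss and primal-dual machinery:

```python
import numpy as np

def uncertainty_promoting_loss(logits):
    """Cross-entropy between the model's softmax output and the uniform
    distribution over k classes: minimized (at log k) exactly when the
    model is maximally uncertain about the forgotten sample."""
    z = logits - logits.max(axis=-1, keepdims=True)
    log_probs = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    return -(log_probs.mean(axis=-1)).mean()  # = -(1/k) * sum_j log p_j

confident = np.array([[8.0, 0.0, 0.0]])   # still remembers the label: high loss
forgotten = np.array([[0.1, 0.0, 0.1]])   # near-uniform output: loss near log(3)
```

Minimizing this on the forget set while constraining drift on the retain set is the kind of trade-off a primal-dual formulation makes explicit, with the dual variables pricing the utility constraint.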