cs.LG papers | Gist.Science

Reasoned Safety Alignment: Ensuring Jailbreak Defense via Answer-Then-Check

This paper introduces "Answer-Then-Check," a novel safety alignment method that enhances LLM robustness against jailbreak attacks by training models to generate direct answers internally and then critically evaluate their safety before responding, achieving superior protection with reduced over-refusal while maintaining general reasoning capabilities through the newly constructed 80K-sample ReSA dataset.

Chentao Cao, Xiaojun Xu, Bo Han, Hang Li2026-03-09🤖 cs.AI

VEGA: Electric Vehicle Navigation Agent via Physics-Informed Neural Operator and Proximal Policy Optimization

VEGA is an electric vehicle navigation system that combines a physics-informed neural operator for real-time vehicle parameter estimation with a Proximal Policy Optimization agent for efficient, charge-aware route and charging stop planning, demonstrating superior inference speed and generalization across international road networks compared to traditional energy-aware baselines.

Hansol Lim, Minhyeok Im, Jonathan Boyack, Jee Won Lee, Jongseong Brad Choi2026-03-09🤖 cs.LG

Spectral/Spatial Tensor Atomic Cluster Expansion with Universal Embeddings in Cartesian Space

This paper introduces the Tensor Atomic Cluster Expansion (TACE), a unified atomistic machine learning framework that employs irreducible Cartesian tensors to efficiently model both scalar and tensorial observables without complex angular-momentum coupling, demonstrating robust accuracy and scalability across diverse chemical systems and tasks.

Zemin Xu, Wenbo Xie, P. Hu2026-03-09🔬 cond-mat.mtrl-sci

C^2Prompt: Class-aware Client Knowledge Interaction for Federated Continual Learning

This paper proposes C²Prompt, a novel federated continual learning method that mitigates temporal and spatial forgetting by introducing a local class distribution compensation mechanism and a class-aware prompt aggregation scheme to enhance class-wise knowledge coherence across distributed clients.

Kunlun Xu, Yibo Feng, Jiangmeng Li, Yongsheng Qi, Jiahuan Zhou2026-03-09🤖 cs.LG

Auto-Regressive U-Net for Full-Field Prediction of Shrinkage-Induced Damage in Concrete

This paper proposes a computationally efficient dual-network architecture combining an auto-regressive U-Net and a CNN to predict time-dependent full-field damage evolution and key mechanical properties in concrete, thereby enabling insights into aggregate effects and optimizing mix designs for improved durability.

Liya Gaynutdinova, Petr Havlásek, Ondřej Rokoš, Fleur Hendriks, Martin Doškář2026-03-09🤖 cs.LG

Taxonomy-aware Dynamic Motion Generation on Hyperbolic Manifolds

This paper introduces GPHDM, a novel framework that extends Gaussian Process Dynamical Models to hyperbolic manifolds to generate physically consistent, human-like robot motions by preserving the hierarchical taxonomy and temporal dynamics of movement.

Luis Augenstein, Noémie Jaquier, Tamim Asfour, Leonel Rozo2026-03-09🤖 cs.LG

Planner Aware Path Learning in Diffusion Language Models Training

This paper addresses the training-inference mismatch in diffusion language models caused by planner-based sampling strategies by deriving a new Planned Evidence Lower Bound (P-ELBO) and introducing Planner Aware Path Learning (PAPL), a simple training modification that aligns training with planned inference to achieve significant performance gains across protein, text, and code generation tasks.

Fred Zhangzhi Peng, Zachary Bezemek, Jarrid Rector-Brooks, Shuibai Zhang, Anru R. Zhang, Michael Bronstein, Alexander Tong, Avishek Joey Bose2026-03-09🤖 cs.LG

Diffusion Alignment as Variational Expectation-Maximization

The paper introduces Diffusion Alignment as Variational Expectation-Maximization (DAV), an iterative framework that alternates between test-time search for diverse, reward-aligned samples and model refinement to optimize diffusion models for downstream objectives while mitigating reward over-optimization and mode collapse.

Jaewoo Lee, Minsu Kim, Sanghyeok Choi, Inhyuck Song, Sujin Yun, Hyeongyu Kang, Woocheol Shin, Taeyoung Yun, Kiyoung Om, Jinkyoo Park2026-03-09🤖 cs.LG

Online Minimization of Polarization and Disagreement via Low-Rank Matrix Bandits

This paper addresses the online minimization of polarization and disagreement in the Friedkin-Johnsen opinion dynamics model under incomplete information by proposing a two-stage low-rank matrix bandit algorithm that achieves a cumulative regret of $\widetilde{\mathcal{O}}\big(\max(\tfrac{1}{\kappa},\sqrt{|V|})\sqrt{|V|T}\big)$ through subspace estimation and linear bandit optimization.

Federico Cinus, Yuko Kuroki, Atsushi Miyauchi, Francesco Bonchi2026-03-09🤖 cs.LG

Self-Speculative Masked Diffusions

This paper introduces Self-Speculative Masked Diffusions, a novel generative model for discrete data that leverages causal attention and integrated speculative sampling to produce non-factorized predictions, thereby achieving a roughly 2x reduction in neural network forward passes compared to standard masked diffusion models.

Andrew Campbell, Valentin De Bortoli, Jiaxin Shi, Arnaud Doucet2026-03-09🤖 cs.LG

TCR-EML: Explainable Model Layers for TCR-pMHC Prediction

This paper proposes TCR-EML, an explainable-by-design model that integrates prototype layers based on known biochemical binding mechanisms into protein-language model backbones to achieve competitive TCR-pMHC binding prediction accuracy while providing interpretable rationales superior to existing black-box approaches.

Jiarui Li, Zixiang Yin, Zhengming Ding, Samuel J. Landry, Ramgopal R. Mettu2026-03-09🤖 cs.LG

Decoding Partial Differential Equations: Cross-Modal Adaptation of Decoder-only Models to PDEs

This paper demonstrates that while standard decoder-only models underperform compared to encoder-only architectures in cross-modal adaptation for partial differential equations, introducing novel bidirectionality-mimicking techniques like Parallel Flipping and Sequence Doubling effectively closes this performance gap.

Paloma García-de-Herreros, Philipp Slusallek, Dietrich Klakow, Vagrant Gautam2026-03-09🤖 cs.LG

How Reliable is Language Model Micro-Benchmarking?

This paper challenges the reliability of language model micro-benchmarks by demonstrating that they often fail to consistently rank models with small performance differences, frequently requiring as many as 250 examples to achieve accuracy comparable to random sampling, thereby offering actionable guidance on the trade-off between evaluation efficiency and reliability.

Gregory Yauney, Shahzaib Saqib Warraich, Swabha Swayamdipta2026-03-09🤖 cs.LG

CanvasMAR: Improving Masked Autoregressive Video Prediction With Canvas

CanvasMAR enhances masked autoregressive video prediction by introducing a global "canvas" prior and a motion-aware curriculum to generate high-fidelity, coherent videos with fewer sampling steps, achieving performance that rivals advanced diffusion-based methods.

Zian Li, Muhan Zhang2026-03-09🤖 cs.AI

Escaping Model Collapse via Synthetic Data Verification: Near-term Improvements and Long-term Convergence

This paper demonstrates that injecting external verification into synthetic data retraining can prevent model collapse and yield near-term improvements, though theoretical analysis and experiments across linear regression, VAEs, and LLMs show that long-term performance ultimately converges to the verifier's knowledge center and may plateau or decline if the verifier is imperfect.

Bingji Yi, Qiyuan Liu, Yuwei Cheng, Haifeng Xu2026-03-09🤖 cs.LG

Mixed Monotonicity Reachability Analysis of Neural ODE: A Trade-Off Between Tightness and Efficiency

This paper proposes a novel, computationally efficient interval-based reachability analysis method for Neural ODEs that leverages continuous-time mixed monotonicity to trade off tightness for scalability, making it particularly suitable for high-dimensional and safety-critical real-time applications.

Abdelrahman Sayed Sayed, Pierre-Jean Meyer, Mohamed Ghazel2026-03-09🤖 cs.LG

Real-Time Learning of Predictive Dynamic Obstacle Models for Robotic Motion Planning

This paper presents a real-time online framework that utilizes modified sliding-window Hankel Dynamic Mode Decomposition with singular-value hard thresholding and Cadzow projection to denoise partial measurements and construct predictive models for dynamic obstacle motion, enabling stable, variance-aware forecasting suitable for robotic motion planning.

Stella Kombo, Masih Haseli, Skylar X. Wei, Joel W. Burdick2026-03-09🤖 cs.LG

KLASS: KL-Guided Fast Inference in Masked Diffusion Models

The paper introduces KLASS, a training-free, KL-divergence-guided sampling method that significantly accelerates inference in masked diffusion models by unmasking multiple stable tokens per iteration, achieving state-of-the-art speed and performance across text, image, and molecular generation tasks.

Seo Hyun Kim, Sunwoo Hong, Hojung Jung, Youngrok Park, Se-Young Yun2026-03-09🤖 cs.LG

CADM: Cluster-customized Adaptive Distance Metric for Categorical Data Clustering

The paper proposes CADM, a cluster-customized adaptive distance metric that dynamically adjusts distance measurements based on the unique attribute distributions within each cluster to improve categorical and mixed data clustering performance.

Taixi Chen, Yiu-ming Cheung, Yiqun Zhang2026-03-09🤖 cs.LG

FireScope: Wildfire Risk Prediction with a Chain-of-Thought Oracle

The paper introduces FireScope, a novel VLM-based framework and accompanying FireScope-Bench dataset that leverage chain-of-thought reasoning to significantly improve the generalization, interpretability, and accuracy of cross-continental wildfire risk prediction by integrating visual, climatic, and geographic factors.

Mario Markov (INSAIT, Sofia University "St. Kliment Ohridski"), Stefan Maria Ailuro (INSAIT, Sofia University "St. Kliment Ohridski"), Luc Van Gool (INSAIT, Sofia University "St. Kliment Ohridski"), Konrad Schindler (ETH Zurich), Danda Pani Paudel (INSAIT, Sofia University "St. Kliment Ohridski")2026-03-09🤖 cs.LG

← Previous Next →