cs.LG papers | Gist.Science

Koopman Regularized Deep Speech Disentanglement for Speaker Verification

This paper introduces the Deep Koopman Speech Disentanglement Autoencoder (DKSD-AE), a scalable and efficient architecture that leverages Koopman operators and instance normalization to effectively disentangle speaker identity from linguistic content for robust speaker verification without relying on textual supervision or large pretrained models.

Nikos Chazaridis, Mohammad Belal, Rafael Mestre, Timothy J. Norman, Christine Evers2026-03-09🤖 cs.LG

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

This paper proposes a novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework that integrates railway-specific heuristics with Q-learning to efficiently solve complex railcar shunting problems involving both one-sided and two-sided classification tracks by decomposing multi-locomotive tasks into manageable subproblems.

Ruonan Zhao, Joseph Geunes2026-03-09🤖 cs.LG

Spatiotemporal Heterogeneity of AI-Driven Traffic Flow Patterns and Land Use Interaction: A GeoAI-Based Analysis of Multimodal Urban Mobility

This study proposes and validates a GeoAI hybrid framework integrating MGWR, Random Forest, and ST-GCN to effectively model the spatiotemporal heterogeneity of multimodal traffic flows and their interaction with land use, demonstrating superior predictive accuracy and revealing distinct urban traffic typologies that underscore the critical role of local morphological context in mobility planning.

Olaf Yunus Laitinen Imanov2026-03-09🤖 cs.AI

Bias In, Bias Out? Finding Unbiased Subnetworks in Vanilla Models

This paper introduces Bias-Invariant Subnetwork Extraction (BISE), a method that identifies and isolates fair, bias-agnostic subnetworks within standard pre-trained models through pruning, enabling effective bias mitigation without retraining or additional unbiased data.

Ivan Luiz De Moura Matos, Abdel Djalil Sad Saoud, Ekaterina Iakovleva, Vito Paolo Pastore, Enzo Tartaglione2026-03-09🤖 cs.LG

On the Value of Tokeniser Pretraining in Physics Foundation Models

This paper demonstrates that pretraining tokenizers with an autoencoding objective before training dynamics models significantly enhances the computational efficiency and accuracy of physics foundation models, particularly when the pretraining data aligns with the downstream physical system.

Hadi Sotoudeh, Payel Mukhopadhyay, Ruben Ohana, Michael McCabe, Neil D. Lawrence, Shirley Ho, Miles Cranmer2026-03-09🔭 astro-ph

From Decoupled to Coupled: Robustness Verification for Learning-based Keypoint Detection with Joint Specifications

This paper introduces the first coupled robustness verification framework for heatmap-based keypoint detectors that uses a mixed-integer linear program to jointly bound deviations across all keypoints, thereby providing sound and less conservative guarantees than prior decoupled methods.

Xusheng Luo, Changliu Liu2026-03-09🤖 cs.LG

Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior

This paper introduces behavior-decomposed linear dynamical systems (b-dLDS), a novel modeling approach that disentangles behavior-related neural dynamics from internal computations in large-scale brain recordings, demonstrating superior performance over existing supervised models and successfully scaling to tens of thousands of neurons in zebrafish hindbrain data.

Eva Yezerets, En Yang, Misha B. Ahrens, Adam S. Charles2026-03-09🤖 cs.LG

RACAS: Controlling Diverse Robots With a Single Agentic System

The paper introduces RACAS, a robot-agnostic agentic system that uses natural language communication between LLM/VLM-based modules to control diverse robotic platforms without requiring code modifications or retraining, successfully demonstrating its effectiveness across wheeled, multi-jointed, and underwater robots.

Dylan R. Ashley, Jan Przepióra, Yimeng Chen, Ali Abualsaud, Nurzhan Yesmagambet, Shinkyu Park, Eric Feron, Jürgen Schmidhuber2026-03-09🤖 cs.AI

Identifying Adversary Characteristics from an Observed Attack

This paper proposes a domain-agnostic framework to identify the most probable characteristics of an attacker from an observed data-manipulation attack, demonstrating that such identification enables more effective exogenous mitigation and improves the performance of learning-based defenses.

Soyon Choi, Scott Alfeld, Meiyi Ma2026-03-09🤖 cs.LG

Making Reconstruction FID Predictive of Diffusion Generation FID

This paper introduces interpolated FID (iFID), a novel metric that achieves a strong correlation with diffusion generation FID by interpolating latent representations between dataset samples and their nearest neighbors, thereby overcoming the limitations of traditional reconstruction FID in predicting generative model quality.

Tongda Xu, Mingwei He, Shady Abu-Hussein, Jose Miguel Hernandez-Lobato, Haotian Zhang, Kai Zhao, Chao Zhou, Ya-Qin Zhang, Yan Wang2026-03-09🤖 cs.LG

When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On

This paper introduces Implicit Error Counting (IEC), a reference-free reinforcement learning post-training method that enumerates and weights errors to generate rewards, demonstrating superior performance over Rubrics as Rewards (RaR) in virtual try-on tasks where multiple valid outputs exist and ideal reference answers are unavailable.

Wisdom Ikezogwo, Mehmet Saygin Seyfioglu, Ranjay Krishna, Karim Bouyarmane2026-03-09🤖 cs.AI

The Value of Graph-based Encoding in NBA Salary Prediction

This paper demonstrates that integrating graph-based embeddings of on-court and off-court player data into tabular datasets significantly improves the accuracy of supervised machine learning models for predicting NBA player salaries, particularly for veterans and high-earning outliers where traditional methods fail.

Junhao Su, David Grimsman, Christopher Archibald2026-03-09🤖 cs.LG

Reinforcement Learning for Power-Flow Network Analysis

This paper proposes a reinforcement learning framework with a probabilistic reward function and a Gaussian baseline to discover power-flow network configurations that yield a significantly higher number of equilibrium points than current computational algebra methods can identify.

Alperen Ergur, Julia Lindberg, Vinny Miller2026-03-09🤖 cs.LG

Improved Scaling Laws via Weak-to-Strong Generalization in Random Feature Ridge Regression

This paper demonstrates that in random feature ridge regression, a strong student model trained on imperfect labels from a weak teacher can achieve substantially improved scaling laws and even reach minimax optimal rates, regardless of whether the teacher's own test error decays with sample size.

Diyuan Wu, Lehan Chen, Theodor Misiakiewicz, Marco Mondelli2026-03-09🤖 cs.LG

Parallelization Strategies for Dense LLM Deployment: Navigating Through Application-Specific Tradeoffs and Bottlenecks

This paper investigates parallelization strategies for deploying dense LLMs, demonstrating that while Tensor Parallelism optimizes latency and Pipeline Parallelism enhances throughput, a hybrid approach allows for effective control over the inherent latency-throughput tradeoff to meet specific application requirements.

Burak Topcu, Musa Oguzhan Cim, Poovaiah Palangappa, Meena Arunachalam, Mahmut Taylan Kandemir2026-03-09🤖 cs.LG

Warm Starting State-Space Models with Automata Learning

This paper establishes a formal correspondence between Moore machines and state-space models to demonstrate that initializing continuous SSMs with symbolically learned automata significantly improves training efficiency and accuracy compared to random initialization, thereby effectively leveraging symbolic inductive bias for learning complex systems.

William Fishell, Sam Nicholas Kouteili, Mark Santolucito2026-03-09🤖 cs.LG

Random Dot Product Graphs as Dynamical Systems: Limitations and Opportunities

This paper establishes a geometric framework using principal fiber bundles to identify fundamental obstructions in learning differential equations from temporal Random Dot Product Graphs, characterizing the interplay between gauge ambiguity, spectral gaps, and holonomy while demonstrating that symmetric dynamics can resolve gauge issues to enable vector field recovery.

Giulio Valentino Dalla Riva2026-03-09🤖 cs.LG

The Rise of AI in Weather and Climate Information and its Impact on Global Inequality

This paper argues that while AI promises to revolutionize climate information, its current reliance on Global North-dominated infrastructure and biased data risks exacerbating global inequality, necessitating a shift toward data-centric development, shared digital public infrastructure, and co-produced knowledge to ensure equitable outcomes.

Amirpasha Mozaffari, Amanda Duarte, Lina Teckentrup, Stefano Materia, Gina E. C. Charnley, Lluis Palma, Eulalia Baulenas Serra, Dragana Bojovic, Paula Checchia, Aude Carreric, Francisco Doblas-Reyes2026-03-09🤖 cs.AI

Unsupervised domain adaptation for radioisotope identification in gamma spectroscopy

This paper demonstrates that unsupervised domain adaptation, specifically through minimizing maximum mean discrepancy (MMD) between synthetic and unlabeled real-world data, significantly improves the generalization and testing accuracy of machine learning models for radioisotope identification in gamma spectroscopy.

Peter Lalor, Ayush Panigrahy, Alex Hagen2026-03-09🤖 cs.LG

Revisiting the (Sub)Optimality of Best-of-N for Inference-Time Alignment

This paper challenges prior claims of Best-of-N's suboptimality by demonstrating that, under practical assumptions and when evaluated via win-rate rather than expected reward, properly tuned Best-of-N is both statistically and computationally optimal, while also proposing a simple variant that eliminates reward hacking without sacrificing performance.

Ved Sriraman, Adam Block2026-03-09🤖 cs.AI

← Previous Next →