Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces

The paper introduces ATLAS, a reinforcement finetuning framework that enables small language models to effectively navigate large toolspaces by learning adaptive context acquisition and execution strategies, thereby achieving frontier-level performance with significantly reduced parameter and context budgets.

Karan Gupta, Pranav Vajreshwari, Yash Pandya, Raghav Magazine, Akshay Nambi, Ahmed Awadallah · 2026-03-10 · 🤖 cs.LG

From Statistical Fidelity to Clinical Consistency: Scalable Generation and Auditing of Synthetic Patient Trajectories

This paper presents an integrated pipeline that combines knowledge-grounded generative modeling with automated LLM-based auditing to produce clinically consistent, privacy-preserving synthetic patient trajectories, eliminating the clinical inconsistencies of existing methods while maintaining high statistical fidelity and downstream utility.

Guanglin Zhou, Armin Catic, Motahare Shabestari, Matthew Young, Chaiquan Li, Katrina Poppe, Sebastiano Barbieri · 2026-03-10 · 🤖 cs.LG

Regression Models Meet Foundation Models: A Hybrid-AI Approach to Practical Electricity Price Forecasting

This paper introduces FutureBoosting, a hybrid-AI framework that enhances electricity price forecasting by integrating forecasted features from a frozen time series foundation model into a regression model, thereby achieving significant accuracy improvements over state-of-the-art baselines while maintaining interpretability.

Yunzhong Qiu, Binzhu Li, Hao Wei, Shenglin Weng, Chen Wang, Zhongyi Pei, Mingsheng Long, Jianmin Wang · 2026-03-10 · 🤖 cs.LG
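FutureBoosting's actual architecture is not detailed in the summary, but the core hybrid idea, feeding a frozen forecaster's output into a regression model as an extra feature, can be sketched in a few lines. Everything below is illustrative: the "frozen foundation model" is replaced by simple 24-hour persistence, and the regressor is closed-form ridge regression on toy hourly prices.

```python
import numpy as np

def fit_ridge(X, y, lam=1e-3):
    # Closed-form ridge regression on the augmented feature matrix.
    XtX = X.T @ X + lam * np.eye(X.shape[1])
    return np.linalg.solve(XtX, X.T @ y)

# Toy hourly "price" series with a daily cycle plus noise.
rng = np.random.default_rng(0)
t = np.arange(24 * 30)
prices = 50 + 10 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 1.0, t.size)

# Stand-in for the frozen foundation model's forecast: 24h persistence.
fm_forecast = prices[:-24]
y = prices[24:]
hour = np.arange(24, prices.size) % 24

# Hybrid design: the regression sees calendar features AND the
# forecasted feature from the (stand-in) frozen model.
X = np.column_stack([np.ones(y.size),
                     np.sin(2 * np.pi * hour / 24),
                     np.cos(2 * np.pi * hour / 24),
                     fm_forecast])

split = y.size - 24                      # hold out the final day
w = fit_ridge(X[:split], y[:split])
mae = np.abs(X[split:] @ w - y[split:]).mean()
print(round(mae, 2))
```

Because the regressor stays a small linear model, its learned weights remain inspectable, which is the interpretability argument the summary makes.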

Safe Transformer: An Explicit Safety Bit For Interpretable And Controllable Alignment

The paper proposes Safe Transformer, a modular approach that inserts an explicit, interpretable safety bit into pre-trained language models to achieve controllable alignment and near-zero attack success rates through lightweight fine-tuning, addressing the opacity of traditional implicit safety methods.

Jingyuan Feng, Andrew Gambardella, Gouki Minegishi, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo · 2026-03-10 · 🤖 cs.LG
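The summary does not specify how the safety bit is wired into the model, but the general idea of an explicit, controllable bit gating generation can be shown with a toy decoder. This sketch is a hypothetical simplification: the bit gates an additive logit bias over a four-token vocabulary rather than a learned module inside a transformer.

```python
import numpy as np

VOCAB = ["hello", "help", "harm", "attack"]
UNSAFE = {2, 3}  # indices treated as unsafe in this toy example

def decode(logits, safety_bit):
    # The explicit safety bit gates an additive bias that removes
    # probability mass from unsafe tokens when the bit is set.
    bias = np.zeros_like(logits)
    if safety_bit:
        for i in UNSAFE:
            bias[i] = -1e9  # hard suppression in this sketch
    z = logits + bias
    probs = np.exp(z - z.max())
    probs /= probs.sum()
    return VOCAB[int(np.argmax(probs))]

logits = np.array([0.1, 0.2, 2.0, 1.5])  # base model prefers "harm"
print(decode(logits, safety_bit=0))  # → harm (base behavior)
print(decode(logits, safety_bit=1))  # → help (unsafe tokens masked)
```

The appeal of an explicit bit over implicit alignment is visible even here: the safety behavior is a single inspectable variable that can be toggled and audited, rather than a property diffused through the weights.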

Don't Freeze, Don't Crash: Extending the Safe Operating Range of Neural Navigation in Dense Crowds

This paper proposes a reinforcement learning approach for dense crowd navigation that achieves zero-shot generalization to higher crowd densities by combining density-invariant observation encoding, density-randomized training, and physics-informed proxemic reward shaping, thereby significantly outperforming existing learning-based and analytical methods in success rate and collision avoidance without freezing.

Jiefu Zhang, Yang Xu, Vaneet Aggarwal · 2026-03-10 · 🤖 cs.LG
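Of the three ingredients in the summary, density-invariant observation encoding is the most generic, and a common way to achieve it is a fixed-size nearest-k encoding. The paper's actual encoder is not described here; this is a minimal sketch of that standard scheme, with `k` and `r_max` as illustrative parameters.

```python
import numpy as np

def density_invariant_obs(agent_pos, crowd_pos, k=5, r_max=5.0):
    # Fixed-size encoding: keep only the k nearest pedestrians'
    # relative positions (padded with r_max when fewer are present),
    # so the observation dimension is independent of crowd density.
    rel = np.asarray(crowd_pos, float) - np.asarray(agent_pos, float)
    order = np.argsort(np.linalg.norm(rel, axis=1))[:k]
    obs = np.full((k, 2), r_max)          # padding for sparse scenes
    obs[:len(order)] = np.clip(rel[order], -r_max, r_max)
    return obs.ravel()

sparse = density_invariant_obs((0.0, 0.0), [(1.0, 0.0), (0.0, 2.0)])
dense = density_invariant_obs((0.0, 0.0), [(x, 1.0) for x in range(20)])
print(sparse.shape == dense.shape)  # same size in both regimes
```

A fixed observation size is what makes zero-shot transfer to higher densities possible at all: the policy network's input never changes shape, only which neighbors fill it.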

Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention

This paper proposes Rank-factorized Implicit Neural Bias (RIB), a novel positional bias mechanism that enables the use of hardware-efficient FlashAttention in Super-Resolution Transformers, allowing for significantly larger window sizes and training patches that achieve state-of-the-art performance (35.63 dB PSNR) while reducing training and inference times by 2.1× and 2.9×, respectively.

Dongheon Lee, Seokju Yun, Jaegyun Im, Youngmin Ro · 2026-03-10 · 🤖 cs.LG

Stabilizing Reinforcement Learning for Diffusion Language Models

This paper identifies that applying Group Relative Policy Optimization (GRPO) to diffusion language models causes reward collapse due to noisy importance ratio estimates and formulation mismatches, and proposes StableDRL, a reformulated algorithm featuring unconditional clipping and self-normalization to stabilize training and prevent policy drift.

Jianyuan Zhong, Kaibo Wang, Ding Ding, Zijin Feng, Haoli Bai, Yang Xiang, Jiacheng Sun, Qiang Xu · 2026-03-10 · 🤖 cs.LG
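The failure mode described above is concrete enough to illustrate numerically. Below, the standard GRPO surrogate (group-normalized advantages with one-sided PPO-style clipping) is contrasted with a hypothetical reading of "unconditional clipping": clamping the importance ratio itself in both directions before weighting, which bounds the influence of a noisy ratio regardless of the advantage's sign. StableDRL's exact formulation is not given in the summary, so treat the second function as a sketch of the idea, not the paper's algorithm.

```python
import numpy as np

def grpo_advantages(rewards):
    # Group-relative advantage: normalize rewards within the group.
    r = np.asarray(rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

def clipped_objective(logp_new, logp_old, adv, eps=0.2):
    # Standard PPO-style surrogate: the ratio is only clipped on the
    # side that would otherwise inflate the objective.
    ratio = np.exp(logp_new - logp_old)
    return np.minimum(ratio * adv,
                      np.clip(ratio, 1 - eps, 1 + eps) * adv).mean()

def unconditionally_clipped_objective(logp_new, logp_old, adv, eps=0.2):
    # Sketch of "unconditional clipping": clamp the ratio itself,
    # bounding noisy ratio estimates in both directions.
    ratio = np.clip(np.exp(logp_new - logp_old), 1 - eps, 1 + eps)
    return (ratio * adv).mean()

rewards = [1.0, 0.0, 0.5, 0.2]
adv = grpo_advantages(rewards)
logp_old = np.log(np.array([0.3, 0.2, 0.25, 0.25]))
# Sample 1 gets a noisy ratio of 2.0 paired with a negative advantage:
# under one-sided clipping its penalty enters the objective unclipped.
logp_new = np.log(np.array([0.5, 0.4, 0.2, 0.2]))

print(round(clipped_objective(logp_new, logp_old, adv), 3))
print(round(unconditionally_clipped_objective(logp_new, logp_old, adv), 3))
```

The example shows why noisy ratios destabilize training: a single large ratio on a negative-advantage sample drags the one-sided surrogate sharply negative, while the unconditionally clipped variant keeps its contribution bounded.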

Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment

This paper introduces ProtAlign, a multi-objective preference alignment framework that fine-tunes pretrained inverse folding models to simultaneously optimize diverse developability properties like solubility and thermostability while preserving structural designability, resulting in the enhanced MoMPNN model for practical protein sequence design.

Xiaoyang Hou, Junqi Liu, Chence Shi, Xin Liu, Zhi Yang, Jian Tang · 2026-03-10 · 🤖 cs.LG

Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks

This paper proposes a hybrid quantum-classical autoencoder and variational autoencoder framework utilizing Quantum Implicit Neural Representations (QINR) to achieve stable, high-quality image reconstruction and generation with enhanced diversity and sharp details compared to existing quantum generative models.

Saadet Müzehher Eren · 2026-03-10 · ⚛️ quant-ph

Learning Unbiased Cluster Descriptors for Interpretable Imbalanced Concept Drift Detection

This paper proposes ICD3, an interpretable and robust approach for detecting concept drift in imbalanced streaming data by employing multi-distribution-granular search to identify small concepts and training independent One-Cluster Classifiers for each, thereby overcoming the masking effect of dominant large clusters.

Yiqun Zhang, Zhanpei Huang, Mingjie Zhao, Chuyao Zhang, Yang Lu, Yuzhu Ji, Fangqing Gu, An Zeng · 2026-03-10 · 🤖 cs.LG
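The masking effect and the one-classifier-per-cluster remedy can be illustrated with a toy stand-in: ICD3's actual One-Cluster Classifiers and multi-granular search are not described in the summary, so the model below is simply a centroid-plus-radius acceptance region per cluster. A point that no cluster accepts is flagged as potential drift, so a small minority concept keeps its own boundary instead of being absorbed by the dominant cluster's.

```python
import numpy as np

class OneClusterClassifier:
    # Toy per-cluster one-class model: accept a point if it lies
    # within a learned radius of the cluster centroid.
    def fit(self, X):
        self.center = X.mean(axis=0)
        d = np.linalg.norm(X - self.center, axis=1)
        self.radius = d.mean() + 2 * d.std()
        return self

    def accepts(self, x):
        return np.linalg.norm(x - self.center) <= self.radius

rng = np.random.default_rng(1)
big = rng.normal([0, 0], 0.5, (500, 2))    # dominant cluster
small = rng.normal([4, 4], 0.2, (20, 2))   # small minority concept
models = [OneClusterClassifier().fit(c) for c in (big, small)]

def is_drift(x):
    # Flag potential drift only when NO cluster accepts the point;
    # the small concept is judged by its own boundary, not masked
    # by the dominant cluster's statistics.
    return not any(m.accepts(x) for m in models)

print(is_drift(np.array([0.1, 0.2])))    # inside the big cluster
print(is_drift(np.array([4.0, 4.0])))    # inside the small concept
print(is_drift(np.array([10.0, -9.0])))  # genuine novelty
```

Fitting one global model to the pooled data would instead center on the dominant cluster and either swallow the small concept or misflag it, which is exactly the masking behavior the paper targets.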