cs.LG papers | Gist.Science

Provable Acceleration of Distributed Optimization with Local Updates

This paper rigorously proves that incorporating local updates into the distributed DIGing algorithm can provably accelerate convergence for a broad class of objective functions, demonstrating that two local updates are sufficient to achieve maximal improvement without requiring reduced step sizes.

Zuang Wang, Yongqiang Wang2026-03-11🤖 cs.LG

An AI-powered Bayesian Generative Modeling Approach for Arbitrary Conditional Inference

This paper introduces Bayesian Generative Modeling (BGM), a unified framework that leverages a stochastic iterative Bayesian updating algorithm to learn a single generative model capable of performing arbitrary conditional inference with principled uncertainty quantification, without requiring retraining for different conditioning structures.

Qiao Liu, Wing Hung Wong2026-03-11🤖 cs.AI

Automating Forecasting Question Generation and Resolution for AI Evaluation

This paper presents an automated system using LLM-powered web research agents to generate and resolve diverse, real-world forecasting questions at scale, demonstrating high-quality question creation and resolution rates that surpass human-curated platforms while effectively evaluating and improving AI forecasting performance.

Nikos I. Bosse, Peter Mühlbacher, Jack Wildman, Lawrence Phillips, Dan Schwarz2026-03-11🤖 cs.AI

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions

The paper introduces "Infusion," a framework that leverages scalable influence-function approximations to compute subtle perturbations in training data, demonstrating that modifying as little as 0.2% of a dataset can effectively and transferably shape model behavior across vision and language domains.

J Rosser, Robert Kirk, Edward Grefenstette, Jakob Foerster, Laura Ruis2026-03-11🤖 cs.AI

Robust Assortment Optimization from Observational Data

This paper proposes a robust, data-driven framework for assortment optimization that maximizes worst-case expected revenue under potential customer preference shifts, establishing computational tractability and deriving statistically optimal algorithms with tight sample complexity bounds to ensure reliable generalization.

Miao Lu, Yuxuan Han, Han Zhong, Zhengyuan Zhou, Jose Blanchet2026-03-11🤖 cs.LG

Bottleneck Transformer-Based Approach for Improved Automatic STOI Score Prediction

This paper proposes a novel bottleneck transformer architecture that integrates convolutional blocks for frame-level feature extraction and multi-head self-attention for information aggregation to achieve improved non-intrusive prediction of the Short-Time Objective Intelligibility (STOI) metric, outperforming state-of-the-art self-supervised learning models in both seen and unseen scenarios.

Amartyaveer, Murali Kadambi, Chandra Mohan Sharma, Anupam Mondal, Prasanta Kumar Ghosh2026-03-11🤖 cs.LG

B-DENSE: Branching For Dense Ensemble Network Supervision Efficiency

The paper proposes B-DENSE, a novel distillation framework that leverages multi-branch trajectory alignment to enforce dense intermediate supervision, thereby overcoming the structural information loss and discretization errors of existing methods to achieve superior image generation quality with reduced inference latency.

Cherish Puniani, Tushar Kumar, Arnav Bendre, Gaurav Kumar, Shree Singhi2026-03-11🤖 cs.AI

Missing-by-Design: Certifiable Modality Deletion for Revocable Multimodal Sentiment Analysis

The paper introduces Missing-by-Design (MBD), a unified framework for revocable multimodal sentiment analysis that combines structured representation learning with a certifiable parameter-modification pipeline to enable the machine-verifiable deletion of specific data modalities while maintaining predictive performance and privacy compliance.

Rong Fu, Ziming Wang, Chunlei Meng, Jiaxuan Lu, Jiekai Wu, Kangan Qian, Hao Zhang, Simon Fong2026-03-11🤖 cs.LG

Continual uncertainty learning

This paper proposes a curriculum-based continual learning framework that decomposes complex robust control problems with multiple uncertainties into sequential tasks, combining a model-based controller with deep reinforcement learning to achieve efficient, non-forgetting policy updates and successful sim-to-real transfer for automotive powertrain vibration control.

Heisei Yonezawa, Ansei Yonezawa, Itsuro Kajiwara2026-03-11🤖 cs.AI

Breaking the Factorization Barrier in Diffusion Language Models

The paper introduces Coupled Discrete Diffusion (CoDD), a hybrid framework that overcomes the "factorization barrier" in diffusion language models by replacing fully factorized outputs with a lightweight probabilistic inference layer, thereby enabling efficient parallel generation of coherent, high-quality text without the prohibitive costs of full joint modeling or reinforcement learning.

Ian Li, Zilei Shao, Benjie Wang, Rose Yu, Guy Van den Broeck, Anji Liu2026-03-11🤖 cs.AI

Detecting Transportation Mode Using Dense Smartphone GPS Trajectories and Transformer Models

This paper introduces SpeedTransformer, a novel Transformer-based model that utilizes only speed inputs from dense smartphone GPS trajectories to achieve superior accuracy and transferability in transportation mode detection compared to traditional deep learning approaches.

Yuandong Zhang, Othmane Echchabi, Tianshu Feng, Wenyi Zhang, Hsuai-Kai Liao, Charles Chang2026-03-11🤖 cs.LG

Non-Rectangular Average-Reward Robust MDPs: Optimal Policies and Their Transient Values

This paper establishes that history-dependent policies with sublinear expected regret are robust-optimal for non-rectangular average-reward robust MDPs without requiring rectangularity, and introduces a transient-value framework with an epoch-based policy that achieves constant-order finite-time performance by combining worst-case optimality with online learning.

Shengbo Wang, Nian Si2026-03-11🤖 cs.LG

DUEL: Exact Likelihood for Masked Diffusion via Deterministic Unmasking

The paper introduces DUEL, a framework that enables exact likelihood computation for masked diffusion models under the test-time distribution, revealing that their true performance significantly surpasses previous estimates and establishing a new standard for comparing and optimizing parallel text generation.

Gilad Turok, Chris De Sa, Volodymyr Kuleshov2026-03-11🤖 cs.LG

Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

This paper introduces \textsc{Gome}, a gradient-based MLE agent that outperforms traditional tree search methods on MLE-Bench by mapping diagnostic reasoning to gradient computation, demonstrating that as LLM reasoning capabilities improve, gradient-based optimization becomes increasingly superior to exhaustive enumeration.

Yifei Zhang, Xu Yang, Xiao Yang, Bowen Xian, Qizheng Li, Shikai Fang, Jingyuan Li, Jian Wang, Mingrui Xu, Weiqing Liu, Jiang Bian2026-03-11🤖 cs.AI

FinTexTS: Financial Text-Paired Time-Series Dataset via Semantic-Based and Multi-Level Pairing

The paper introduces FinTexTS, a large-scale financial text-paired time-series dataset constructed via a novel semantic-based and multi-level pairing framework that overcomes the limitations of simple keyword matching by leveraging LLMs to align news articles with stock prices across macro, sector, related company, and target-company levels, thereby significantly improving stock price forecasting performance.

Jaehoon Lee, Suhwan Park, Tae Yoon Lim, Seunghan Lee, Jun Seo, Dongwan Kang, Hwanil Choi, Minjae Kim, Sungdong Yoo, SoonYoung Lee, Yongjae Lee, Wonbin Ahn2026-03-11🤖 cs.AI

Unveiling the Potential of Quantization with MXFP4: Strategies for Quantization Error Reduction

This paper introduces two software-only techniques, Overflow-Aware Scaling (OAS) and Macro Block Scaling (MBS), that significantly reduce the accuracy gap between the hardware-efficient MXFP4 format and NVIDIA's NVFP4 standard in Large Language Models, achieving near-parity performance with minimal computational overhead.

Jatin Chhugani, Geonhwa Jeong, Bor-Yiing Su, Yunjie Pan, Hanmei Yang, Aayush Ankit, Jiecao Yu, Summer Deng, Yunqing Chen, Nadathur Satish, Changkyu Kim2026-03-11🤖 cs.AI

Equitable Multi-Task Learning for AI-RANs

This paper proposes the Online-Within-Online Fair Multi-Task Learning (OWO-FMTL) framework, which leverages a dual-loop mechanism with primal-dual updates to ensure long-term equitable inference performance for heterogeneous users in AI-RANs while maintaining low computational overhead.

Panayiotis Raptis, Fatih Aslan, George Iosifidis2026-03-11🤖 cs.LG

KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware

KernelCraft introduces the first benchmark evaluating agentic LLM systems that use feedback-driven workflows to automatically generate and optimize low-level kernels for emerging hardware with novel ISAs, demonstrating their ability to produce valid, high-performance code that rivals or exceeds traditional compiler baselines.

Jiayi Nie, Haoran Wu, Yao Lai, Zeyu Cao, Cheng Zhang, Binglei Lou, Erwei Wang, Jianyi Cheng, Timothy M. Jones, Robert Mullins, Rika Antonova, Yiren Zhao2026-03-11🤖 cs.LG

ALADIN: Accuracy-Latency-Aware Design-space Inference Analysis for Embedded AI Accelerators

This paper presents ALADIN, an accuracy-latency-aware framework that enables the pre-deployment evaluation of mixed-precision quantized neural networks on scratchpad-based embedded AI accelerators by transforming models into platform-aware representations to analyze trade-offs and bottlenecks without requiring physical hardware.

T. Baldi, D. Casini, A. Biondi2026-03-11🤖 cs.AI

Performance Analysis of Edge and In-Sensor AI Processors: A Comparative Review

This paper reviews the landscape of ultra-low-power edge and in-sensor AI processors and empirically benchmarks a segmentation model on GAP9, STM32N6, and Sony IMX500 platforms to demonstrate that while in-sensor processing offers superior energy-delay performance, different architectures provide distinct trade-offs between latency, energy efficiency, and power budgets.

Luigi Capogrosso, Pietro Bonazzi, Michele Magno2026-03-11🤖 cs.LG

← Previous Next →