A Trust-Region Interior-Point Stochastic Sequential Quadratic Programming Method

This paper proposes a trust-region interior-point stochastic sequential quadratic programming (TR-IP-SSQP) method that utilizes adaptive stochastic oracles to solve optimization problems with stochastic objectives and deterministic nonlinear constraints, proving its global almost-sure convergence to first-order stationary points and demonstrating practical performance on benchmark and logistic regression problems.

Yuchen Fang, Jihun Kim, Sen Na, James Demmel, Javad Lavaei2026-03-12🔢 math

Improving TabPFN's Synthetic Data Generation by Integrating Causal Structure

This paper addresses the limitation of TabPFN's autoregressive synthetic data generation, which produces spurious correlations when feature order conflicts with causal structure, by introducing DAG-aware and CPDAG-based conditioning strategies that significantly improve the fidelity, stability, and causal preservation of the generated synthetic tabular data.

Davide Tugnoli, Andrea De Lorenzo, Marco Virgolin, Giovanni Cinà2026-03-12🤖 cs.LG

Discovery of a Hematopoietic Manifold in scGPT Yields a Method for Extracting Performant Algorithms from Biological Foundation Model Internals

This paper introduces a novel three-stage mechanistic interpretability method that extracts a compact, high-performing hematopoietic algorithm directly from the internal attention weights of the scGPT foundation model, achieving superior zero-shot classification and pseudotime ordering on independent datasets with significantly fewer parameters and training time than standard probing or retraining approaches.

Ihor Kendiukhov2026-03-12🧬 q-bio

Robust Post-Training for Generative Recommenders: Why Exponential Reward-Weighted SFT Outperforms RLHF

This paper proposes and validates exponential reward-weighted SFT as a robust, fully offline post-training method for generative recommenders that eliminates reward hacking and propensity score requirements while offering theoretical guarantees and a controllable tradeoff between robustness and performance.

Keertana Chidambaram, Sanath Kumar Krishnamurthy, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya2026-03-12🤖 cs.LG

Quantum entanglement provides a competitive advantage in adversarial games

This study demonstrates that quantum entanglement serves as a functional resource in competitive reinforcement learning, enabling hybrid quantum-classical agents trained on the game Pong to consistently outperform separable quantum circuits and match or exceed classical baselines by learning structurally distinct features that better model dynamic agent interactions.

Peiyong Wang, Kieran Hymas, James Quach2026-03-12⚛️ quant-ph