Scalable Contrastive Causal Discovery under Unknown Soft Interventions

This paper proposes a scalable, contrastive causal discovery model that leverages paired observational and single-regime soft interventional data to construct globally consistent causal structures, theoretically proving its ability to recover identifiable edges and outperform non-contrastive methods in both in-distribution and out-of-distribution scenarios.

Mingxuan Zhang, Khushi Desai, Sopho Kevlishvili + 1 more2026-03-05🤖 cs.LG

[Re] FairDICE: A Gap Between Theory And Practice

This replication study of FairDICE, a multi-objective offline reinforcement learning algorithm, reveals that while its theoretical claims hold, a critical code error initially reduced it to standard behavior cloning and underspecified hyperparameters hindered reproducibility, though corrected experiments demonstrate its potential to scale to complex environments despite a reliance on online tuning.

Peter Adema, Karim Galliamov, Aleksey Evstratovskiy + 1 more2026-03-05🤖 cs.LG

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

This paper demonstrates that a significant portion of transformer MLP nonlinearity is redundant and context-dependent, showing that a lightweight gating mechanism can dynamically replace these computations with linear surrogates to reduce computational waste or, when applied strategically with full retraining, actively improve model performance by eliminating harmful nonlinearities.

Peter Balogh2026-03-05🤖 cs.LG

Optimal trajectory-guided stochastic co-optimization for e-fuel system design and real-time operation

This paper introduces MasCOR, a machine-learning-assisted framework that overcomes the computational limitations of traditional mathematical programming to enable rapid, near-optimal co-optimization of e-fuel system design and real-time operation under renewable uncertainty, demonstrating site-specific strategies for cost-effective carbon-neutral methanol production across diverse European locations.

Jeongdong Kim, Minsu Kim, Jonggeol Na + 1 more2026-03-05🤖 cs.AI

When Small Variations Become Big Failures: Reliability Challenges in Compute-in-Memory Neural Accelerators

This paper addresses the critical reliability challenges in Compute-in-Memory neural accelerators caused by device non-idealities by demonstrating the disproportionate impact of small variations on safety-critical workloads and proposing cross-layer solutions, including a selective write-verify mechanism (SWIM) and noise-aware training, to ensure robust and efficient deployment.

Yifan Qin, Jiahao Zheng, Zheyu Yan + 3 more2026-03-05🤖 cs.LG

Quantifying Ranking Instability Across Evaluation Protocol Axes in Gene Regulatory Network Benchmarking

This paper introduces a diagnostic framework demonstrating that rankings of gene regulatory network inference methods exhibit significant instability across evaluation protocol axes, driven primarily by shifts in relative discrimination ability rather than base rate effects, thereby challenging the assumption of ranking invariance in current benchmarking practices.

Ihor Kendiukhov2026-03-05🤖 cs.LG

Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer for 200m Resolution Pan-Arctic Sea Ice Concentration Mapping and Uncertainty Estimation using Sentinel-1, RCM, and AMSR2 Data

This study proposes a novel Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer that fuses Sentinel-1, RCM, and AMSR2 data to generate 200m resolution pan-Arctic sea ice concentration maps with reliable uncertainty estimates, effectively overcoming challenges related to subtle feature extraction, inexact labels, and data heterogeneity.

Mabel Heffring, Lincoln Linlin Xu2026-03-05🤖 cs.LG

Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory

This paper introduces OrbEvo, an equivariant graph transformer model that efficiently predicts time-dependent electronic wavefunctions and related physical properties under external fields by learning to evolve orbital coefficients through autoregressive rollout, thereby overcoming the computational bottlenecks of conventional real-time time-dependent density functional theory.

Xuan Zhang, Haiyang Yu, Chengdong Wang + 3 more2026-03-05🔬 cond-mat.mtrl-sci