This section explores the intersection where physics meets data analysis, a rapidly evolving frontier where complex datasets reveal hidden patterns in the universe. From tracking particle collisions to modeling cosmic structures, these studies rely on advanced statistical methods to turn raw numbers into fundamental insights about how reality works.

Gist.Science monitors every new preprint in this category as it appears on arXiv, ensuring you never miss a breakthrough. We process each entry to provide both plain-language overviews for general understanding and detailed technical summaries for experts, bridging the gap between dense research and clear comprehension.

Below are the latest papers in physics and data analysis, organized for easy reading and discovery.

End-to-End Large Portfolio Optimization for Variance Minimization with Neural Networks through Covariance Cleaning

This paper introduces a rotation-invariant, end-to-end neural network that jointly learns to transform historical returns and regularize covariance eigenvalues to produce a global minimum-variance portfolio, demonstrating superior out-of-sample performance, robust generalization across different market sizes, and resilience to transaction costs compared to state-of-the-art estimators.

Christian Bongiorno, Efstratios Manolakis, Rosario Nunzio Mantegna2026-04-22💰 q-fin

The Role of Deep Mesoscale Eddies in Ensemble Forecast Performance

This study demonstrates that accurately representing deep ocean features, particularly mesoscale eddies, in initial conditions is critical for improving ensemble forecast performance of surface fields in the Gulf of Mexico, thereby motivating the assimilation of deep observations to better constrain full-water-column circulation.

Justin Cooke, Kathleen Donohue, Clark D Rowley, Prasad G Thoppil, D Randolph Watts2026-04-21🔬 physics

Learn your entropy from informative data: an axiom ensuring the consistent identification of generalized entropies

This paper introduces an axiom stating that entropic parameters cannot be inferred from uniform distributions, which uniquely selects Rényi entropy among generalized families and enables the consistent, data-driven estimation of these parameters while ensuring that the maximized log-likelihood always equals the negative Shannon entropy.

Andrea Somazzi, Diego Garlaschelli2026-04-20📊 stat