This section explores the intersection where physics meets data analysis, a rapidly evolving frontier where complex datasets reveal hidden patterns in the universe. From tracking particle collisions to modeling cosmic structures, these studies rely on advanced statistical methods to turn raw numbers into fundamental insights about how reality works.

Gist.Science monitors every new preprint in this category as it appears on arXiv, ensuring you never miss a breakthrough. We process each entry to provide both plain-language overviews for general understanding and detailed technical summaries for experts, bridging the gap between dense research and clear comprehension.

Below are the latest papers in physics and data analysis, organized for easy reading and discovery.

SpecTUS: Spectral Translator for Unknown Structures annotation from EI-MS spectra

SpecTUS is a deep neural network that performs *de novo* structural annotation of small molecules from low-resolution GC-EI-MS spectra, significantly outperforming traditional database search methods by achieving perfect structure reconstruction for 43% of test compounds with a single suggestion and surpassing hybrid search results in 76% of cases.

Adam Hájek, Michal Starý, Elliott Price, Filip Jozefov, Helge Hecht, Aleš Křenek2026-02-23🤖 cs.LG

Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows

This paper introduces a novel amortized inference technique using likelihood-weighted Normalizing Flows that overcomes the limitations of standard unimodal base distributions in capturing multi-modal posteriors by initializing the flow with a Gaussian Mixture Model, thereby enabling efficient and accurate parameter estimation in high-dimensional inverse problems without requiring posterior training samples.

Rajneil Baruah2026-02-23⚛️ hep-ex

A Practical Guide to Unbinned Unfolding

This paper provides practical recommendations and considerations from researchers across major particle physics experiments on adopting emerging machine learning-based unbinned unfolding techniques to replace traditional binned histogram methods for more flexible, high-dimensional data analysis.

Florencia Canelli, Kyle Cormier, Andrew Cudd, Dag Gillberg, Roger G. Huang, Weijie Jin, Sookhyun Lee, Vinicius Mikuni, Laura Miller, Benjamin Nachman, Jingjing Pan, Tanmay Pani, Mariel Pettee, Youqi S (…)2026-02-20⚛️ hep-ex