This section explores the intersection where physics meets data analysis, a rapidly evolving frontier where complex datasets reveal hidden patterns in the universe. From tracking particle collisions to modeling cosmic structures, these studies rely on advanced statistical methods to turn raw numbers into fundamental insights about how reality works.

Gist.Science monitors every new preprint in this category as it appears on arXiv, ensuring you never miss a breakthrough. We process each entry to provide both plain-language overviews for general understanding and detailed technical summaries for experts, bridging the gap between dense research and clear comprehension.

Below are the latest papers in physics and data analysis, organized for easy reading and discovery.

Follow the wobble: Statistical methods to detect astrometric binary asteroids in Gaia FPR

This paper details the statistical methods used to detect astrometric binary asteroids in Gaia FPR data, presenting an updated list of 343 candidates and demonstrating the method's reliability through performance evaluations that show significantly higher detection rates compared to noise-only simulations.

Luana Liberato, Paolo Tanga, David Mary, Raphael Lallemand, Ziu Liu, Benoit Carry, Josselin Desmars, Daniel Hestroffer, Kate Minker, Alexandros Siakas2026-05-22🔭 astro-ph

Lumina: An AI-Augmented Multiscale Material Informatics Framework for Extreme Aero-Chemo-Thermo-Mechanical Regimes

This paper introduces Lumina, a modular Python-based framework that unifies fragmented multiscale material data for extreme aero-chemo-thermo-mechanical regimes into a centralized, AI-augmented ecosystem to streamline experimental design, validate chemical behaviors, and enhance predictive modeling for advanced defense and aerospace applications.

Pradeep Kumar Seshadri, Vigneshwaran N, Sudaroli Dhananjeyan, Karthikeyan S, Navbila K, Sridhar S, Subhadevi K, Hari Sree Charan H, Abdul Azeez A, Jeswin Mickle, Harsha C2026-05-21🔬 physics

Requirements for Early Quantum Utility and Quantum Utility in the Capacitated Vehicle Routing Problem

This paper introduces a transparent, encoding-agnostic framework that uses resource counts and hardware benchmarks to demonstrate that achieving early quantum utility for the Capacitated Vehicle Routing Problem (CVRP) is currently unlikely on NISQ devices, revealing a massive qubit advantage for higher-order encodings over direct QUBO mappings while suggesting that innovative problem decomposition is essential for future quantum advantage.

Chinonso Onah, Kristel Michielsen2026-05-20🔬 physics.app-ph

Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

This paper analytically characterizes how different hidden unit activation functions in Restricted Boltzmann Machines influence the statistics of induced interactions and the ability to learn complex, higher-order data structures, demonstrating that rapidly increasing nonlinearities like the Exponential function can significantly facilitate the representation and learning of such patterns.

Giovanni di Sarra, Yasser Roudi2026-05-20🔬 cond-mat

GenL: An extensible fitting program for Laue oscillations and whole pattern fitting

GenL is a flexible, extensible, and open-source MATLAB-based program that utilizes a genetic algorithm to simulate and fit X-ray reflectivity and diffraction data from epitaxial thin films, offering both source code and pre-compiled binary options for extracting structural parameters like strain profiles and crystal roughness.

Anna L. Ravensburg, Johan Bylin, Vassilios Kapaklis, Gunnar K. Pálsson2026-05-19🔬 cond-mat.mtrl-sci

vega-mir: An information-theoretic Python toolkit for symbolic music, with applications to harmonic graphs and rubato spectra

This paper introduces *vega-mir*, an open-source Python toolkit for symbolic music analysis featuring nine information-theoretic metrics, and demonstrates its utility through case studies revealing a correlation between harmonic graph centrality and harmonic distance across composers, as well as evidence that Glenn Gould's rubato is characterized by structured periodicity rather than metronomic rigidity.

Fred Jalbert-Desforges2026-05-19🔬 physics

Neural simulation-based inference of the Higgs trilinear self-coupling via off-shell Higgs production

This paper proposes a hybrid neural simulation-based inference approach to constrain the Higgs trilinear self-coupling and other SMEFT operators using off-shell Higgs production at the High-Luminosity LHC, achieving near-theoretical-optimal sensitivity by combining matrix-element-enhanced training with classification-based background estimation.

Aishik Ghosh, Maximilian Griese, Ulrich Haisch, Tae Hyoun Park2026-05-18⚛️ hep-ex