AI predictions and the expansion of scientific frontiers: Evidence from structural biology

Leveraging the 2021 release of AlphaFold2 as a quasi-experiment, this study demonstrates that AI predictions can expand scientific frontiers by reversing the decline in research on novel proteins and redirecting collective attention toward understudied genes and targets, thereby challenging concerns that AI merely reinforces established scientific canons.

Sun, M., Choi, S., Yin, Y.2026-04-07💻 bioinformatics

Machine Learning-Enhanced Nanopore ITS Analysis: Evaluating CPU-GPU Pipelines for High-Accuracy Fungal Taxonomic Resolution

This study benchmarks CPU and GPU pipelines for Nanopore ITS fungal analysis, demonstrating that while GPU-accelerated workflows maximize species-level accuracy through error correction, optimized CPU-based machine learning approaches offer a viable, resource-efficient alternative for achieving high genus-level taxonomic resolution.

Albuja, D. S., Maldonado, P. S., Zambrano, P. E. + 2 more2026-04-07💻 bioinformatics

From Parametric Guessing to Graph-Grounded Answers: Building Reliable ChatGPT-like tools for Plant Science

This paper argues that while large language models fail to provide complete, source-attributed answers for plant science queries due to their parametric nature, a GraphRAG architecture leveraging structured, provenance-linked knowledge graphs offers a reproducible and reliable alternative for generating exhaustive, citation-backed results.

Itharajula, M., Lim, S. C., Mutwil, M.2026-04-06💻 bioinformatics

RAMBO: Resolving Amplicons in Mixed Samples for Accurate DNA Barcoding with Oxford Nanopore

The paper introduces RAMBO, a novel, reference-free pipeline that utilizes unsupervised clustering and consensus generation to accurately resolve mixed amplicon signals and distinguish highly similar DNA variants in Oxford Nanopore sequencing data, thereby enabling high-resolution DNA barcoding even in complex samples containing pseudogenes or contaminants.

Kolter, A., Hebert, P. D. N.2026-04-05💻 bioinformatics

Widespread data leakage inflates accuracy and corrupts biomarker discovery in cancer drug response prediction

This paper demonstrates that a widespread practice of applying supervised feature screening before cross-validation causes severe data leakage in cancer drug response prediction, systematically inflating reported accuracy and corrupting biomarker discovery by introducing statistical artifacts that mimic biological signals.

Asiaee, A., Strauch, J., Azinfar, L. + 4 more2026-04-05💻 bioinformatics

Transcriptomic Integration Reveals a Conserved Inflammatory--Proliferative Paradox in Acquired Resistance to Immune Checkpoint Blockade

By integrating transcriptomic data across four diverse tumor models, this study reveals a conserved "inflammatory-proliferative paradox" in acquired resistance to immune checkpoint blockade, where tumors simultaneously maintain interferon-gamma-driven inflammatory signatures and activate cell-cycle programs through a shared regulatory network, suggesting that resistance is an active, tumor-intrinsic adaptive process rather than a simple loss of immune recognition.

Lee, H., Yeo, H., Bak, I. + 2 more2026-04-05💻 bioinformatics

Interpretable Deep Learning-Based Multi-Omics Integrationfor Prognosis in Hepatocellular Carcinoma

This study presents an interpretable, attention-based deep learning framework that integrates mRNA, miRNA, and DNA methylation data to significantly improve prognostic accuracy for hepatocellular carcinoma patients compared to existing models, while identifying biologically relevant biomarkers and demonstrating robust performance on external validation cohorts.

Znabu, B. F., Atif, Z.2026-04-05💻 bioinformatics