Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models
This paper presents a fully open-source, locally deployable pipeline using the Qwen2.5-72B model to accurately extract and link longitudinal tumor burden data from radiology reports in compliance with RECIST criteria, demonstrating that privacy-preserving open-source large language models can achieve clinically meaningful performance in oncology.