A unified foundational framework for knowledge injection and evaluation of Large Language Models in Combustion Science

This study introduces a unified, end-to-end framework for developing combustion-specialized Large Language Models, featuring a massive multimodal knowledge base, a rigorous evaluation benchmark, and a three-stage knowledge-injection pathway that demonstrates the necessity of moving beyond standard retrieval-augmented generation to structured knowledge graphs and continued pretraining to overcome performance ceilings caused by context contamination.

Zonglin Yang, Runze Mao, Tianhao Wu + 3 more2026-03-06💻 cs

Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

This study evaluates the impact of data augmentation and feature enhancement techniques on hate speech detection across traditional and transformer-based models, revealing that while the open-source gpt-oss-20b achieves the highest overall performance, augmentation strategies significantly boost traditional classifiers like Delta TF-IDF and that detection efficacy varies based on the interaction between dataset properties, model architecture, and enhancement methods.

Brian Jing Hong Nge, Stefan Su, Thanh Thi Nguyen + 3 more2026-03-06💻 cs