FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling

FlashPrefill is a novel framework that achieves ultra-fast long-context prefilling by combining instantaneous block-searching for dynamic sparse patterns with a thresholding mechanism to eliminate long-tail attention scores, delivering up to a 27.78x speedup on 256K sequences while maintaining efficiency on shorter contexts.

Qihang Fan, Huaibo Huang, Zhiying Wu, Juqiu Wang, Bingning Wang, Ran He2026-03-09🤖 cs.AI

SPOT: Span-level Pause-of-Thought for Efficient and Interpretable Latent Reasoning in Large Language Models

The paper proposes SPOT, a framework that enhances the efficiency and interpretability of large language model reasoning by compressing explicit Chain-of-Thought into latent pause tokens through span-level semantic alignment and a frozen-head decoding constraint, achieving higher accuracy with significantly fewer generated tokens.

Yunlong Chu, Minglai Shao, Yuhang Liu, Bing Hao, Yumeng Lin, Jialu Wang, Ruijie Wang2026-03-09💬 cs.CL

Mind the Gap: Pitfalls of LLM Alignment with Asian Public Opinion

This paper presents a multilingual audit revealing that while contemporary large language models generally align with public opinion on broad social issues across Asian regions, they consistently fail to accurately represent diverse religious viewpoints—particularly those of minority groups—and often amplify negative stereotypes, a problem that persists despite lightweight prompting interventions and remains undetected by standard bias benchmarks.

Hari Shankar, Vedanta S P, Sriharini Margapuri, Debjani Mazumder, Ponnurangam Kumaraguru, Abhijnan Chakraborty2026-03-09💬 cs.CL

The EpisTwin: A Knowledge Graph-Grounded Neuro-Symbolic Architecture for Personal AI

The paper introduces EpisTwin, a neuro-symbolic framework that overcomes data fragmentation in Personal AI by grounding generative reasoning in a verifiable Personal Knowledge Graph derived from multimodal data, utilizing an agentic coordinator for complex reasoning and visual refinement, and validated by a new synthetic benchmark called PersonalQA-71-100.

Giovanni Servedio, Potito Aghilar, Alessio Mattiace, Gianni Carmosino, Francesco Musicco, Gabriele Conte, Vito Walter Anelli, Tommaso Di Noia, Francesco Maria Donini2026-03-09🤖 cs.AI

Continual Adaptation for Pacific Indigenous Speech Recognition

This paper presents an empirical study on adapting speech foundation models to low-resource Pacific Indigenous languages, revealing that while strategies like Low-Rank Adaptation offer initial success, they ultimately struggle with catastrophic forgetting and internal representational drift during sequential learning, highlighting the urgent need for robust adaptation frameworks that balance plasticity and stability.

Yang Xiao, Aso Mahmudi, Nick Thieberger, Eliathamby Ambikairajah, Eun-Jung Holden, Ting Dang2026-03-09💬 cs.CL

The Art That Poses Back: Assessing AI Pastiches after Contemporary Artworks

This study evaluates ChatGPT's ability to generate AI pastiches of contemporary artworks by combining human feedback from twelve international artists with computational analysis, revealing a significant gap between superficial visual similarities and the lack of conceptual depth, dimensionality, and emotional resonance in the AI-generated results, thereby advocating for a multi-metric "style transfer dashboard" for more comprehensive evaluation.

Anca Dinu, Andreiana Mihail, Andra-Maria Florescu, Claudiu Creanga2026-03-09💬 cs.CL

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement

The paper introduces SAHOO, a practical framework that employs a Goal Drift Index, constraint preservation checks, and regression-risk quantification to effectively monitor and control alignment drift while significantly improving performance in recursive self-improving systems across code, reasoning, and truthfulness tasks.

Subramanyam Sahoo, Aman Chadha, Vinija Jain, Divya Chaudhary2026-03-09🤖 cs.AI

Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

This paper proposes a transparent AI framework for Mathematical Entity Relation Extraction (MERE) that utilizes BERT to achieve 99.39% accuracy in identifying mathematical relationships and incorporates SHAP-based explainability to enhance model trust and interpretability for applications in automated problem solving and education.

Tanjim Taharat Aurpa2026-03-09💬 cs.CL

Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason's Selection Task

This study introduces a new deontic-focused Wason Selection Task dataset to demonstrate that large language models, like humans, exhibit superior reasoning with deontic rules and display matching-bias-like error patterns, highlighting the domain-specific nature of their reasoning capabilities.

Hirohiko Abe, Kentaro Ozeki, Risako Ando, Takanobu Morishita, Koji Mineshima, Mitsuhiro Okada2026-03-09💬 cs.CL

From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring

This paper presents a comprehensive comparative study of four major LLM-based paradigms for Automated Essay Scoring on IELTS Writing Task 2, revealing that a configuration combining k-SFT and RAG achieves the strongest overall performance with a 93% F1-Score while highlighting critical accuracy-cost-robustness trade-offs.

Minh Hoang Nguyen, Vu Hoang Pham, Xuan Thanh Huynh, Phuc Hong Mai, Vinh The Nguyen, Quang Nhut Huynh, Huy Tien Nguyen, Tung Le2026-03-09💬 cs.CL

PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations

PONTE is a human-in-the-loop framework that enhances the reliability and personalization of Explainable AI narratives by modeling user adaptation as a closed-loop process involving preference modeling, grounded generation, and iterative verification to overcome the limitations of one-size-fits-all approaches and Large Language Model hallucinations.

Vittoria Vineis, Matteo Silvestri, Lorenzo Antonelli, Filippo Betello, Gabriele Tolomei2026-03-09🤖 cs.AI

Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing

The paper introduces Beyond Rows to Reasoning (BRTR), a multimodal agentic framework that overcomes the limitations of single-pass retrieval and context compression in enterprise spreadsheet analysis by employing an iterative tool-calling loop, achieving state-of-the-art performance on multiple benchmarks while maintaining full auditability.

Anmol Gulati, Sahil Sen, Waqar Sarguroh, Kevin Paul2026-03-09💬 cs.CL