Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews
This paper introduces a maximum likelihood model to estimate LLM usage in AI conference peer reviews, revealing that between 6.5% and 16.9% of text in recent reviews was substantially AI-generated, with higher usage correlated to lower reviewer confidence, proximity to deadlines, and lower engagement with rebuttals.