Queer NLP: A Critical Survey on Literature Gaps, Biases and Trends

This survey critically examines the growing body of LGBTQIA+ NLP research within the ACL Anthology, revealing a reactive focus on identifying bias rather than proactive mitigation, and calls for future work to prioritize stakeholder involvement, intersectionality, interdisciplinarity, and non-English languages to build more just and inclusive technologies.

Sabine Weber, Angelina Wang, Ankush Gupta, Arjun Subramonian, Dennis Ulmer, Eshaan Tanwar, Geetanjali Aich, Hannah Devinney, Jacob Hobbs, Jennifer Mickel, Joshua Tint, Mae Sosto, Ray Groshan, Simone Astarita, Vagrant Gautam, Verena Blaschke, William Agnew, Wilson Y Lee, Yanan LongWed, 11 Ma💻 cs

A Decade of News Forum Interactions: Threaded Conversations, Signed Votes, and Topical Tags

This paper introduces a large-scale, privacy-preserving dataset of ten years of user interactions on the Austrian newspaper DerStandard, comprising over 75 million comments and 400 million votes with anonymized identifiers and pre-computed vector embeddings to facilitate research on online discourse dynamics in the German language.

Emma Fraxanet, Vicenç Gómez, Andreas Kaltenbrunner, Max PellertWed, 11 Ma💻 cs

Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors

This study employs text-mining techniques to analyze 160 guidelines and policy statements across fourteen industrial sectors, offering critical insights and recommendations for balancing innovation with ethical accountability in the governance of Generative AI and Large Language Models.

Junfeng Jiao, Saleh Afroogh, Kevin Chen, David Atkinson, Amit DhurandharWed, 11 Ma💻 cs

Excess demand in public transportation systems: The case of Pittsburgh's Port Authority

This paper proposes a framework using Poisson regression with censored data filtering to accurately estimate excess demand in public transportation systems, addressing the common issue of underestimation caused by unrecorded passengers left behind on full buses, and validates the approach using simulated data and real-world data from Pittsburgh's Port Authority.

Tianfang Ma, Robizon Khubulashvili, Sera Linardi, Konstantinos PelechrinisWed, 11 Ma💻 cs

PixelConfig: Longitudinal Measurement and Reverse-Engineering of Meta Pixel Configurations

This paper introduces PixelConfig, a framework for reverse-engineering Meta Pixel configurations, which reveals that default settings drive widespread adoption of activity and identity tracking features capable of capturing sensitive health data, while existing tracking restriction mechanisms offer limited practical protection.

Abdullah Ghani (Lahore University of Management Sciences), Yash Vekaria (University of California, Davis), Zubair Shafiq (University of California, Davis)Wed, 11 Ma💻 cs

From Verification to Amplification: Auditing Reverse Image Search as Algorithmic Gatekeeping in Visual Misinformation Fact-checking

This study audits Google's reverse image search and finds that it functions as an ineffective gatekeeper against visual misinformation, often prioritizing irrelevant content and repeated falsehoods over debunking information, particularly during the initial emergence of visual falsehoods.

Cong Lin, Yifei Chen, Jiangyue Chen, Yingdan Lu, Yilang Peng, Cuihua ShenWed, 11 Ma💻 cs

Does Scientific Writing Converge to U.S. English? Evidence from Generative AI-Assisted Publications

Using a large-scale analysis of 5.65 million scientific articles, this study finds that generative AI tools are driving non-English-speaking authors to increasingly converge toward U.S. English stylistic norms, particularly in contexts where language barriers have historically been most significant, thereby reducing publication obstacles while raising questions about linguistic diversity.

Dragan Filimonovic, Christian Rutzer, Jeffrey Macher, Rolf WederWed, 11 Ma💬 cs.CL

Artificial Intelligence (AI) Maturity in Small and Medium-Sized Enterprises: A Framework of Internalized and Ecosystem-Embedded Capabilities

This study proposes a novel, context-sensitive AI maturity framework specifically designed for small and medium-sized enterprises (SMEs) that reconceptualizes maturity as a multidimensional, non-linear, and ecosystem-embedded capability comprising eight dimensions, five levels, and four development pathways to better reflect the unique resource constraints and organizational realities of SMEs.

Sukanlaya Sawang, Virach SornlertlamvanichWed, 11 Ma💻 cs