cs.CY papers | Gist.Science

Queer NLP: A Critical Survey on Literature Gaps, Biases and Trends

This survey critically examines the growing body of LGBTQIA+ NLP research within the ACL Anthology, revealing a reactive focus on identifying bias rather than proactive mitigation, and calls for future work to prioritize stakeholder involvement, intersectionality, interdisciplinarity, and non-English languages to build more just and inclusive technologies.

Sabine Weber, Angelina Wang, Ankush Gupta, Arjun Subramonian, Dennis Ulmer, Eshaan Tanwar, Geetanjali Aich, Hannah Devinney, Jacob Hobbs, Jennifer Mickel, Joshua Tint, Mae Sosto, Ray Groshan, Simone Astarita, Vagrant Gautam, Verena Blaschke, William Agnew, Wilson Y Lee, Yanan LongWed, 11 Ma💻 cs

Towards a Goal-Centric Assessment of Requirements Engineering Methods for Privacy by Design

This paper proposes a goal-centric framework for assessing Requirements Engineering methods for Privacy by Design, arguing that practitioners should evaluate these methods based on organizational goals rather than solely on process characteristics to better support their selection and tailoring.

Oleksandr Kosenkov, Ehsan Zabardast, Jannik Fischbach, Tony Gorschek, Daniel MendezWed, 11 Ma💻 cs

A Decade of News Forum Interactions: Threaded Conversations, Signed Votes, and Topical Tags

This paper introduces a large-scale, privacy-preserving dataset of ten years of user interactions on the Austrian newspaper DerStandard, comprising over 75 million comments and 400 million votes with anonymized identifiers and pre-computed vector embeddings to facilitate research on online discourse dynamics in the German language.

Emma Fraxanet, Vicenç Gómez, Andreas Kaltenbrunner, Max PellertWed, 11 Ma💻 cs

Gender Bias in Perception of Human Managers Extends to AI Managers

This study demonstrates that gender biases in leadership perceptions, where male managers are favored and female managers face greater skepticism upon making unfavorable decisions, extend from human leaders to AI managers, highlighting the need to address these prejudices in the design of AI-driven organizational systems.

Hao Cui, Taha YasseriWed, 11 Ma💻 cs

Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors

This study employs text-mining techniques to analyze 160 guidelines and policy statements across fourteen industrial sectors, offering critical insights and recommendations for balancing innovation with ethical accountability in the governance of Generative AI and Large Language Models.

Junfeng Jiao, Saleh Afroogh, Kevin Chen, David Atkinson, Amit DhurandharWed, 11 Ma💻 cs

Implicit Biases in Refereeing: Lessons from NBA Referees

This study analyzes NBA play-by-play and Last Two Minutes data to find that while referees exhibit a home-court advantage bias (which has decreased since the pandemic) and show statistical favoritism toward specific players, there is no evidence of negative bias against particular players, teams, or racial groups.

Konstantinos PelechrinisWed, 11 Ma💻 cs

Excess demand in public transportation systems: The case of Pittsburgh's Port Authority

This paper proposes a framework using Poisson regression with censored data filtering to accurately estimate excess demand in public transportation systems, addressing the common issue of underestimation caused by unrecorded passengers left behind on full buses, and validates the approach using simulated data and real-world data from Pittsburgh's Port Authority.

Tianfang Ma, Robizon Khubulashvili, Sera Linardi, Konstantinos PelechrinisWed, 11 Ma💻 cs

Pwned: How Often Are Americans' Online Accounts Breached?

By combining data from a representative sample of 5,000 American adults with breach records from Have I Been Pwned, the study estimates that at least 82.84% of Americans have experienced an account breach, with an average of at least three breaches per person, a risk that is higher among women, White individuals, the middle-aged, and those with higher education levels.

Ken Cor, Gaurav SoodWed, 11 Ma💻 cs

Towards Viewpoint-centric Artifact-based Regulatory Requirements Engineering for Compliance by Design

This paper reports on the synthesis and seeks feedback for the future evaluation of an Artefact Model for Regulatory Requirements Engineering (AM4RRE), aiming to bridge the gap between organizational regulatory processes and ad-hoc software development practices to achieve systematic, integrated compliance by design.

Oleksandr KosenkovWed, 11 Ma💻 cs

PixelConfig: Longitudinal Measurement and Reverse-Engineering of Meta Pixel Configurations

This paper introduces PixelConfig, a framework for reverse-engineering Meta Pixel configurations, which reveals that default settings drive widespread adoption of activity and identity tracking features capable of capturing sensitive health data, while existing tracking restriction mechanisms offer limited practical protection.

Abdullah Ghani (Lahore University of Management Sciences), Yash Vekaria (University of California, Davis), Zubair Shafiq (University of California, Davis)Wed, 11 Ma💻 cs

From Verification to Amplification: Auditing Reverse Image Search as Algorithmic Gatekeeping in Visual Misinformation Fact-checking

This study audits Google's reverse image search and finds that it functions as an ineffective gatekeeper against visual misinformation, often prioritizing irrelevant content and repeated falsehoods over debunking information, particularly during the initial emergence of visual falsehoods.

Cong Lin, Yifei Chen, Jiangyue Chen, Yingdan Lu, Yilang Peng, Cuihua ShenWed, 11 Ma💻 cs

Does Scientific Writing Converge to U.S. English? Evidence from Generative AI-Assisted Publications

Using a large-scale analysis of 5.65 million scientific articles, this study finds that generative AI tools are driving non-English-speaking authors to increasingly converge toward U.S. English stylistic norms, particularly in contexts where language barriers have historically been most significant, thereby reducing publication obstacles while raising questions about linguistic diversity.

Dragan Filimonovic, Christian Rutzer, Jeffrey Macher, Rolf WederWed, 11 Ma💬 cs.CL

Benchmarking Political Persuasion Risks Across Frontier Large Language Models

Through large-scale survey experiments involving over 19,000 participants, this study demonstrates that frontier large language models generally outperform standard political advertisements in persuasiveness, with significant performance variations across models and a model-dependent impact of information-based prompting strategies.

Zhongren Chen, Joshua Kalla, Quan LeWed, 11 Ma💬 cs.CL

Self-hosted Lecture-to-Quiz: Local LLM MCQ Generation with Deterministic Quality Control

This paper presents an end-to-end, self-hosted pipeline that converts lecture PDFs into multiple-choice questions using a local LLM and deterministic quality control, ensuring privacy and accountability while releasing a validated 24-question dataset with a detailed warning taxonomy for educational use.

Seine A. ShintaniWed, 11 Ma💻 cs

Artificial Intelligence (AI) Maturity in Small and Medium-Sized Enterprises: A Framework of Internalized and Ecosystem-Embedded Capabilities

This study proposes a novel, context-sensitive AI maturity framework specifically designed for small and medium-sized enterprises (SMEs) that reconceptualizes maturity as a multidimensional, non-linear, and ecosystem-embedded capability comprising eight dimensions, five levels, and four development pathways to better reflect the unique resource constraints and organizational realities of SMEs.

Sukanlaya Sawang, Virach SornlertlamvanichWed, 11 Ma💻 cs

Infusion: Shaping Model Behavior by Editing Training Data via Influence Functions

The paper introduces "Infusion," a framework that leverages scalable influence-function approximations to compute subtle perturbations in training data, demonstrating that modifying as little as 0.2% of a dataset can effectively and transferably shape model behavior across vision and language domains.

J Rosser, Robert Kirk, Edward Grefenstette, Jakob Foerster, Laura RuisWed, 11 Ma🤖 cs.AI

Singing Syllabi with Virtual Avatars: Enhancing Student Engagement Through AI-Generated Music and Digital Embodiment

This paper proposes and evaluates a novel educational approach that uses AI-generated singing and virtual avatars to transform traditional text-based syllabi into engaging audiovisual performances, demonstrating that this method significantly improves student awareness and recall of critical course information.

Xinxing WuWed, 11 Ma🤖 cs.AI

Why do we Trust Chatbots? From Normative Principles to Behavioral Drivers

This paper argues that user trust in chatbots is often driven by interactional design choices that exploit cognitive biases rather than genuine trustworthiness, urging a reframing of chatbots as skilled salespeople and a distinction between psychological trust formation and normative trustworthiness to better calibrate user expectations.

Aditya Gulati, Nuria OliverWed, 11 Ma🤖 cs.AI

Computational Multi-Agents Society Experiments: Social Modeling Framework Based on Generative Agents

This paper introduces CMASE, a novel framework that integrates generative agent-based modeling with virtual ethnography to transform researchers into embedded participants, enabling real-time social intervention, causal reconstruction of social phenomena, and predictive modeling with high empirical accuracy.

Hanzhong Zhang, Muhua Huang, Jindong WangWed, 11 Ma🤖 cs.AI

Clear, Compelling Arguments: Rethinking the Foundations of Frontier AI Safety Cases

This paper critiques the current limitations of alignment-focused safety cases for frontier AI by drawing on established methodologies from safety-critical industries to propose a more robust, defensible framework, illustrated through a case study on deceptive alignment and CBRN capabilities.

Shaun Feakins, Ibrahim Habli, Phillip MorganWed, 11 Ma🤖 cs.AI