cs.CY 편의 논문 | Gist.Science

Measuring AI R&D Automation

이 논문은 AI 연구개발 자동화 (AIRDA) 의 범위와 영향에 대한 불확실성을 해소하기 위해 자본 지출 비중, 연구자 시간 배분, AI 하위화 사고 등 다양한 차원의 측정 지표를 제안하고, 기업과 정부 차원의 데이터 수집을 권장합니다.

Alan Chan, Ranay Padarath, Joe Kwon + 2 more2026-03-06💻 cs

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

본 논문은 16,000 건 이상의 TripAdvisor 리뷰를 분석한 대규모 언어 모델 (LLM) 기반 프레임워크가 기존 지표가 포착하지 못한 항공사 서비스의 미세한 문제와 만족도 하락 원인을 규명하여, 항공 및 관광 산업에 실행 가능한 전략적 통찰을 제공하는 유효한 진단 도구임을 입증합니다.

Ahmed Dawoud, Osama El-Shamy, Ahmed Habashy2026-03-06💻 cs

Invariant Causal Routing for Governing Social Norms in Online Market Economies

이 논문은 온라인 시장 경제에서 사회적 규범을 효과적으로 관리하기 위해, 다양한 환경에서 안정적인 인과 관계를 식별하고 해석 가능한 정책 규칙을 도출하는 '불변 인과 라우팅 (ICR)' 프레임워크를 제안하고 그 유효성을 입증합니다.

Xiangning Yu, Qirui Mi, Xiao Xue + 4 more2026-03-06💻 cs

Token Taxes: mitigating AGI's economic risks

이 논문은 AGI 가 초래할 수 있는 경제적 위험을 완화하기 위해 기존 컴퓨팅 거버넌스 인프라를 통해 집행 가능한 토큰 세금 (모델 추론 시 부과되는 사용량 기반 과세) 을 제안하고, 그 장단점 및 실행 방안을 논의합니다.

Lucas Irwin, Tung-Yu Wu, Fazl Barez2026-03-06💻 cs

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

이 논문은 사생활 보호와 윤리적 기준을 철저히 준수하는 사용자 중심 접근법을 통해 도시 공공시장에서 AI 기반 비디오 솔루션이 다중 지표 행동 인사이트를 성공적으로 도출하여 공간 분석의 실용성을 입증한 사례 연구입니다.

Mehrnoush Fereydouni, Eka Ebong, Sahar Maleki + 3 more2026-03-06💻 cs

Stan: An LLM-based thermodynamics course assistant

이 논문은 클라우드 API 에 의존하지 않고 오픈 가중치 모델과 로컬 하드웨어만으로 구동되며, 화학공학 열역학 과정에서 학생에게는 RAG 기반의 질문 응답을, 강사에게는 강의 분석 및 교재 인덱싱을 제공하는 'Stan'이라는 양면형 AI 도구의 설계, 구현 및 배포 경험을 제시합니다.

Eric M. Furst, Vasudevan Venkateshwaran2026-03-06🔬 physics

Generalizing Fair Top- $k$ Selection: An Integrative Approach

이 논문은 다수의 보호 집단을 고려한 공정한 상위-k 선택 문제를 다루며, 기존 연구의 한계를 극복하기 위해 계산 복잡성 분석을 통해 효율성 회복 가능성을 규명하고, 편차 최소화를 넘어 더 안정적인 유틸리티 손실 지표를 도입하여 실세계 데이터에서 우수한 성능을 보이는 통합적 알고리즘을 제안합니다.

Guangya Cai2026-03-06💻 cs

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

본 연구는 13 개 주요 소셜 미디어 플랫폼의 이용약관을 분석하여 동의 관련 정보가 명확히 전달되지 않는 문제를 규명하고, 텍스트 접근성, 의미 투명성, 인터페이스 설계를 평가하는 3 차원 프레임워크를 제안함으로써 이용약관을 단순한 동의 문서가 아닌 사용자의 데이터 관행에 대한 동의 조건을 형성하는 문서로 재정의합니다.

Yong-Bin Kang, Anthony McCosker2026-03-06💻 cs

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

이 논문은 문화적 배경에 따른 인간 주석 편향을 줄이고 마이크로표정 인식 성능을 향상시키기 위해, 오프셋 프레임의 불확실성을 해결하는 새로운 전역 반단조 차분 선택 전략 (GAMDSS) 아키텍처를 제안하고 이를 통해 다문화 데이터셋에서 주관적 오류를 효과적으로 감소시켰음을 보여줍니다.

Feng Liu, Bingyu Nan, Xuezhong Qian + 1 more2026-03-06💻 cs

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

이 논문은 890 개의 결과를 메타 분석하여 단답형 채점에서 LLM 의 성능이 인간 전문가의 난이도 인식과 무관하며, 디코더 전용 아키텍처가 인코더보다 현저히 낮고 토크나이저 어휘 크기 증가에도 한계가 있으며, 고위험 교육 맥락에서 인종 차별적 편향이 발생할 수 있음을 규명했습니다.

Michael Hardy2026-03-06💬 cs.CL

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

이 논문은 이층 ReLU 합성곱 신경망에서 차분 프라이버시 (DP-SGD) 가 노이즈로 인해 특징 학습이 비최적화되어 불공정성과 취약성을 초래하며, 공개 사전 학습과 개인화 미세 조정의 효과도 데이터 분포 편차에 따라 제한적임을 이론적 분석과 실험을 통해 규명합니다.

Ruichen Xu, Kexin Chen2026-03-06🤖 cs.LG

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

이 연구는 법률 분석과 같은 지식 집약적 분야에서 생성형 AI 의 생산성 향상을 위해서는 단순한 접근성 제공보다 사용자 교육이 필수적이며, 이를 통해 AI 활용률과 수행 성과가 모두 유의미하게 개선됨을 보여줍니다.

Benjamin M. Chen, Hong Bao2026-03-06🤖 cs.AI

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

이 논문은 이름 등 명시적 개인 식별 정보를 제거한 이력서에서도 언어, 취미, 봉사 활동과 같은 미묘한 사회문화적 표지가 인종과 성별의 대용물이 되어 LLM 기반 채용 과정에서 체계적인 편향을 유발하고, 특히 설명을 요구하는 프롬프팅이 이러한 편향을 더욱 악화시킨다는 사실을 싱가포르 맥락의 대규모 실험을 통해 규명했습니다.

Bryan Chen Zhengyu Tan, Shaun Khoo, Bich Ngoc Doan + 3 more2026-03-06💻 cs

← 이전 다음 →

cs.CY

Measuring AI R&D Automation

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

Invariant Causal Routing for Governing Social Norms in Online Market Economies

Token Taxes: mitigating AGI's economic risks

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

Stan: An LLM-based thermodynamics course assistant

Generalizing Fair Top- $k$ Selection: An Integrative Approach

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

Cognitive Warfare: Definition, Framework, and Case Study

The role of spatial scales in assessing urban mobility models

NL2GDS: LLM-aided interface for Open Source Chip Design

Synthetic emotions and consciousness: exploring architectural boundaries

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Dutch Metaphor Extraction from Cancer Patients' Interviews and Forum Data using LLMs and Human in the Loop

A Systematic Analysis of Biases in Large Language Models

cs.CY

Measuring AI R&D Automation

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

Invariant Causal Routing for Governing Social Norms in Online Market Economies

Token Taxes: mitigating AGI's economic risks

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

Stan: An LLM-based thermodynamics course assistant

Generalizing Fair Top-kkk Selection: An Integrative Approach

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

Cognitive Warfare: Definition, Framework, and Case Study

The role of spatial scales in assessing urban mobility models

NL2GDS: LLM-aided interface for Open Source Chip Design

Synthetic emotions and consciousness: exploring architectural boundaries

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Dutch Metaphor Extraction from Cancer Patients' Interviews and Forum Data using LLMs and Human in the Loop

A Systematic Analysis of Biases in Large Language Models

Generalizing Fair Top- $k$ Selection: An Integrative Approach