cs.CY 件の論文 | Gist.Science

Measuring AI R&D Automation

この論文は、AI 研究開発の自動化（AIRDA）の現状と影響を把握するため、従来のベンチマークでは捉えきれない資本配分や研究者の時間割、セキュリティ侵害事象などの新たな指標を提案し、企業や政府によるデータ収集の重要性を説いています。

Alan Chan, Ranay Padarath, Joe Kwon + 2 more2026-03-06💻 cs

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

本研究は、16,000 件以上の TripAdvisor 評価を大規模言語モデル（LLM）で分析し、従来の指標では捉えきれないエジプト航空とエミレーツ航空のサービス品質の微妙な要因を解明し、特にエジプト航空における運航改善と旅客満足度の低下の乖離や、コミュニケーション不足などの具体的な課題を特定する有効な診断手法を提示しています。

Ahmed Dawoud, Osama El-Shamy, Ahmed Habashy2026-03-06💻 cs

Invariant Causal Routing for Governing Social Norms in Online Market Economies

本論文は、オンライン市場経済における社会的規範の安定性を高めるため、異質な環境下で不変な因果関係を特定し、解釈可能な政策ルールの構築を可能にする「不変因果ルーティング（ICR）」というガバナンス枠組みを提案し、その有効性を実証しています。

Xiangning Yu, Qirui Mi, Xiao Xue + 4 more2026-03-06💻 cs

Token Taxes: mitigating AGI's economic risks

この論文は、汎用人工知能（AGI）がもたらす経済的リスクを軽減するため、既存の計算ガバナンスインフラを活用してモデル推論の使用段階で課税する「トークン税」の導入を提案し、その執行メカニズムや経済的影響の評価、代替案、そして超大国による拒否権の回避策について論じています。

Lucas Irwin, Tung-Yu Wu, Fazl Barez2026-03-06💻 cs

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

この論文は、プライバシーや倫理的配慮を最優先としたユーザー中心のアプローチを採用することで、公共市場における AI 支援型動画ソリューションの導入が可能であることを示し、人間の姿勢検出と行動分析に基づいて顧客の滞留時間や動線などの多面的な行動インサイトを抽出し、施設運営の最適化に貢献できることを実証したケーススタディです。

Mehrnoush Fereydouni, Eka Ebong, Sahar Maleki + 3 more2026-03-06💻 cs

Stan: An LLM-based thermodynamics course assistant

本論文は、学生向けに教科書に基づいた回答を提供し、教員向けに講義の分析と振り返りを支援する双方向の AI ツール「Stan」を、クラウドに依存せずオープンウェイトモデルとローカルハードウェアのみで構築・実装し、その設計と課題解決について記述したものである。

Eric M. Furst, Vasudevan Venkateshwaran2026-03-06🔬 physics

Generalizing Fair Top- $k$ Selection: An Integrative Approach

本論文は、複数の保護グループを考慮した公平なトップ $k$ 選択問題において、参照スコア関数からの乖離を最小化する課題の計算複雑性を分析し、特定の条件下で効率的なアルゴリズムを導出するとともに、重みの摂動に対して安定したスコア関数を得るための新たな「有用性損失」指標を導入し、実データを用いた実験でその有効性を示す統合的なアプローチを提案する。

Guangya Cai2026-03-06💻 cs

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

この論文は、主要なソーシャルメディアプラットフォームのサービス利用規約を対象に、言語的複雑さや非確定的な表現などの課題を明らかにする新たな評価枠組みを提案し、規約が形式的な同意手段ではなく、ユーザーのデータ同意の条件を形作る文書として再定義すべきだと論じています。

Yong-Bin Kang, Anthony McCosker2026-03-06💻 cs

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

本論文は、多文化環境におけるマイクロ表情認識の人間によるアノテーション誤差を軽減するため、キーフレームの動的再選択と共有パラメータを持つ二ブランチ構造を用いた「グローバル反単調微分選択戦略（GAMDSS）」を提案し、既存モデルのパラメータ増加なしに認識性能を向上させることを示しています。

Feng Liu, Bingyu Nan, Xuezhong Qian + 1 more2026-03-06💻 cs

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

本論文は、890 の研究結果をメタ分析し、自動短回答採点における LLM の限界（難易度との非相関、デコーダ型とエンコーダ型の性能差、トークナイザーの限界、および教育現場における人種的バイアスなど）を明らかにし、より適切なシステム設計の必要性を提言するものである。

Michael Hardy2026-03-06💬 cs.CL

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

本論文は、二層 ReLU 畳み込みニューラルネットワークにおける DP-SGD の学習ダイナミクスを特徴中心の枠組みで分析し、プライバシー保護に必要なノイズが特徴学習を阻害し、クラス間の不均衡や長尾分布、敵対的攻撃に対する脆弱性、そしてドメインシフト下の転移学習の失敗といった公平性とロバスト性の低下を理論的に解明したものである。

Ruichen Xu, Kexin Chen2026-03-06🤖 cs.LG

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

法学生を対象とした無作為化実験により、生成 AI へのアクセス権の付与だけでなく、短時間のトレーニングが利用促進と試験成績の向上に不可欠であることが実証されました。

Benjamin M. Chen, Hong Bao2026-03-06🤖 cs.AI

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

本論文は、匿名化された履歴書に残る言語や趣味などの微妙な社会文化的な手がかりが、大規模言語モデル（LLM）による採用選考において人種や性別に基づくバイアスを再生産し、公平な選考を阻害する可能性を、シンガポールを事例とした大規模な実験を通じて実証したものである。

Bryan Chen Zhengyu Tan, Shaun Khoo, Bich Ngoc Doan + 3 more2026-03-06💻 cs

← 前へ次へ →

cs.CY

Measuring AI R&D Automation

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

Invariant Causal Routing for Governing Social Norms in Online Market Economies

Token Taxes: mitigating AGI's economic risks

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

Stan: An LLM-based thermodynamics course assistant

Generalizing Fair Top- $k$ Selection: An Integrative Approach

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

Cognitive Warfare: Definition, Framework, and Case Study

The role of spatial scales in assessing urban mobility models

NL2GDS: LLM-aided interface for Open Source Chip Design

Synthetic emotions and consciousness: exploring architectural boundaries

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Dutch Metaphor Extraction from Cancer Patients' Interviews and Forum Data using LLMs and Human in the Loop

A Systematic Analysis of Biases in Large Language Models

cs.CY

Measuring AI R&D Automation

Signal in the Noise: Decoding the Reality of Airline Service Quality with Large Language Models

Invariant Causal Routing for Governing Social Norms in Online Market Economies

Token Taxes: mitigating AGI's economic risks

A Case Study in Responsible AI-Assisted Video Solutions: Multi-Metric Behavioral Insights in a Public Market Setting

Stan: An LLM-based thermodynamics course assistant

Generalizing Fair Top-kkk Selection: An Integrative Approach

Analysis of Terms of Service on Social Media Platforms: Consent Challenges and Assessment Metrics

Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression Recognition

Autoscoring Anticlimax: A Meta-analytic Understanding of AI's Short-answer Shortcomings and Wording Weaknesses

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

Cognitive Warfare: Definition, Framework, and Case Study

The role of spatial scales in assessing urban mobility models

NL2GDS: LLM-aided interface for Open Source Chip Design

Synthetic emotions and consciousness: exploring architectural boundaries

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Dutch Metaphor Extraction from Cancer Patients' Interviews and Forum Data using LLMs and Human in the Loop

A Systematic Analysis of Biases in Large Language Models

Generalizing Fair Top- $k$ Selection: An Integrative Approach