cs.HC 件の論文 | Gist.Science

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

本論文は、ユーザー入力を直接画面フレームに変換する再帰型ニューラルネットワークと拡散ベースのレンダラーを組み合わせ、実際の操作記録や AI エージェントによる合成データから学習することで、既存の OS の GUI 再現だけでなく、インストールされていないアプリケーション（例：Doom）の動作さえもシミュレート可能なニューラル OS「NeuralOS」を提案するものである。

Luke Rivard, Sun Sun, Hongyu Guo, Wenhu Chen, Yuntian DengFri, 13 Ma💬 cs.CL

TRACE: AI-Assisted Assessment of Collaborative Projects in Computer Science Education

この論文は、大規模なコンピュータサイエンス教育におけるグループプロジェクトの個人貢献度を公平かつ客観的に評価するための半自動化 AI 支援フレームワーク「TRACE」を提案し、そのパイロット運用において教員の評価との高い一致、学生の満足度向上、教員の負荷軽減が確認されたことを報告しています。

Songmei Yu, Andrew ZagulaFri, 13 Ma🤖 cs.AI

Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation

本研究は、SHAP による説明と大規模言語モデルの自律的反復改善を組み合わせた「アジェンティック XAI」フレームワークを提案し、米収量データを用いた実証実験により、適切な早期停止戦略が採用された場合にのみ、専門家の評価で推奨品質が最大 33% 向上し、過度な反復による品質低下を防ぐことができることを示しました。

Tomoaki Yamaguchi, Yutong Zhou, Masahiro Ryo, Keisuke KatsuraFri, 13 Ma🤖 cs.AI

Learning Through Dialogue: Engagement and Efficacy Matter More Than Explanations

この論文は、LLM による学習が単なる説明の質ではなく、ユーザーの関与や政治的効力感といった対話的動態に依存しており、効果的な学習システム設計にはユーザーの関与状態に合わせた LLM の説明行動の調整が不可欠であることを示しています。

Shaz Furniturewala, Gerard Christopher Yeo, Kokil JaidkaFri, 13 Ma💬 cs.CL

Do LLMs Truly Benefit from Longer Context in Automatic Post-Editing?

この論文は、プロプライエタリな大規模言語モデルが単純なプロンプトでも人間レベルの自動ポストエディティング品質を達成する一方で、文書レベルのコンテキストを十分に活用できず、コストや遅延の課題も残っていることを示し、より効率的な長文脈モデルの必要性を浮き彫りにしています。

Ahrii Kim, Seong-heum KimFri, 13 Ma💬 cs.CL

Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

本論文は、リソース制約のあるロボット向けに、ゼロショットおよびワンショット適応を用いた小規模言語モデル（SLM）のリーダー・フォロワー役割分類を評価し、ファインチューニングされたモデルが低遅延かつ高精度な役割割り当てを実現できる一方で、対話の複雑さが増すと性能が低下するトレードオフを明らかにしたものである。

Rafael R. Baptista, André de Lima Salgado, Ricardo V. Godoy, Marcelo Becker, Thiago Boaventura, Gustavo J. G. LahrFri, 13 Ma⚡ eess

Exploring Collatz Dynamics with Human-LLM Collaboration

この論文は、人間と大規模言語モデル（LLM）の協働を通じてコラッツ予想の軌道構造を解析し、モジュラーな攪乱やバースト・ギャップ分解などの新たな性質を証明するとともに、収束への条件的枠組みを提案する探索的研究である。

Edward Y. ChangFri, 13 Ma🔢 math

"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

本研究は、AI を介した対話が障害差別（アビリズム）の認識に与える影響を検証し、対話形式が読みのみよりも効果的である一方、バイアスを指摘するAI の働きかけは否定的感情を増幅させる可能性があるが、包括的な支援は学習の足がかりとして機能することを明らかにした。

Atieh Taheri, Hamza El Alaoui, Patrick Carrington, Jeffrey P. BighamFri, 13 Ma🤖 cs.AI

Ghost Framing Theory: Exploring the role of generative AI in new venture rhetorical legitimation

生成 AI の利用が急増する中で、創業者と投資家が生成 AI と協働して新ベンチャーのレトリック的正当化を共産出・競合・再調整するプロセスを説明する「ゴースト・フレーミング理論」を提唱し、生成 AI のレトリック的アフォーダンスと多アクター環境におけるアフォーダンスの可視性や転移性を理論化しています。

Greg NyilasyFri, 13 Ma🤖 cs.AI

Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI

Ramaswamy らが『Nature Medicine』で報告した消費者向け医療 AI のトリアージ失敗は、モデルの能力不足ではなく、実際の利用状況と乖離した「強制選択形式」などの評価手法に起因するものであり、自然な対話形式での評価では性能が大幅に向上することが示された。

David Fraile Navarro, Farah Magrabi, Enrico CoieraFri, 13 Ma🤖 cs.AI

Managing Cognitive Bias in Human Labeling Operations for Rare-Event AI: Evidence from a Field Experiment

この論文は、医療画像のレアイベント検出における人間のラベリングで生じる認知バイアスを、フィードバックの偏在を調整し確率的ラベリングを採用することで軽減し、さらに線形対数オッズ再較正を用いて下流の CNN モデルの性能と較正を大幅に改善することを、実証実験を通じて示しています。

Gunnar P. Epping, Andrew Caplin, Erik Duhaime, William R. Holmes, Daniel Martin, Jennifer S. TruebloodFri, 13 Ma💰 q-fin

AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions

この論文は、臨床診断や投資判断など検証が困難な高リスクな意思決定において、最先端の LLM が「問題の特定はできるが修正ができず、誤ったパターンを高度化しながら繰り返す」という「ヘリコイド動力学」と呼ばれる失敗様式を示すことを明らかにし、信頼性の高い AI 連携に向けた仮説と対策を提案しています。

Alejandro R JadadFri, 13 Ma🤖 cs.AI

A technology-oriented mapping of the language and translation industry: Analysing stakeholder values and their potential implication for translation pedagogy

本論文は、LT-LiDER プロジェクトのインタビューデータに基づき、自動化が進む言語・翻訳業界において、効率性やサービス倫理が基盤となりつつも、専門性や適応力といった人的価値が再配置され、技術と人間の相互依存的な関係が翻訳教育に示唆を与えることを明らかにしている。

María Isabel Rivas Ginel, Janiça Hackenbuchner, Alina Secar\u{a}, Ralph Krüger, Caroline RossiFri, 13 Ma💬 cs.CL

← 前へ次へ →

cs.HC

NeuralOS: Towards Simulating Operating Systems via Neural Generative Models

TRACE: AI-Assisted Assessment of Collaborative Projects in Computer Science Education

Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation

Learning Through Dialogue: Engagement and Efficacy Matter More Than Explanations

Do LLMs Truly Benefit from Longer Context in Automatic Post-Editing?

Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction

Exploring Collatz Dynamics with Human-LLM Collaboration

"I followed what felt right, not what I was told": Autonomy, Coaching, and Recognizing Bias Through AI-Mediated Dialogue

Ghost Framing Theory: Exploring the role of generative AI in new venture rhetorical legitimation

Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI

Managing Cognitive Bias in Human Labeling Operations for Rare-Event AI: Evidence from a Field Experiment

AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics in Frontier LLMs Under High-Stakes Decisions

A technology-oriented mapping of the language and translation industry: Analysing stakeholder values and their potential implication for translation pedagogy

From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration

Modeling Trial-and-Error Navigation With a Sequential Decision Model of Information Scent

An Intent of Collaboration: On Agencies between Designers and Emerging (Intelligent) Technologies

Human-Centred LLM Privacy Audits: Findings and Frictions

MHDash: An Online Platform for Benchmarking Mental Health-Aware AI Assistants

A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding

ExSampling: a system for the real-time ensemble performance of field-recorded environmental sounds