cs 件の論文 | Gist.Science

IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams

本論文は、異種エージェント環境における自己対戦ベースの IPPO が、多様なトレーニングパートナーを意図的に導入する手法（RPT）と同等の汎化性能を示すことを明らかにし、単純な IPPO ベースラインが新規チームメイトに対しても十分な適応能力を有していることを実証しています。

Ryan LeRoy, Jack Kolb2026-03-10💻 cs

Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction

オフロード環境における道路ネットワーク抽出の課題を解決するため、大規模なオフロードデータセット「WildRoad」を公開し、従来のノード中心アプローチの限界を克服する経路中心のフレームワーク「MaGRoad」を提案し、高い精度と高速推論を実現する研究です。

Wenfei Guan, Jilin Mei, Tong Shen, Xumin Wu, Shuo Wang, Chen Min, Yu Hu2026-03-10💻 cs

ReMeDI: Refined Memory for Disambiguation of Identities with SAM3 in Surgical Segmentation

本論文は、手術映像における器具セグメンテーションの課題を解決するため、SAM3 のメモリ更新や容量制限、再出現時の識別問題を克服するトレーニング不要な拡張手法「ReMeDI-SAM3」を提案し、複数のデータセットで既存手法を上回る性能を達成したことを報告しています。

Valay Bundele, Mehran Hosseinzadeh, Hendrik P. A. Lensch2026-03-10💻 cs

It is not always greener on the other side: Greenery perception across demographics and personalities in multiple cities

この論文は、5 か国 1,000 人の調査とストリートビュー画像を用いた分析を通じて、都市の緑化に対する主観的認識と客観的測定値の乖離が世界的に普遍的であり、個人の属性や性格よりも居住地域による文化的・環境的経験の影響が最も大きいことを明らかにしています。

Matias Quintana, Fangqi Liu, Jussi Torkko, Youlong Gu, Xiucheng Liang, Yujun Hou, Koichi Ito, Yihan Zhu, Mahmoud Abdelrahman, Tuuli Toivonen, Yi Lu, Filip Biljecki2026-03-10💻 cs

VOIC: Visible-Occluded Integrated Guidance for 3D Semantic Scene Completion

この論文は、単一画像からの 3D 意味シーン補完において、可視領域の知覚と遮蔽領域の推論を分離・統合する「VOIC」という新たな双デコーダフレームワークを提案し、既存手法を上回る性能を達成したことを示しています。

Zaidao Han, Risa Higashita, Jiang Liu2026-03-10💻 cs

Cost Trade-offs of Reasoning and Non-Reasoning Large Language Models in Text-to-SQL

この論文は、Google BigQuery 上の大規模データセットを用いた実験を通じて、推論モデルが非推論モデルと比較してデータ転送量を大幅に削減しつつ同等の精度を維持し、実行時間とクラウドコストの相関が弱いことを示し、Text-to-SQL 導入におけるコスト最適化の指針を提示しています。

Saurabh Deochake, Debajyoti Mukhopadhyay2026-03-10💻 cs

NashOpt -- A Python Library for Computing Generalized Nash Equilibria

NashOpt は、共有制約を持つ非協力ゲームにおける一般ナッシュ均衡の計算と設計を可能にするオープンソースの Python ライブラリであり、JAX を活用した非線形最小二乗法や混合整数線形計画法を通じて、非線形ゲームから線形二次ゲーム、逆ゲームやスタッケルベルグゲーム設計問題までを包括的にサポートします。

Alberto Bemporad2026-03-10💻 cs

Toward a Physical Theory of Intelligence

この論文は、保存則と整合的な符号化（CCE）フレームワークを導入し、情報処理を不可逆的な物理過程として記述することで、知性・意識・量子測定・時空幾何学を熱力学的散逸の観点から統一的に理解する物理理論を提案しています。

Peter David Fagan2026-03-10💻 cs

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

本論文は、自動運転における生成ワールドモデルの進捗を測定し、視覚的リアリズム、軌道の妥当性、時間的整合性、制御性を包括的に評価する初のベンチマーク「DrivingGen」を提案し、既存モデルの課題とトレードオフを明らかにしたものである。

Yang Zhou, Hao Shao, Letian Wang, Zhuofan Zong, Hongsheng Li, Steven L. Waslander2026-03-10💻 cs

Machine Learning Guided Cooling System Optimization for Data Center

Frontier 超計算機の実運用データを用いた物理ガイド型機械学習フレームワークにより、冷却システムの非効率を特定し、安全な設定値の微調整を通じて年間 85 MWh に及ぶ過剰な冷却エネルギーの最大 96% を削減可能であることを示しました。

Shrenik Jadhav, Zheng Liu2026-03-10💻 cs

Batch-of-Thought: Cross-Instance Learning for Enhanced LLM Reasoning

この論文は、関連するクエリを独立して処理するのではなく、バッチ単位で共同処理することで推論パターンや一貫性制約を共有し、精度向上とコスト削減を実現する「Batch-of-Thought（BoT）」というトレーニング不要の手法を提案しています。

Xuan Yang, Furong Jia, Roy Xie, Xiong Xi, Hengwei Bian, Jian Li, Monica Agrawal2026-03-10💻 cs

Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging

本論文は、医療画像分析における大規模視覚言語モデルの限界を克服するため、タスクに応じたプロンプト構成、例示記憶に基づく生成、臨床的誤りの批判的検証、そして修正という 4 つの協調エージェントからなる自己改善型フレームワーク「R^4」を提案し、微調整なしでレポート生成および物体検出の精度を大幅に向上させることを示しています。

Md. Faiyaz Abdullah Sayeedi, Rashedur Rahman, Siam Tahsin Bhuiyan, Sefatul Wasi, Ashraful Islam, Saadia Binte Alam, AKM Mahbubur Rahman2026-03-10💻 cs

← 前へ次へ →

cs

IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams

Beyond Endpoints: Path-Centric Reasoning for Vectorized Off-Road Network Extraction

ReMeDI: Refined Memory for Disambiguation of Identities with SAM3 in Surgical Segmentation

It is not always greener on the other side: Greenery perception across demographics and personalities in multiple cities

VOIC: Visible-Occluded Integrated Guidance for 3D Semantic Scene Completion

Cost Trade-offs of Reasoning and Non-Reasoning Large Language Models in Text-to-SQL

NashOpt -- A Python Library for Computing Generalized Nash Equilibria

Toward a Physical Theory of Intelligence

DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving

Machine Learning Guided Cooling System Optimization for Data Center

Batch-of-Thought: Cross-Instance Learning for Enhanced LLM Reasoning

Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging

The Algorithmic Gaze of Image Quality Assessment: An Audit and Trace Ethnography of the LAION-Aesthetics Predictor

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

User Detection and Response Patterns of Sycophantic Behavior in Conversational AI

BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics

Multifaceted Scenario-Aware Hypergraph Learning for Next POI Recommendation

S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation

Equal-Pay Contracts

ReViP: Mitigating False Completion in Vision-Language-Action Models with Vision-Proprioception Rebalance