cs.AI 件の論文 | Gist.Science

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

この論文は、異なるドメインのレイアウト構造やラベル付けスタイルの差異を考慮し、記述知識を手がかりとしてドメイン固有のプロンプトを生成する「PromptDLA」という新しいドメイン認識型プロンプターを提案し、複数の主要なドキュメントレイアウト分析データセットにおいて最先端の性能を達成したことを示しています。

Zirui Zhang, Yaping Zhang, Lu Xiang, Yang Zhao, Feifei Zhai, Yu Zhou, Chengqing ZongWed, 11 Ma🤖 cs.AI

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

この論文は、反復的な積分による遅延を回避しつつ多様な動作分布を単一ステップで保持するよう、条件付きフローマッチングの教師モデルを IMLE ベースの分布蒸留と双方向チャンバー距離を用いて高速な単一ステップ学生モデルへ転移するフレームワークを提案し、リアルタイムの多モーダルロボット制御を実現するものである。

Ju Dong, Liding Zhang, Lei Zhang, Yu Fu, Kaixin Bai, Zoltan-Csaba Marton, Zhenshan Bing, Zhaopeng Chen, Alois Christian Knoll, Jianwei ZhangWed, 11 Ma🤖 cs.AI

← 前へ次へ →

cs.AI

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

Open-World Motion Forecasting

CERES: A Probabilistic Early Warning System for Acute Food Insecurity

Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation

Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

Declarative Scenario-based Testing with RoadLogic

An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse

EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation

Telogenesis: Goal Is All U Need

GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models

Vibe-Creation: The Epistemology of Human-AI Emergent Cognition

Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection

Evolving Prompt Adaptation for Vision-Language Models

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference

cs.AI

PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health

Open-World Motion Forecasting

CERES: A Probabilistic Early Warning System for Acute Food Insecurity

Common Sense vs. Morality: The Curious Case of Narrative Focus Bias in LLMs

AI Act Evaluation Benchmark: An Open, Transparent, and Reproducible Evaluation Dataset for NLP and RAG Systems

A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation

Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

Declarative Scenario-based Testing with RoadLogic

An Empirical Study and Theoretical Explanation on Task-Level Model-Merging Collapse

EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation

Telogenesis: Goal Is All U Need

GenePlan: Evolving Better Generalized PDDL Plans using Large Language Models

Vibe-Creation: The Epistemology of Human-AI Emergent Cognition

Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection

Evolving Prompt Adaptation for Vision-Language Models

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

Enhancing Debunking Effectiveness through LLM-based Personality Adaptation

Compiler-First State Space Duality and Portable O(1)O(1)O(1) Autoregressive Caching for Inference

Compiler-First State Space Duality and Portable $O(1)$ Autoregressive Caching for Inference