Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications

本文介绍了基于 420 万条全球时间序列数据训练、在多项任务中性能显著优于前代及同类模型的开源地理空间基础模型 Prithvi-EO-2.0,该模型通过融合时空嵌入与用户反馈机制,实现了从灾害响应到生态系统监测等多样化地球观测应用的高效覆盖。

Daniela Szwarcman, Sujit Roy, Paolo Fraccaro, {\TH}orsteinn Elí Gíslason, Benedikt Blumenstiel, Rinki Ghosal, Pedro Henrique de Oliveira, Joao Lucas de Sousa Almeida, Rocco Sedona, Yanghui Kang, Srija Chakraborty, Sizhe Wang, Carlos Gomes, Ankur Kumar, Myscon Truong, Denys Godwin, Hyunho Lee, Chia-Yu Hsu, Rohit Lal, Ata Akbari Asanjan, Besart Mujeci, Disha Shidham, Trevor Keenan, Paulo Arevalo, Wenwen Li, Hamed Alemohammad, Pontus Olofsson, Christopher Hain, Robert Kennedy, Bianca Zadrozny, David Bell, Gabriele Cavallaro, Campbell Watson, Manil Maskey, Rahul Ramachandran, Juan Bernabe Moreno2026-03-10💻 cs

From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models

该论文提出了一种利用预训练视觉 - 语言模型(VLM)从少量演示中学习抽象符号世界模型的方法,通过自动构建和筛选谓词,使机器人能够在未见过的复杂场景中实现零样本泛化,从而解决长视野的决策规划问题。

Ashay Athalye, Nishanth Kumar, Tom Silver, Yichao Liang, Jiuguang Wang, Tomás Lozano-Pérez, Leslie Pack Kaelbling2026-03-10🤖 cs.LG

Enhancing Alzheimer's Diagnosis: Leveraging Anatomical Landmarks in Graph Convolutional Neural Networks on Tetrahedral Meshes

该研究提出了一种结合解剖学标志点与 Transformer 架构的新型图卷积神经网络,利用四面体网格处理 sMRI 数据,在无需昂贵 PET 扫描的情况下显著提升了阿尔茨海默病诊断及脑淀粉样蛋白阳性(尤其是中风险人群)的预测精度。

Yanxi Chen, Mohammad Farazi, Zhangsihao Yang, Yonghui Fan, Nicholas Ashton, Eric M Reiman, Yi Su, Yalin Wang2026-03-10💻 cs

Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars

本文提出了 Snapmoji 系统,通过高斯域自适应(GDA)技术将用户自拍即时转换为 3D 主风格头像并进一步应用二次风格化,从而在保留用户身份的同时生成可在移动设备上流畅动画的个性化双风格化虚拟形象。

Eric M. Chen, Di Liu, Sizhuo Ma, Michael Vasilkovsky, Bing Zhou, Qiang Gao, Wenzhou Wang, Jiahao Luo, Dimitris N. Metaxas, Vincent Sitzmann, Jian Wang2026-03-10💻 cs

SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis

本文提出了名为 SceneEval 的评估框架及包含 500 个文本描述与详细标注的基准数据集 SceneEval-500,旨在通过细粒度的显性需求指标(如物体数量、属性及空间关系)和隐性期望指标(如支撑、碰撞及可导航性),全面且可解释地评估文本条件 3D 室内场景生成方法的语义连贯性与合理性。

Hou In Ivan Tam, Hou In Derek Pun, Austin T. Wang, Angel X. Chang, Manolis Savva2026-03-10💻 cs

From 2D Alignment to 3D Plausibility: Unifying Heterogeneous 2D Priors and Penetration-Free Diffusion for Occlusion-Robust Two-Hand Reconstruction

该论文提出了一种从 2D 对齐到 3D 合理性的统一框架,通过融合异构基础模型先验进行 2D 结构对齐,并引入无穿透扩散模型优化 3D 空间交互,从而在单目图像中实现抗遮挡、无穿透且符合物理真实性的双手重建。

Gaoge Han, Yongkang Cheng, Zhe Chen, Shaoli Huang, Tongliang Liu2026-03-10💻 cs