cs.CV 件の論文 | Gist.Science

Event-Only Drone Trajectory Forecasting with RPM-Modulated Kalman Filtering

本論文は、イベントカメラの生データからプロペラ回転数を抽出し、これを考慮したカルマンフィルタを適用することで、RGB 画像や学習データに依存せずドローンの軌道を高精度に予測する手法を提案し、FRED データセットにおける評価で既存の学習ベース手法や標準的なカルマンフィルタを上回る性能を実証したものである。

Hari Prasanth S. M., Pejman Habibiroudkenar, Eerik Alamikkotervo + 2 more2026-03-03⚡ eess

3D Field of Junctions: A Noise-Robust, Training-Free Structural Prior for Volumetric Inverse Problems

この論文は、2D 画像の Field of Junctions を 3D 空間に拡張した「3D Field of Junctions」を提案し、学習データが不要でハルシネーションのリスクがなく、低 SNR 環境における 3D 画像のノイズ除去や構造復元において、従来の古典的および深層学習手法を上回る性能を発揮することを示しています。

Namhoon Kim, Narges Moeini, Justin Romberg + 1 more2026-03-03⚡ eess

Data Augmentation via Mixed Class Interpolation using Cycle-Consistent Generative Adversarial Networks Applied to Cross-Domain Imagery

この論文は、可視光画像から合成開口レーダー（SAR）画像への変換を行うサイクル整合型 GAN を用いた混合クラス補間手法（C2GMA）を提案し、SAR 画像のデータ不足を解消して分類精度を大幅に向上させることを実証しています。

Hiroshi Sasaki, Chris G. Willcocks, Toby P. Breckon2026-03-02🤖 cs.LG

← 前へ次へ →

cs.CV

Event-Only Drone Trajectory Forecasting with RPM-Modulated Kalman Filtering

3D Field of Junctions: A Noise-Robust, Training-Free Structural Prior for Volumetric Inverse Problems

Data Augmentation via Mixed Class Interpolation using Cycle-Consistent Generative Adversarial Networks Applied to Cross-Domain Imagery

Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation

CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving

A Fault Detection Scheme Utilizing Convolutional Neural Network for PV Solar Panels with High Accuracy

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

Uni-ISP: Toward Unifying the Learning of ISPs from Multiple Mobile Cameras

R2GenCSR: Mining Contextual and Residual Information for LLMs-based Radiology Report Generation

Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion

Towards Privacy-Guaranteed Label Unlearning in Vertical Federated Learning: Few-Shot Forgetting without Disclosure

Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception

CLAP: Unsupervised 3D Representation Learning for Fusion 3D Perception via Curvature Sampling and Prototype Learning

GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection

Multi-illuminant Color Constancy via Multi-scale Illuminant Estimation and Fusion

DSV: Exploiting Dynamic Sparsity to Accelerate Large-Scale Video DiT Training

Spread them Apart: Towards Robust Watermarking of Generated Content

JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data