Explore the Full Program of SIGGRAPH Asia 2025!
Close

Presentation

AeroVis3R: Geometry-Aware Vision–Language Models for 3D Reasoning over UAV Landmark Videos
SessionPosters
DescriptionAeroVis3R introduces geometry-aware vision–language models for UAV landmark videos, uniting 3D reconstruction and GPT reasoning. We also release a DJI Mini 3 Pro landmark dataset with Wikipedia annotations.
Contributor