Presentation
PITAR: an LLM-powered Agent towards Intelligent and Accurate Manipulations in Extended Reality with Multimodal Interactions
Session: XR
Description: PITAR is an LLM-powered XR agent that fuses eye gaze, gestures, and speech to interpret pronoun-based commands and control virtual objects. It enables real-time, human-like interaction through multimodal reasoning and few-shot prompting on the Meta Quest Pro.
Contributors