Explore the Full Program of SIGGRAPH Asia 2025!

Presentation

Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning
Description: This paper introduces an online self-exploration loop that enables multimodal agents to self-improve via AI-generated tasks and LLM-verified preference tuning, without human annotations.