Contributor
Biography
He is a Ph.D. Student from Harbin Institute of Technology, Shenzhen. He focuses on Multimodal Collaborative Reasoning
Video Understanding and Generation
Multimodal Agent
Embodied Intelligence
Video Understanding and Generation
Multimodal Agent
Embodied Intelligence
Presentations


