[IROS'25] This repository is the official implementation of WMNav, a novel World Model-based Object Goal Navigation framework powered by Vision-Language Models. agent_cfg: ... vlm_cfg: model_cls: ...
Multiple Object Tracking (MOT) is a critical area of computer vision with a broad range of practical applications. Current research has primarily focused on the development of tracking ...
I am a Senior Member of Technical Staff at Salesforce, where I build AI-driven enterprise solutions that integrate LLMs.