Search Results for author: Xuweiyi Chen

Found 2 papers, 2 papers with code

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

1 code implementation • 4 Mar 2024 • Xuweiyi Chen, Tian Xia, Sihan Xu

Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content.

Video Generation

Paper
Code

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

1 code implementation • 21 Sep 2023 • Jianing Yang, Xuweiyi Chen, Shengyi Qian, Nikhil Madaan, Madhavan Iyengar, David F. Fouhey, Joyce Chai

While existing approaches often rely on extensive labeled data or exhibit limitations in handling complex language queries, we propose LLM-Grounder, a novel zero-shot, open-vocabulary, Large Language Model (LLM)-based 3D visual grounding pipeline.

Language Modelling Large Language Model +3

256

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.