1 code implementation • 2 Apr 2024 • Fangzhou Mu, Sicheng Mo, Yin Li
In this paper, we study the effect of cross-modal fusion on the scalability of video grounding models.
no code implementations • 12 Dec 2023 • Sicheng Mo, Fangzhou Mu, Kuan Heng Lin, Yanli Liu, Bochen Guan, Yin Li, Bolei Zhou
Recent approaches such as ControlNet offer users fine-grained spatial control over text-to-image (T2I) diffusion models.
2 code implementations • 16 Nov 2022 • Fangzhou Mu, Sicheng Mo, Gillian Wang, Yin Li
This report describes our submission to the Ego4D Moment Queries Challenge 2022.
Ranked #1 on Temporal Action Localization on Ego4D MQ test
1 code implementation • 16 Nov 2022 • Sicheng Mo, Fangzhou Mu, Yin Li
This report describes Badgers@UW-Madison, our submission to the Ego4D Natural Language Queries (NLQ) Challenge.
no code implementations • 3 May 2022 • Fangzhou Mu, Sicheng Mo, Jiayong Peng, Xiaochun Liu, Ji Hyun Nam, Siddeshwar Raghavan, Andreas Velten, Yin Li
Computational approach to imaging around the corner, or non-line-of-sight (NLOS) imaging, is becoming a reality thanks to major advances in imaging hardware and reconstruction algorithms.