no code implementations • 8 Jan 2025 • Andrew Bond, Jui-Hsien Wang, Long Mai, Erkut Erdem, Aykut Erdem
To address these issues, we introduce a novel neural video representation that combines 3D Gaussian splatting with continuous camera motion modeling.
1 code implementation • 6 Jan 2025 • Luozhou Wang, Yijun Li, Zhifei Chen, Jui-Hsien Wang, Zhifei Zhang, He Zhang, Zhe Lin, Yingcong Chen
Text-to-video generative models have made significant strides, enabling diverse applications in entertainment, advertising, and education.
no code implementations • 27 Dec 2024 • Shaoteng Liu, Tianyu Wang, Jui-Hsien Wang, Qing Liu, Zhifei Zhang, Joon-Young Lee, Yijun Li, Bei Yu, Zhe Lin, Soo Ye Kim, Jiaya Jia
Large-scale video generation models have the inherent ability to realistically model natural scenes.
no code implementations • 17 Dec 2024 • Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang, Difan Liu, Feng Liu, Ming-Hsuan Yang, Zhan Xu
Generating realistic human videos remains a challenging task, with the most effective methods currently relying on a human motion sequence as a control signal.
no code implementations • 19 Nov 2024 • Alejandro Pardo, Jui-Hsien Wang, Bernard Ghanem, Josef Sivic, Bryan Russell, Fabian Caba Heilbron
The objective of this work is to manipulate visual timelines (e. g. a video) through natural language instructions, making complex timeline editing tasks accessible to non-expert or potentially even disabled users.
no code implementations • 17 Jun 2024 • Salma Abdel Magid, Jui-Hsien Wang, Kushal Kafle, Hanspeter Pfister
Vision Language Models (VLMs) such as CLIP are powerful models; however they can exhibit unwanted biases, making them less safe when deployed directly in applications such as text-to-image, text-to-video retrievals, reverse search, or classification tasks.
no code implementations • CVPR 2024 • Reuben Tan, Ximeng Sun, Ping Hu, Jui-Hsien Wang, Hanieh Deilamsalehy, Bryan A. Plummer, Bryan Russell, Kate Saenko
Long video question answering is a challenging task that involves recognizing short-term activities and reasoning about their fine-grained relationships.
no code implementations • 6 Feb 2024 • Tica Lin, Hanspeter Pfister, Jui-Hsien Wang
This research underscores the importance of robust early-stage evaluation tools in GenAI development, contributing to the advancement of fair and high-quality GenAI models.
no code implementations • NeurIPS 2023 • Mason Wang, Samuel Clarke, Jui-Hsien Wang, Ruohan Gao, Jiajun Wu
A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions.
no code implementations • CVPR 2023 • Samuel Clarke, Ruohan Gao, Mason Wang, Mark Rau, Julia Xu, Jui-Hsien Wang, Doug L. James, Jiajun Wu
Objects make unique sounds under different perturbations, environment conditions, and poses relative to the listener.
1 code implementation • 4 Apr 2022 • Paul Liu, Jui-Hsien Wang
Trajectory estimation is a fundamental component of racket sport analytics, as the trajectory contains information not only about the winning and losing of each point, but also how it was won or lost.
Ranked #2 on
Sports Ball Detection and Tracking
on Basketball