Search Results for author: Jui-Hsien Wang

Found 11 papers, 2 papers with code

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

no code implementations8 Jan 2025 Andrew Bond, Jui-Hsien Wang, Long Mai, Erkut Erdem, Aykut Erdem

To address these issues, we introduce a novel neural video representation that combines 3D Gaussian splatting with continuous camera motion modeling.

Video Compression

TransPixeler: Advancing Text-to-Video Generation with Transparency

1 code implementation6 Jan 2025 Luozhou Wang, Yijun Li, Zhifei Chen, Jui-Hsien Wang, Zhifei Zhang, He Zhang, Zhe Lin, Yingcong Chen

Text-to-video generative models have made significant strides, enabling diverse applications in entertainment, advertising, and education.

Text-to-Video Generation Video Generation

Move-in-2D: 2D-Conditioned Human Motion Generation

no code implementations17 Dec 2024 Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang, Difan Liu, Feng Liu, Ming-Hsuan Yang, Zhan Xu

Generating realistic human videos remains a challenging task, with the most effective methods currently relying on a human motion sequence as a control signal.

Motion Generation

Generative Timelines for Instructed Visual Assembly

no code implementations19 Nov 2024 Alejandro Pardo, Jui-Hsien Wang, Bernard Ghanem, Josef Sivic, Bryan Russell, Fabian Caba Heilbron

The objective of this work is to manipulate visual timelines (e. g. a video) through natural language instructions, making complex timeline editing tasks accessible to non-expert or potentially even disabled users.

Language Modelling

They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias

no code implementations17 Jun 2024 Salma Abdel Magid, Jui-Hsien Wang, Kushal Kafle, Hanspeter Pfister

Vision Language Models (VLMs) such as CLIP are powerful models; however they can exhibit unwanted biases, making them less safe when deployed directly in applications such as text-to-image, text-to-video retrievals, reverse search, or classification tasks.

All counterfactual +2

Koala: Key frame-conditioned long video-LLM

no code implementations CVPR 2024 Reuben Tan, Ximeng Sun, Ping Hu, Jui-Hsien Wang, Hanieh Deilamsalehy, Bryan A. Plummer, Bryan Russell, Kate Saenko

Long video question answering is a challenging task that involves recognizing short-term activities and reasoning about their fine-grained relationships.

Action Recognition Question Answering +2

GenLens: A Systematic Evaluation of Visual GenAI Model Outputs

no code implementations6 Feb 2024 Tica Lin, Hanspeter Pfister, Jui-Hsien Wang

This research underscores the importance of robust early-stage evaluation tools in GenAI development, contributing to the advancement of fair and high-quality GenAI models.

Fairness

SoundCam: A Dataset for Finding Humans Using Room Acoustics

no code implementations NeurIPS 2023 Mason Wang, Samuel Clarke, Jui-Hsien Wang, Ruohan Gao, Jiajun Wu

A room's acoustic properties are a product of the room's geometry, the objects within the room, and their specific positions.

MonoTrack: Shuttle trajectory reconstruction from monocular badminton video

1 code implementation4 Apr 2022 Paul Liu, Jui-Hsien Wang

Trajectory estimation is a fundamental component of racket sport analytics, as the trajectory contains information not only about the winning and losing of each point, but also how it was won or lost.

3D Reconstruction Sports Ball Detection and Tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.