1 code implementation • 2 Apr 2024 • Fangzhou Mu, Sicheng Mo, Yin Li
In this paper, we study the effect of cross-modal fusion on the scalability of video grounding models.
no code implementations • 26 Mar 2024 • Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li
We present a method for reconstructing the 3D shape of arbitrary Lambertian objects from measurements captured by miniature, energy-efficient, low-cost single-photon cameras.
1 code implementation • 22 Feb 2024 • Zhuoyan Xu, Zhenmei Shi, Junyi Wei, Fangzhou Mu, Yin Li, Yingyu Liang
An emerging solution with recent success in vision and NLP involves finetuning a foundation model on a selection of relevant tasks, before its adaptation to a target task with limited labeled samples.
no code implementations • 12 Dec 2023 • Sicheng Mo, Fangzhou Mu, Kuan Heng Lin, Yanli Liu, Bochen Guan, Yin Li, Bolei Zhou
Recent approaches such as ControlNet offer users fine-grained spatial control over text-to-image (T2I) diffusion models.
1 code implementation • 7 Dec 2023 • Tiantian Wang, Xinxin Zuo, Fangzhou Mu, Jian Wang, Ming-Hsuan Yang
To overcome these limitations, we leverage Neural Radiance Fields (NeRFs) to represent videos, conducting stylization in the rendered feature space.
1 code implementation • 5 Jul 2023 • Lin Sui, Fangzhou Mu, Yin Li
This report describes our submission to the Ego4D Moment Queries Challenge 2023.
no code implementations • 25 May 2023 • Zhengyang Lou, Huan Xu, Fangzhou Mu, Yanli Liu, Xiaoyu Zhang, Liang Shang, Jiang Li, Bochen Guan, Yin Li, Yu Hen Hu
Using a modern game engine, our approach renders crisp clean images and their precise depth maps, based on which high-quality hazy images can be synthesized for training dehazing models.
1 code implementation • CVPR 2023 • Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee
Large-scale text-to-image diffusion models have made remarkable advances.
Ranked #4 on Conditional Text-to-Image Synthesis on COCO-MIG
no code implementations • ICCV 2023 • Felipe Gutierrez-Barragan, Fangzhou Mu, Andrei Ardelean, Atul Ingle, Claudio Bruschini, Edoardo Charbon, Yin Li, Mohit Gupta, Andreas Velten
Single-photon 3D cameras can record the time-of-arrival of billions of photons per second with picosecond accuracy.
2 code implementations • 16 Nov 2022 • Fangzhou Mu, Sicheng Mo, Gillian Wang, Yin Li
This report describes our submission to the Ego4D Moment Queries Challenge 2022.
Ranked #1 on Temporal Action Localization on Ego4D MQ test
1 code implementation • 16 Nov 2022 • Sicheng Mo, Fangzhou Mu, Yin Li
This report describes Badgers@UW-Madison, our submission to the Ego4D Natural Language Queries (NLQ) Challenge.
no code implementations • 3 May 2022 • Fangzhou Mu, Sicheng Mo, Jiayong Peng, Xiaochun Liu, Ji Hyun Nam, Siddeshwar Raghavan, Andreas Velten, Yin Li
The computational approach to imaging around corners, or non-line-of-sight (NLOS) imaging, is becoming a reality thanks to major advances in imaging hardware and reconstruction algorithms.
no code implementations • CVPR 2022 • Ran Xu, Fangzhou Mu, Jayoung Lee, Preeti Mukherjee, Somali Chaterji, Saurabh Bagchi, Yin Li
In this paper, we ask, and answer, the wide-ranging question across all MBODFs: How to expose the right set of execution branches and then how to schedule the optimal one at inference time?
no code implementations • CVPR 2022 • Fangzhou Mu, Jian Wang, Yicheng Wu, Yin Li
Our key intuition is that style transfer and view synthesis have to be jointly modeled for this task.
no code implementations • 16 Sep 2021 • Jiayong Peng, Fangzhou Mu, Ji Hyun Nam, Siddeshwar Raghavan, Yin Li, Andreas Velten, Zhiwei Xiong
Non-line-of-sight (NLOS) imaging is based on capturing multi-bounce indirect reflections from hidden objects.
no code implementations • ICLR 2020 • Fangzhou Mu, Yingyu Liang, Yin Li
We address the challenging problem of deep representation learning: the efficient adaptation of a pre-trained deep network to different tasks.