Search Results for author: Yuyang Zhao

Found 18 papers, 8 papers with code

X-Ray: A Sequential 3D Representation For Generation

1 code implementation22 Apr 2024 Tao Hu, Wenhang Ge, Yuyang Zhao, Gim Hee Lee

We introduce X-Ray, a novel 3D sequential representation inspired by the penetrability of x-ray scans.

3D Generation Object

Zero-shot Point Cloud Completion Via 2D Priors

no code implementations10 Apr 2024 Tianxin Huang, Zhiwen Yan, Yuyang Zhao, Gim Hee Lee

3D point cloud completion is designed to recover complete shapes from partially observed point clouds.

Colorization Point Cloud Completion

Segment Any 3D Object with Language

no code implementations2 Apr 2024 Seungjun Lee, Yuyang Zhao, Gim Hee Lee

In addition, to align the 3D segmentation model with various language instructions and enhance the mask quality, we introduce three types of multimodal associations as supervision.

3D Instance Segmentation Decoder +3

Animate124: Animating One Image to 4D Dynamic Scene

no code implementations24 Nov 2023 Yuyang Zhao, Zhiwen Yan, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee

We introduce Animate124 (Animate-one-image-to-4D), the first work to animate a single in-the-wild image into 3D video through textual motion descriptions, an underexplored problem with significant applications.

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations15 Sep 2023 Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Experiments on 19 visual transfer learning downstream tasks demonstrate that our SCT outperforms full fine-tuning on 18 out of 19 tasks by adding only 0. 11M parameters of the ViT-B, which is 780$\times$ fewer than its full fine-tuning counterpart.

Domain Generalization Few-Shot Learning +1

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

no code implementations15 May 2023 Yuyang Zhao, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee

The text-driven image and video diffusion models have achieved unprecedented success in generating realistic and diverse content.

Denoising Video Editing +1

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations14 Mar 2023 Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization

1 code implementation18 Dec 2022 Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee

Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning.

Domain Generalization Hallucination +4

Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds

no code implementations9 Dec 2022 Yuyang Zhao, Na Zhao, Gim Hee Lee

In addition, we augment the point patterns of the source data and introduce non-parametric multi-prototypes to ameliorate the intra-class variance enlarged by the augmented point patterns.

Domain Generalization Semantic Segmentation

Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation

1 code implementation11 Jul 2022 Zhun Zhong, Yuyang Zhao, Gim Hee Lee, Nicu Sebe

Experiments on two synthetic-to-real semantic segmentation benchmarks demonstrate that AdvStyle can significantly improve the model performance on unseen real domains and show that we can achieve the state of the art.

Domain Generalization Image Classification +1

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation

2 code implementations6 Apr 2022 Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee

Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning.

Domain Generalization Hallucination +3

Novel Class Discovery in Semantic Segmentation

1 code implementation CVPR 2022 Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee

We introduce a new setting of Novel Class Discovery in Semantic Segmentation (NCDSS), which aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes.

Image Classification Novel Class Discovery +3

Source-Free Open Compound Domain Adaptation in Semantic Segmentation

1 code implementation7 Jun 2021 Yuyang Zhao, Zhun Zhong, Zhiming Luo, Gim Hee Lee, Nicu Sebe

Second, CPSS can reduce the influence of noisy pseudo-labels and also avoid the model overfitting to the target domain during self-supervised learning, consistently boosting the performance on the target and open domains.

Domain Generalization Self-Supervised Learning +1

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

1 code implementation CVPR 2021 Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe

In this paper, we study the problem of multi-source domain generalization in ReID, which aims to learn a model that can perform well on unseen domains with only several labeled source domains.

Domain Generalization Meta-Learning +1

STC-Flow: Spatio-temporal Context-aware Optical Flow Estimation

no code implementations1 Mar 2020 Xiaolin Song, Yuyang Zhao, Jingyu Yang

In this paper, we propose a spatio-temporal contextual network, STC-Flow, for optical flow estimation.

Optical Flow Estimation

FPCR-Net: Feature Pyramidal Correlation and Residual Reconstruction for Optical Flow Estimation

no code implementations17 Jan 2020 Xiaolin Song, Yuyang Zhao, Jingyu Yang, Cuiling Lan, Wenjun Zeng

To exploit such flexible and comprehensive information, we propose a semi-supervised Feature Pyramidal Correlation and Residual Reconstruction Network (FPCR-Net) for optical flow estimation from frame pairs.

Optical Flow Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.