Search Results for author: Yuyang Zhao

Found 17 papers, 7 papers with code

Zero-shot Point Cloud Completion Via 2D Priors

no code implementations • 10 Apr 2024 • Tianxin Huang, Zhiwen Yan, Yuyang Zhao, Gim Hee Lee

3D point cloud completion is designed to recover complete shapes from partially observed point clouds.

Paper
Add Code

Segment Any 3D Object with Language

no code implementations • 2 Apr 2024 • Seungjun Lee, Yuyang Zhao, Gim Hee Lee

In addition, to align the 3D segmentation model with various language instructions and enhance the mask quality, we introduce three types of multimodal associations as supervision.

3D Instance Segmentation Object +2

Paper
Add Code

Animate124: Animating One Image to 4D Dynamic Scene

no code implementations • 24 Nov 2023 • Yuyang Zhao, Zhiwen Yan, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee

We introduce Animate124 (Animate-one-image-to-4D), the first work to animate a single in-the-wild image into 3D video through textual motion descriptions, an underexplored problem with significant applications.

Paper
Add Code

VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

no code implementations • 8 Oct 2023 • Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv, Hui Miao, Li Guo, Shujun Zhang, Cheng Pei, Xiaojuan Fan, Jianqin Lei, Ting Wei, Junguo Duan, Chun Liu, Xiaobo Xia, Siqi Xiong, Junhong Li, Benny Lo, Yih Chung Tham, Tien Yin Wong, Ningli Wang, Wu Yuan

To be commensurate with this capacity, in addition to the real data used for pre-training, we also generated and leveraged synthetic ophthalmic imaging data.

Disease Prediction Representation Learning

Paper
Add Code

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels

2 code implementations • 15 Sep 2023 • Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou

Recently, many parameter-efficient fine-tuning (PEFT) methods have been proposed, and their experiments demonstrate that tuning only 1% of extra parameters could surpass full fine-tuning in low-data resource scenarios.

Domain Generalization Few-Shot Learning

Paper
Code

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

no code implementations • 15 May 2023 • Yuyang Zhao, Enze Xie, Lanqing Hong, Zhenguo Li, Gim Hee Lee

The text-driven image and video diffusion models have achieved unprecedented success in generating realistic and diverse content.

Denoising Video Editing +1

Paper
Add Code

Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm

no code implementations • 14 Mar 2023 • Hengyuan Zhao, Hao Luo, Yuyang Zhao, Pichao Wang, Fan Wang, Mike Zheng Shou

In view of the practicality of PETL, previous works focus on tuning a small set of parameters for each downstream task in an end-to-end manner while rarely considering the task distribution shift issue between the pre-training task and the downstream task.

Transfer Learning Vocal Bursts Valence Prediction

Paper
Add Code

Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization

1 code implementation • 18 Dec 2022 • Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee

Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning.

Domain Generalization Hallucination +4

Paper
Code

Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds

no code implementations • 9 Dec 2022 • Yuyang Zhao, Na Zhao, Gim Hee Lee

In addition, we augment the point patterns of the source data and introduce non-parametric multi-prototypes to ameliorate the intra-class variance enlarged by the augmented point patterns.

Domain Generalization Semantic Segmentation

Paper
Add Code

Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation

1 code implementation • 11 Jul 2022 • Zhun Zhong, Yuyang Zhao, Gim Hee Lee, Nicu Sebe

Experiments on two synthetic-to-real semantic segmentation benchmarks demonstrate that AdvStyle can significantly improve the model performance on unseen real domains and show that we can achieve the state of the art.

Ranked #12 on Domain Generalization on GTA-to-Avg(Cityscapes,BDD,Mapillary)

Domain Generalization Image Classification +1

Paper
Code

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation

2 code implementations • 6 Apr 2022 • Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee

Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning.

Ranked #4 on Robust Object Detection on DWD

Domain Generalization Hallucination +3

Paper
Code

Novel Class Discovery in Semantic Segmentation

1 code implementation • CVPR 2022 • Yuyang Zhao, Zhun Zhong, Nicu Sebe, Gim Hee Lee

We introduce a new setting of Novel Class Discovery in Semantic Segmentation (NCDSS), which aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes.

Image Classification Novel Class Discovery +3

Paper
Code

SA-MATD3:Self-attention-based multi-agent continuous control method in cooperative environments

no code implementations • 1 Jul 2021 • Kai Liu, Yuyang Zhao, Gang Wang, Bei Peng

Cooperative problems under continuous control have always been the focus of multi-agent reinforcement learning.

Continuous Control Multi-agent Reinforcement Learning

Paper
Add Code

Source-Free Open Compound Domain Adaptation in Semantic Segmentation

1 code implementation • 7 Jun 2021 • Yuyang Zhao, Zhun Zhong, Zhiming Luo, Gim Hee Lee, Nicu Sebe

Second, CPSS can reduce the influence of noisy pseudo-labels and also avoid the model overfitting to the target domain during self-supervised learning, consistently boosting the performance on the target and open domains.

Domain Generalization Self-Supervised Learning +1

Paper
Code

Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification

1 code implementation • CVPR 2021 • Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe

In this paper, we study the problem of multi-source domain generalization in ReID, which aims to learn a model that can perform well on unseen domains with only several labeled source domains.

Domain Generalization Meta-Learning +1

Paper
Code

STC-Flow: Spatio-temporal Context-aware Optical Flow Estimation

no code implementations • 1 Mar 2020 • Xiaolin Song, Yuyang Zhao, Jingyu Yang

In this paper, we propose a spatio-temporal contextual network, STC-Flow, for optical flow estimation.

Optical Flow Estimation

Paper
Add Code

FPCR-Net: Feature Pyramidal Correlation and Residual Reconstruction for Optical Flow Estimation

no code implementations • 17 Jan 2020 • Xiaolin Song, Yuyang Zhao, Jingyu Yang, Cuiling Lan, Wenjun Zeng

To exploit such flexible and comprehensive information, we propose a semi-supervised Feature Pyramidal Correlation and Residual Reconstruction Network (FPCR-Net) for optical flow estimation from frame pairs.

Optical Flow Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.