no code implementations • 12 Aug 2024 • Yinhuai Wang, Qihan Zhao, Runyi Yu, Ailing Zeng, Jing Lin, Zhengyi Luo, Hok Wai Tsui, Jiwen Yu, Xiu Li, Qifeng Chen, Jian Zhang, Lei Zhang, Ping Tan
SkillMimic employs a unified configuration to learn diverse skills from human-ball motion datasets, with skill diversity and generalization improving as the dataset grows.
no code implementations • 7 Dec 2023 • Yinhuai Wang, Jing Lin, Ailing Zeng, Zhengyi Luo, Jian Zhang, Lei Zhang
To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills.
no code implementations • 18 Aug 2023 • Shuzhou Yang, Xuanyu Zhang, Yinhuai Wang, Jiwen Yu, YuHan Wang, Jian Zhang
Specifically, we adopt a naive unsupervised enhancement algorithm to realize preliminary restoration and design two zero-shot plug-and-play modules based on diffusion models to improve generalization and effectiveness.
1 code implementation • ICCV 2023 • Jiwen Yu, Yinhuai Wang, Chen Zhao, Bernard Ghanem, Jian Zhang
In this work, we propose a training-Free conditional Diffusion Model (FreeDoM) used for various conditions.
1 code implementation • 1 Mar 2023 • Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang
Our simple, parameter-free approaches can be used not only for image restoration but also for image generation of unlimited sizes, with the potential to be a general tool for diffusion models.
1 code implementation • ICCV 2023 • Runyi Yu, Zhennan Wang, Yinhuai Wang, Kehan Li, Chang Liu, Haoyi Duan, Xiangyang Ji, Jie Chen
A typical way to introduce position information is to add the absolute Position Embedding (PE) to the patch embedding before it enters the VTs.
1 code implementation • 10 Dec 2022 • Runyi Yu, Zhennan Wang, Yinhuai Wang, Kehan Li, Yian Zhao, Jian Zhang, Guoli Song, Jie Chen
By analyzing the input and output of each encoder layer in VTs using reparameterization and visualization, we find that the default PE joining method (simply adding the PE and patch embedding together) applies the same affine transformation to the token embedding and the PE, which limits the expressiveness of the PE and hence constrains the performance of VTs.
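The observation above can be illustrated with a toy calculation. The following is a minimal sketch, not the authors' code: it assumes a single shared affine layer with hypothetical weights `W` and `bias`, and shows that applying it to the sum of a patch embedding and a PE is the same as transforming each term with the same `W`.

```python
# Illustrative sketch only (not the authors' implementation).
# Assumption: one shared affine layer y = W v + bias stands in for the
# transformations inside an encoder layer. All values are hypothetical.

def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def add(u, v):
    """Element-wise sum of two equal-length vectors."""
    return [a + b for a, b in zip(u, v)]

# Integer toy values keep the arithmetic exact.
W = [[2, -1, 0],
     [1, 3, 1]]
bias = [1, -2]
patch_embedding = [4, 0, -2]
position_embedding = [1, 2, 3]

# Default joining: add the PE to the patch embedding, then apply the layer.
joined = add(matvec(W, add(patch_embedding, position_embedding)), bias)

# Equivalent view: the same W acts on each term separately, so the PE
# never receives a transformation of its own.
separate = add(
    add(matvec(W, patch_embedding), matvec(W, position_embedding)),
    bias,
)

print(joined, separate)
```

Both expressions give the same vector because the linear part distributes over the sum, which is one way to read the claim that the PE and the token embedding share a single transformation.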
4 code implementations • 1 Dec 2022 • Yinhuai Wang, Jiwen Yu, Jian Zhang
Most existing Image Restoration (IR) models are task-specific and cannot be generalized to different degradation operators.
Ranked #1 on Image Compressed Sensing on CelebA
1 code implementation • 24 Nov 2022 • Yinhuai Wang, Yujie Hu, Jiwen Yu, Jian Zhang
Consistency and realness have always been the two critical issues of image super-resolution.
1 code implementation • 16 Mar 2022 • Yinhuai Wang, Yujie Hu, Jian Zhang
Emerging high-quality face restoration (FR) methods often utilize pre-trained GAN models (i.e., StyleGAN2) as GAN Prior.
1 code implementation • 10 Mar 2022 • Yinhuai Wang, Shuzhou Yang, Yujie Hu, Jian Zhang
Unlike the pinhole, the thin lens refracts rays of a scene point, so its imaging on the sensor plane is scattered as a circle of confusion (CoC).
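For readers unfamiliar with the term, the size of the circle of confusion can be estimated from standard thin-lens geometry. The sketch below uses the textbook formula c = A · |S2 − S1| / S2 · f / (S1 − f); it is not taken from the paper, and the function and parameter names are assumptions chosen for illustration.

```python
# Illustrative sketch using the textbook thin-lens formula; this is an
# assumption for explanation, not the formulation used in the paper.

def circle_of_confusion(aperture, focal_length, focus_distance, object_distance):
    """Diameter of the circle of confusion on the sensor plane.

    aperture        -- aperture diameter A
    focal_length    -- lens focal length f
    focus_distance  -- distance S1 at which the lens is focused
    object_distance -- distance S2 of the scene point
    All arguments share one length unit; the result is in that unit.
    """
    if focus_distance <= focal_length:
        raise ValueError("focus_distance must exceed focal_length")
    if object_distance <= 0:
        raise ValueError("object_distance must be positive")
    magnification = focal_length / (focus_distance - focal_length)
    defocus = abs(object_distance - focus_distance) / object_distance
    return aperture * defocus * magnification

# Hypothetical example: a 50 mm lens at f/2 (25 mm aperture) focused at 2 m,
# with scene points at 2 m (in focus) and 4 m (out of focus).
in_focus = circle_of_confusion(25.0, 50.0, 2000.0, 2000.0)
out_of_focus = circle_of_confusion(25.0, 50.0, 2000.0, 4000.0)
print(in_focus, out_of_focus)
```

A point at the focus distance yields a diameter of zero, which matches the pinhole case, while points nearer or farther spread into a disc whose size grows with the aperture.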