no code implementations • 7 Mar 2023 • Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang
Deep neural networks have recently achieved breakthroughs in sound generation with text prompts.
2 code implementations • 29 Jan 2023 • Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D. Plumbley
By learning the latent representations of audio signals and their compositions without modeling the cross-modal relationship, AudioLDM is advantageous in both generation quality and computational efficiency.
Ranked #1 on
Audio Generation
on AudioCaps
no code implementations • 19 Jan 2023 • Shizun Wang, Weihong Zeng, Xu Wang, Hao Yang, Li Chen, Yi Yuan, Yunzhao Zeng, Min Zheng, Chuang Zhang, Ming Wu
To this end, we propose SwiftAvatar, a novel avatar auto-creation framework that is evidently superior to previous works.
no code implementations • 23 Dec 2021 • Guangming Yao, Hongzhi Wu, Yi Yuan, Lincheng Li, Kun Zhou, Xin Yu
In this paper, we present a novel double diffusion based neural radiance field, dubbed DD-NeRF, to reconstruct human body geometry and render the human body appearance in novel views from a sparse set of images.
no code implementations • 8 Aug 2021 • Qi Wen, Shuang Li, Bingfeng Han, Yi Yuan
Chinese character style transfer is a very challenging problem because of the complexity of the glyph shapes or underlying structures and large numbers of existed characters, when comparing with English letters.
1 code implementation • 1 Jun 2021 • Jianbiao Mei, Mengmeng Wang, Yeneng Lin, Yi Yuan, Yong liu
Recently, Space-Time Memory Network (STM) based methods have achieved state-of-the-art performance in semi-supervised video object segmentation (VOS).
One-shot visual object segmentation
Semantic Segmentation
+1
1 code implementation • 1 Mar 2021 • Yinglin Duan, Tianyang Shi, Zhengxia Zou, Yenan Lin, Zhehui Qian, Bohan Zhang, Yi Yuan
Motion completion is a challenging and long-discussed problem, which is of great significance in film and game applications.
Ranked #2 on
Motion Synthesis
on LaFAN1
no code implementations • 8 Feb 2021 • Lijuan Liu, Yin Yang, Yi Yuan, Tianjia Shao, He Wang, Kun Zhou
In this paper, we propose an effective global relation learning algorithm to recommend an appropriate location of a building unit for in-game customization of residential home complex.
no code implementations • 8 Feb 2021 • Guangming Yao, Yi Yuan, Tianjia Shao, Shuang Li, Shanqi Liu, Yong liu, Mengmeng Wang, Kun Zhou
The paper proposes a novel generative adversarial network for one-shot face reenactment, which can animate a single face image to a different pose-and-expression (provided by a driving image) while keeping its original appearance.
no code implementations • 5 Feb 2021 • Jilin Tang, Yi Yuan, Tianjia Shao, Yong liu, Mengmeng Wang, Kun Zhou
In this paper we tackle the problem of pose guided person image generation, which aims to transfer a person image from the source pose to a novel target pose while maintaining the source appearance.
1 code implementation • 4 Feb 2021 • Jiangke Lin, Yi Yuan, Zhengxia Zou
To tackle these problems, we propose 1) a low-cost facial texture acquisition method, 2) a shape transfer algorithm that can transform the shape of a 3DMM mesh to games, and 3) a new pipeline for training 3D game face reconstruction networks.
no code implementations • ICCV 2021 • Tianxin Huang, Hao Zou, Jinhao Cui, Xuemeng Yang, Mengmeng Wang, Xiangrui Zhao, Jiangning Zhang, Yi Yuan, Yifan Xu, Yong liu
The RFE extracts multiple global features from the incomplete point clouds for different recurrent levels, and the FDC generates point clouds in a coarse-to-fine pipeline.
1 code implementation • 31 Dec 2020 • Zhengxia Zou, Tianyang Shi, Yi Yuan, Zhenwei Shi
This paper studies an interesting question that whether a deep CNN can be trained to recover the depth behind an autostereogram and understand its content.
1 code implementation • 14 Dec 2020 • Xiaoyang Lyu, Liang Liu, Mengmeng Wang, Xin Kong, Lina Liu, Yong liu, Xinxin Chen, Yi Yuan
To obtainmore accurate depth estimation in large gradient regions, itis necessary to obtain high-resolution features with spatialand semantic information.
4 code implementations • CVPR 2021 • Zhengxia Zou, Tianyang Shi, Shuang Qiu, Yi Yuan, Zhenwei Shi
Different from previous image-to-image translation methods that formulate the translation as pixel-wise prediction, we deal with such an artistic creation process in a vectorized environment and produce a sequence of physically meaningful stroke parameters that can be further used for rendering.
no code implementations • 27 Sep 2020 • Yinglin Duan, Tianyang Shi, Zhengxia Zou, Jia Qin, Yifei Zhao, Yi Yuan, Jie Hou, Xiang Wen, Changjie Fan
Previous works of this topic consider music-to-dance as a supervised motion generation problem based on time-series data.
no code implementations • 25 Aug 2020 • Wenheng Chen, He Wang, Yi Yuan, Tianjia Shao, Kun Zhou
We evaluate our model on a wide range of motions and compare it with the state-of-the-art methods.
no code implementations • 20 Aug 2020 • Xinhui Song, Tianyang Shi, Zunlei Feng, Mingli Song, Jackie Lin, Chuan-Jie Lin, Changjie Fan, Yi Yuan
Facial action unit (AU) intensity is an index to describe all visually discernible facial movements.
no code implementations • 18 Aug 2020 • Guangming Yao, Yi Yuan, Tianjia Shao, Kun Zhou
In this paper, we introduce a method for one-shot face reenactment, which uses the reconstructed 3D meshes (i. e., the source mesh and driving mesh) as guidance to learn the optical flow needed for the reenacted face synthesis.
no code implementations • 17 Aug 2020 • Tianyang Shi, Zhengxia Zou, Yi Yuan, Changjie Fan
With the rapid development of Role-Playing Games (RPGs), players are now allowed to edit the facial appearance of their in-game characters with their preferences rather than using default templates.
1 code implementation • 17 Aug 2020 • Tianyang Shi, Zhengxia Zou, Xinhui Song, Zheng Song, Changjian Gu, Changjie Fan, Yi Yuan
Besides, the neural network based renderer used in previous methods is also difficult to be extended to multi-view rendering cases.
no code implementations • ECCV 2020 • Yifan Xu, Tianqi Fan, Yi Yuan, Gurprit Singh
Deep implicit field regression methods are effective for 3D reconstruction from single-view images.
no code implementations • 13 Apr 2020 • Xinhui Song, Tianyang Shi, Tianjia Shao, Yi Yuan, Zunlei Feng, Changjie Fan
The generator learns to "render" a face image from a set of facial parameters in a differentiable way, and the feature extractor extracts deep features for measuring the similarity of the rendered image and input real image.
3 code implementations • CVPR 2020 • Jiangke Lin, Yi Yuan, Tianjia Shao, Kun Zhou
In this paper, we introduce a method to reconstruct 3D facial shapes with high-fidelity textures from single-view images in-the-wild, without the need to capture a large-scale face texture database.
no code implementations • ICCV 2019 • Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong liu
Character customization system is an important component in Role-Playing Games (RPGs), where players are allowed to edit the facial appearance of their in-game characters with their own preferences rather than using default templates.
no code implementations • 27 May 2019 • Guanzhong Tian, Yi Yuan, Yong liu
We propose an end to end deep learning approach for generating real-time facial animation from just audio.