no code implementations • 22 Mar 2024 • Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang
In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.
no code implementations • 18 Mar 2024 • Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang
Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.
no code implementations • 10 Jan 2024 • Zhaoxi Chen, Gyeongsik Moon, Kaiwen Guo, Chen Cao, Stanislav Pidhorskyi, Tomas Simon, Rohan Joshi, Yuan Dong, Yichen Xu, Bernardo Pires, He Wen, Lucas Evans, Bo Peng, Julia Buffalini, Autumn Trimble, Kevyn McPhail, Melissa Schoeller, Shoou-I Yu, Javier Romero, Michael Zollhöfer, Yaser Sheikh, Ziwei Liu, Shunsuke Saito
To simplify the personalization process while retaining photorealism, we build a powerful universal relightable prior based on neural relighting from multi-view images of hands captured in a light stage with hundreds of identities.
1 code implementation • 16 Dec 2023 • Aodong Chen, Fei Xu, Li Han, Yuan Dong, Li Chen, Zhi Zhou, Fangming Liu
GPUs have become the defacto hardware devices to accelerate Deep Neural Network (DNN) inference in deep learning(DL) frameworks.
1 code implementation • 27 Nov 2023 • Yunhao Du, Cheng Lei, Zhicheng Zhao, Yuan Dong, Fei Su
Previous methods focus on learning from cross-modality person images in different cameras.
1 code implementation • 25 Nov 2023 • Delong Liu, Haiwen Li, Zhicheng Zhao, Fei Su, Yuan Dong
Searching for specific person has great social benefits and security value, and it often involves a combination of visual and textual information.
Ranked #1 on Zero-shot Composed Person Retrieval on ITCPR dataset (using extra training data)
no code implementations • 21 May 2023 • Yuan Dong, Chuan Fang, Liefeng Bo, Zilong Dong, Ping Tan
Panoramic image enables deeper understanding and more holistic perception of $360^\circ$ surrounding environment, which can naturally encode enriched scene context information compared to standard perspective image.
no code implementations • ICCV 2023 • Yurong Guo, Ruoyi Du, Yuan Dong, Timothy Hospedales, Yi-Zhe Song, Zhanyu Ma
In this paper, we first observe the dependence of task-specific parameter configuration on the target task.
no code implementations • 23 Dec 2022 • Weichao Shen, Yuan Dong, Zonghao Chen, Zhengyi Zhao, Yang Gao, Zhu Liu
In this paper, we propose PanoViT, a panorama vision transformer to estimate the room layout from a single panoramic image.
1 code implementation • 18 Apr 2022 • Yiming Zhang, Hong Yu, Ruoyi Du, Zhanyu Ma, Yuan Dong
To eliminate this negative effect, in this paper, we propose a two-stage framework for audio captioning: (i) in the first stage, via the contrastive learning, we construct a proxy feature space to reduce the distances between captions correlated to the same audio, and (ii) in the second stage, the proxy feature space is utilized as additional supervision to encourage the model to be optimized in the direction that benefits all the correlated captions.
no code implementations • 28 Jun 2021 • Donglai Xiang, Fabian Prada, Timur Bagautdinov, Weipeng Xu, Yuan Dong, He Wen, Jessica Hodgins, Chenglei Wu
To address these difficulties, we propose a method to build an animatable clothed body avatar with an explicit representation of the clothing on the upper body from multi-view captured videos.
no code implementations • 25 Jan 2021 • Yurong Guo, Zhanyu Ma, Xiaoxu Li, Yuan Dong
We consider this method of measuring relation of samples only models the sample-to-sample relation, while neglects the specificity of different tasks.
1 code implementation • 21 Aug 2019 • Yuan Dong, Dawei Li, Chi Zhang, Chuhan Wu, Hong Wang, Ming Xin, Jianlin Cheng, Jian Lin
A significant novelty of the proposed RGAN is that it combines the supervised and regressional convolutional neural network (CNN) with the traditional unsupervised GAN, thus overcoming the common technical barrier in the traditional GANs, which cannot generate data associated with given continuous quantitative labels.
Computational Physics Materials Science Applied Physics
no code implementations • 11 Mar 2019 • Qiushan Guo, Yuan Dong, Yu Guo, Hongliang Bai
We simultaneously propose an anchor assignment strategy which can cover faces with a wide range of scales to improve the recall rate of small faces and rotated faces.
1 code implementation • 26 Nov 2018 • Shuai Bai, Zhiqun He, Ting-Bing Xu, Zheng Zhu, Yuan Dong, Hongliang Bai
For visual tracking, most of the traditional correlation filters (CF) based methods suffer from the bottleneck of feature redundancy and lack of motion information.
no code implementations • 28 Sep 2018 • Yuan Dong, Chuhan Wu, Chi Zhang, Yingda Liu, Jianlin Cheng, Jian Lin
Moreover, given ubiquitous existence of topologies in materials, this work will stimulate widespread interests in applying deep learning algorithms to topological design of materials crossing atomic, nano-, meso-, and macro- scales.
Materials Science Computational Physics
no code implementations • ICCV 2017 • Tao Yu, Kaiwen Guo, Feng Xu, Yuan Dong, Zhaoqi Su, Jianhui Zhao, Jianguo Li, Qionghai Dai, Yebin Liu
To reduce the ambiguities of the non-rigid deformation parameterization on the surface graph nodes, we take advantage of the internal articulated motion prior for human performance and contribute a skeleton-embedded surface fusion (SSF) method.