no code implementations • 20 Mar 2025 • Tiange Xiang, Kai Li, Chengjiang Long, Christian Häne, Peihong Guo, Scott Delp, Ehsan Adeli, Li Fei-Fei
Recent advances in text-to-image diffusion models have been driven by the increasing availability of paired 2D data.
no code implementations • 19 Nov 2024 • Haoyu Zhao, Hao Wang, Xingyue Zhao, Hongqiu Wang, Zhiyu Wu, Chengjiang Long, Hua Zou
Recent advancements in 3D generation models have opened new possibilities for simulating dynamic 3D object movements and customizing behaviors, yet creating this content remains challenging.
no code implementations • 13 May 2024 • Clinton Mo, Kun Hu, Chengjiang Long, Dong Yuan, Zhiyong Wang
Comprehensive experiments demonstrate the effectiveness of PC-MRL in motion interpolation for desired skeletons without supervision from native datasets.
1 code implementation • 3 Feb 2024 • Yongwei Nie, Changzhen Liu, Chengjiang Long, Qing Zhang, Guiqing Li, Hongmin Cai
Our key idea is that with multiple RoIs as input, we can estimate multiple local cameras and have the opportunity to design and apply additional constraints between cameras to improve the accuracy of the cameras and, in turn, the accuracy of the corresponding 3D mesh.
1 code implementation • 25 Jan 2024 • Yongwei Nie, Mingxian Fan, Chengjiang Long, Qing Zhang, Jian Zhu, Xuemiao Xu
In this way, we obtain a meta-model, the meta-parameter of which is friendly to the test-time optimization.
1 code implementation • 24 Jan 2024 • Yongwei Nie, Hao Huang, Chengjiang Long, Qing Zhang, Pradipta Maji, Hongmin Cai
Video Anomaly Detection (VAD) has been extensively studied under the settings of One-Class Classification (OCC) and Weakly-Supervised learning (WS), which however both require laborious human-annotated normal/abnormal labels.
no code implementations • 10 Dec 2023 • Wenju Xu, Chengjiang Long, Yongwei Nie, Guanghui Wang
Unlike the existing works leveraging the semantic masks to obtain the representation of each component, we propose to generate disentangled latent code via a novel attribute encoder with transformers trained in a manner of curriculum learning from a relatively easy step to a gradually hard one.
no code implementations • CVPR 2023 • Zhijun Zhai, Jianhui Zhao, Chengjiang Long, Wenju Xu, Shuangjiang He, Huijuan Zhao
Micro-expressions are spontaneous, rapid and subtle facial movements that can neither be forged nor suppressed.
Micro Expression Recognition
Micro-Expression Recognition
+2
no code implementations • CVPR 2023 • Wenju Xu, Chengjiang Long, Yongwei Nie
Arbitrary style transfer has been demonstrated to be efficient in artistic image generation.
1 code implementation • CVPR 2023 • Clinton Ansun Mo, Kun Hu, Chengjiang Long, Zhiyong Wang
Deriving sophisticated 3D motions from sparse keyframes is a particularly challenging problem, due to continuity and exceptionally skeletal precision.
no code implementations • 12 Oct 2022 • Yuanyuan Liu, Chengjiang Long, Zhaoxuan Zhang, Bokai Liu, Qiang Zhang, BaoCai Yin, Xin Yang
3D scene graph generation (SGG) has been of high interest in computer vision.
2 code implementations • 15 Jul 2022 • Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
In this paper, we propose a novel sampling strategy for sampling very diverse results from an imbalanced multimodal distribution learned by a deep generative model.
Ranked #2 on
Human Pose Forecasting
on HumanEva-I
1 code implementation • CVPR 2022 • Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu Zhou, Yimin Yang, Chunxia Xiao
Our proposed approach is extensively validated on the ViSha dataset and a self-annotated dataset.
1 code implementation • 26 May 2022 • Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Fang Zheng, Nanning Zheng, Gang Hua
Understanding the multiple socially-acceptable future behaviors is an essential task for many vision applications.
1 code implementation • CVPR 2022 • Tiezheng Ma, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
This motivates us to propose a novel two-stage prediction framework, including an init-prediction network that just computes the good guess and then a formal-prediction network that predicts the target future poses based on the guess.
Ranked #10 on
Human Pose Forecasting
on Human3.6M
no code implementations • 11 Dec 2021 • Yu Qiao, Jincheng Zhu, Chengjiang Long, Zeyao Zhang, Yuxin Wang, Zhenjun Du, Xin Yang
Acquiring the most representative examples via active learning (AL) can benefit many data-dependent computer vision tasks by minimizing efforts of image-level or pixel-wise annotations.
1 code implementation • 14 Sep 2021 • Hanning Yu, Wentao Liu, Chengjiang Long, Bo Dong, Qin Zou, Chunxia Xiao
Based on this observation, we propose a novel normalization method called " HDR calibration " for HDR images stored in relative luminance, calibrating HDR images into a similar luminance scale according to the LDR images.
1 code implementation • ICCV 2021 • Zipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao
In this paper, we propose a novel two-stage context-aware network named CANet for shadow removal, in which the contextual information from non-shadow regions is transferred to shadow regions at the embedded feature spaces.
1 code implementation • ICCV 2021 • Wenju Xu, Chengjiang Long, Ruisheng Wang, Guanghui Wang
The style code is modeled as the shared parameters for Dynamic ResBlocks connecting both the style encoding network and the style transfer network.
1 code implementation • ICCV 2021 • Zhian Liu, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
In this paper, we propose $\text{HF}^2$-VAD, a Hybrid framework that integrates Flow reconstruction and Frame prediction seamlessly to handle Video Anomaly Detection.
Ranked #1 on
Video Anomaly Detection
on Ped2
1 code implementation • ICCV 2021 • Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
The extracted features at each scale are then combined and decoded to obtain the residuals between the input and target poses.
Ranked #13 on
Human Pose Forecasting
on Human3.6M
1 code implementation • 5 Aug 2021 • Xinzhi Dong, Chengjiang Long, Wenju Xu, Chunxia Xiao
With the well-designed Dual-GCN, we can make the linguistic transformer better understand the relationship between different objects in a single image and make full use of similar images as auxiliary information to generate a reasonable caption description for a single image.
no code implementations • CVPR 2022 • Zhongyun Bao, Chengjiang Long, Gang Fu, Daquan Liu, Yuanzhen Li, Jiaming Wu, Chunxia Xiao
Specifically, we firstly apply a physically-based rendering method to construct a large-scale, high-quality dataset (named IH) for our task, which contains various types of foreground objects and background scenes with different lighting conditions.
no code implementations • 28 Jul 2021 • Tao Hu, Chengjiang Long, Chunxia Xiao
Based on those constraints, a category-consistent and relativistic diverse conditional GAN (CRD-CGAN) is proposed to synthesize $K$ photo-realistic images simultaneously.
no code implementations • CVPR 2021 • Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Mo Zhou, Zhenxing Niu, Gang Hua
Specifically, the SGCN explicitly models the sparse directed interaction with a sparse directed spatial graph to capture adaptive interaction pedestrians.
no code implementations • 24 May 2021 • Mengxiao Tian, Hao Guo, Chengjiang Long
Recently the crowd counting has received more and more attention.
1 code implementation • 21 Apr 2021 • Jiqing Zhang, Chengjiang Long, Yuxin Wang, Haiyin Piao, Haiyang Mei, Xin Yang, BaoCai Yin
Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and contribute remarkable progress.
4 code implementations • 4 Apr 2021 • Liushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Mo Zhou, Zhenxing Niu, Gang Hua
Meanwhile, we use a sparse directed temporal graph to model the motion tendency, thus to facilitate the prediction based on the observed direction.
1 code implementation • 3 Jan 2021 • Ashraful Islam, Chengjiang Long, Richard Radke
Moreover, our temporal semi-soft and hard attention modules, calculating two attention scores for each video snippet, help to focus on the less discriminative frames of an action to capture the full action boundary.
no code implementations • 18 Dec 2019 • Bhavan Vasu, Chengjiang Long
Deep neural networks have achieved great success in many real-world applications, yet it remains unclear and difficult to explain their decision-making process to an end-user.
3 code implementations • 20 Nov 2019 • Ling Zhang, Chengjiang Long, Xiaolong Zhang, Chunxia Xiao
To our best knowledge, we are the first one to explore residual and illumination for shadow removal.
no code implementations • ICCV 2019 • Bin Ding, Chengjiang Long, Ling Zhang, Chunxia Xiao
In this paper we propose an attentive recurrent generative adversarial network (ARGAN) to detect and remove shadows in an image.
Generative Adversarial Network
Shadow Detection And Removal
+1
no code implementations • 26 Jul 2019 • Tao Hu, Chengjiang Long, Leheng Zhang, Chunxia Xiao
In this paper, we propose a novel way to interpret text information by extracting visual feature presentation from multiple high-resolution and photo-realistic synthetic images generated by Text-to-image Generative Adversarial Network (GAN) to improve the performance of image labeling.
no code implementations • 27 Nov 2018 • Chengjiang Long, Arslan Basharat, Anthony Hoogs
First, an I3D network finds coarse-level matches between candidate duplicated frame sequences and the corresponding selected original frame sequences.
no code implementations • 27 Jan 2018 • Chengjiang Long, Roddy Collins, Eran Swears, Anthony Hoogs
We propose a novel method for predicting image labels by fusing image content descriptors with the social media context of each image.
no code implementations • CVPR 2017 • Chengjiang Long, Gang Hua
A set of correlational tensors is adopted to model the relationship within a single domain as well as across multiple domains.
no code implementations • 5 Mar 2017 • Yongwei Nie, Xu Cao, Chengjiang Long, Ping Li, Guiqing Li
Current face alignment algorithms can robustly find a set of landmarks along face contour.
no code implementations • ICCV 2015 • Chengjiang Long, Gang Hua
Based on the EP approximation inference, a generalized Expectation Maximization (GEM) algorithm is derived to estimate both the parameters for instances and the quality of each individual annotator.