no code implementations • ECCV 2020 • Xiangyu He, Zitao Mo, Ke Cheng, Weixiang Xu, Qinghao Hu, Peisong Wang, Qingshan Liu, Jian Cheng
The matrix composed of basis vectors is referred to as the proxy matrix, and auxiliary variables serve as the coefficients of this linear combination.
1 code implementation • ECCV 2020 • Ke Cheng, Yifan Zhang, Congqi Cao, Lei Shi, Jian Cheng, Hanqing Lu
Nevertheless, how to efficiently model the spatial-temporal skeleton graph without introducing extra computation burden is a challenging problem for industrial deployment.
1 code implementation • ICML 2020 • Peisong Wang, Qiang Chen, Xiangyu He, Jian Cheng
Network quantization is essential for deploying deep models to IoT devices due to the high efficiency, no matter on special hardware like TPU or general hardware like CPU and GPU.
no code implementations • 20 Sep 2023 • Xingting Yao, Qinghao Hu, Tielong Liu, Zitao Mo, Zeyu Zhu, Zhengyang Zhuge, Jian Cheng
We propose the temporal condensing-and-padding (TCP) strategy to tackle the masked samples to maintain regular temporal length, i. e., regular tensors, for hardware-friendly computation.
3 code implementations • 11 Sep 2023 • Dingfeng Shi, Qiong Cao, Yujie Zhong, Shan An, Jian Cheng, Haogang Zhu, DaCheng Tao
Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video.
Ranked #1 on
Temporal Action Localization
on MultiTHUMOS
1 code implementation • 21 Jul 2023 • Yiqun Chen, Qiang Chen, Peize Sun, Shoufa Chen, Jingdong Wang, Jian Cheng
We hope our work will bring the attention of the detection community to the localization bottleneck of current DETR-like models and highlight the potential of the RefineBox framework.
no code implementations • 26 Apr 2023 • Yan Wang, Jian Cheng, Yixin Chen, Shuai Shao, Lanyun Zhu, Zhenzhou Wu, Tao Liu, Haogang Zhu
In FVP, the visual prompt is parameterized using only a small amount of low-frequency learnable parameters in the input frequency space, and is learned by minimizing the segmentation loss between the predicted segmentation of the prompted target image and reliable pseudo segmentation label of the target image under the frozen model.
no code implementations • 11 Apr 2023 • Anda Cheng, Zhen Wang, Yaliang Li, Jian Cheng
The client encoding is calculated with a random projection-based procedure to protect each client's privacy.
1 code implementation • 1 Feb 2023 • Zeyu Zhu, Fanrong Li, Zitao Mo, Qinghao Hu, Gang Li, Zejian Liu, Xiaoyao Liang, Jian Cheng
Through an in-depth analysis of the topology of GNNs, we observe that the topology of the graph leads to significant differences between nodes, and most of the nodes in a graph appear to have a small aggregation value.
no code implementations • CVPR 2023 • Tao Xie, Shiguang Wang, Ke Wang, Linqi Yang, Zhiqiang Jiang, Xingcheng Zhang, Kun Dai, Ruifeng Li, Jian Cheng
In this work, we show that it is feasible to perform multiple tasks concurrently on point cloud with a straightforward yet effective multi-task network.
no code implementations • CVPR 2023 • Shiguang Wang, Tao Xie, Jian Cheng, Xingcheng Zhang, Haijun Liu
Technically, MDL-NAS constructs a coarse-to-fine search space, where the coarse search space offers various optimal architectures for different tasks while the fine search space provides fine-grained parameter sharing to tackle the inherent obstacles of multi-domain learning.
1 code implementation • 25 Nov 2022 • Yiqun Chen, Qiang Chen, Qinghao Hu, Jian Cheng
In this paper, we revisit these two assignment methods and find that bringing one-to-many assignment back to end-to-end fully convolutional detectors helps with model convergence.
1 code implementation • 25 Oct 2022 • Xingting Yao, Fanrong Li, Zitao Mo, Jian Cheng
In this paper, we propose GLIF, a unified spiking neuron, to fuse different bio-features in different neuronal behaviors, enlarging the representation space of spiking neurons.
1 code implementation • 20 Oct 2022 • Marcos V. Conde, Radu Timofte, Yibin Huang, Jingyang Peng, Chang Chen, Cheng Li, Eduardo Pérez-Pellitero, Fenglong Song, Furui Bai, Shuai Liu, Chaoyu Feng, Xiaotao Wang, Lei Lei, Yu Zhu, Chenghua Li, Yingying Jiang, Yong A, Peisong Wang, Cong Leng, Jian Cheng, Xiaoyu Liu, Zhicun Yin, Zhilu Zhang, Junyi Li, Ming Liu, WangMeng Zuo, Jun Jiang, Jinha Kim, Yue Zhang, Beiji Zou, Zhikai Zong, Xiaoxiao Liu, Juan Marín Vega, Michael Sloth, Peter Schneider-Kamp, Richard Röttger, Furkan Kınlı, Barış Özcan, Furkan Kıraç, Li Leyi, SM Nadim Uddin, Dipon Kumar Ghosh, Yong Ju Jung
Cameras capture sensor RAW images and transform them into pleasant RGB images, suitable for the human eyes, using their integrated Image Signal Processor (ISP).
1 code implementation • 17 Oct 2022 • Furkan Kınlı, Sami Menteş, Barış Özcan, Furkan Kıraç, Radu Timofte, Yi Zuo, Zitao Wang, Xiaowen Zhang, Yu Zhu, Chenghua Li, Cong Leng, Jian Cheng, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Tianzhi Ma, Zihan Gao, Wenxin He, Woon-Ha Yeo, Wang-Taek Oh, Young-Il Kim, Han-Cheol Ryu, Gang He, Shaoyi Long, S. M. A. Sharif, Rizwan Ali Naqvi, Sungjun Kim, Guisik Kim, Seohyeon Lee, Sabari Nathan, Priya Kansal
This paper introduces the methods and the results of AIM 2022 challenge on Instagram Filter Removal.
3 code implementations • 23 Aug 2022 • Ren Yang, Radu Timofte, Qi Zhang, Lin Zhang, Fanglong Liu, Dongliang He, Fu Li, He Zheng, Weihang Yuan, Pavel Ostyakov, Dmitry Vyal, Magauiya Zhussip, Xueyi Zou, Youliang Yan, Lei LI, Jingzhu Tang, Ming Chen, Shijie Zhao, Yu Zhu, Xiaoran Qin, Chenghua Li, Cong Leng, Jian Cheng, Claudio Rota, Marco Buzzelli, Simone Bianco, Raimondo Schettini, Dafeng Zhang, Feiyu Huang, Shizhuo Liu, Xiaobing Wang, Zhezhu Jin, Bingchen Li, Xin Li, Mingxi Li, Ding Liu, Wenbin Zou, Peijie Dong, Tian Ye, Yunchen Zhang, Ming Tan, Xin Niu, Mustafa Ayazoglu, Marcos Conde, Ui-Jin Choi, Zhuang Jia, Tianyu Xu, Yijian Zhang, Mao Ye, Dengyan Luo, Xiaofeng Pan, Liuhan Peng
The homepage of this challenge is at https://github. com/RenYang-home/AIM22_CompressSR.
1 code implementation • 3 Aug 2022 • Qinghao Hu, Gang Li, Qiman Wu, Jian Cheng
In this paper, we propose the PArallel Low-precision Quantization (PalQuant) method that approximates high-precision computations via learning parallel low-precision representations from scratch.
1 code implementation • 5 Jul 2022 • Weihan Cao, Yifan Zhang, Jianfei Gao, Anda Cheng, Ke Cheng, Jian Cheng
First, the difference in feature magnitude between the teacher and the student could enforce overly strict constraints on the student.
no code implementations • 24 Jun 2022 • Xia Jiang, Jian Zhang, Xiaoyu Shi, Jian Cheng
Meanwhile, the simulation results demonstrate the effectiveness of the delay reward, which is designed to outperform distributed reward mechanism} Compared with normal car-following behavior, the sensitivity analysis reveals that the energy can be saved to different extends (39. 27%-82. 51%) by adjusting the relative importance of the optimization goal.
1 code implementation • 13 Jun 2022 • Yanpeng Sun, Qiang Chen, Xiangyu He, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jian Cheng, Zechao Li, Jingdong Wang
In this paper, we rethink the paradigm and explore a new regime: {\em fine-tuning a small part of parameters in the backbone}.
Ranked #7 on
Few-Shot Semantic Segmentation
on COCO-20i (1-shot)
3 code implementations • CVPR 2022 • Qiang Chen, Qiman Wu, Jian Wang, Qinghao Hu, Tao Hu, Errui Ding, Jian Cheng, Jingdong Wang
We propose MixFormer to find a solution.
1 code implementation • 4 Apr 2022 • Weixiang Xu, Xiangyu He, Tianli Zhao, Qinghao Hu, Peisong Wang, Jian Cheng
The latest STTN shows that ResNet-18 with ternary weights and ternary activations achieves up to 68. 2% Top-1 accuracy on ImageNet.
1 code implementation • 29 Mar 2022 • Jian Cheng, Yanguang Wan, Dexin Zuo, Cuixia Ma, Jian Gu, Ping Tan, Hongan Wang, Xiaoming Deng, yinda zhang
3D hand pose estimation from single depth is a fundamental problem in computer vision, and has wide applications. However, the existing methods still can not achieve satisfactory hand pose estimation results due to view variation and occlusion of human hand.
Ranked #1 on
Hand Pose Estimation
on ICVL Hands
no code implementations • CVPR 2022 • Anda Cheng, Peisong Wang, Xi Sheryl Zhang, Jian Cheng
User-level differential privacy (DP) provides certifiable privacy guarantees to the information that is specific to any user's data in federated learning.
no code implementations • 3 Feb 2022 • Tao Liu, Shu Guo, Hao liu, Rui Kang, Mingyang Bai, Jiyang Jiang, Wei Wen, Xing Pan, Jun Tai, JianXin Li, Jian Cheng, Jing Jing, Zhenzhou Wu, Haijun Niu, Haogang Zhu, Zixiao Li, Yongjun Wang, Henry Brodaty, Perminder Sachdev, Daqing Li
Degeneration and adaptation are two competing sides of the same coin called resilience in the progressive processes of brain aging or diseases.
no code implementations • 25 Jan 2022 • Xiangyu He, Jian Cheng
Super-resolution as an ill-posed problem has many high-resolution candidates for a low-resolution input.
no code implementations • 19 Jan 2022 • Zhexin Li, Tong Yang, Peisong Wang, Jian Cheng
In this paper, we propose a fully differentiable quantization method for vision transformer (ViT) named as Q-ViT, in which both of the quantization scales and bit-widths are learnable parameters.
1 code implementation • CVPR 2022 • Jiahao Lu, Xi Sheryl Zhang, Tianli Zhao, Xiangyu He, Jian Cheng
Showing how vision Transformers are at the risk of privacy leakage via gradients, we urge the significance of designing privacy-safer Transformer models and defending schemes.
1 code implementation • 16 Oct 2021 • Anda Cheng, Jiaxing Wang, Xi Sheryl Zhang, Qiang Chen, Peisong Wang, Jian Cheng
In light of this missing, we propose the very first framework that employs neural architecture search to automatic model design for private deep learning, dubbed as DPNAS.
1 code implementation • 15 Oct 2021 • Tianli Zhao, Xi Sheryl Zhang, Wentao Zhu, Jiaxing Wang, Sen yang, Ji Liu, Jian Cheng
In this paper, we present a unified framework with Joint Channel pruning and Weight pruning (JCW), and achieves a better Pareto-frontier between the latency and accuracy than previous model compression approaches.
no code implementations • ICCV 2021 • Weihan Chen, Peisong Wang, Jian Cheng
Finally, based on the above simplification, we show that the original problem can be reformulated as a Multiple-Choice Knapsack Problem (MCKP) and propose a greedy search algorithm to solve it efficiently.
no code implementations • 12 Oct 2021 • Weixiang Xu, Qiang Chen, Xiangyu He, Peisong Wang, Jian Cheng
Binary Neural Networks (BNNs) rely on a real-valued auxiliary variable W to help binary training.
no code implementations • ICCV Workshop 2021 • Xing Lan, Qinghao Hu, Jian Cheng
The statistical re- sults show the NME generated by quantization error is even larger than 1/3 of the SOTA item, which is a serious obsta- cle for making a new breakthrough in face alignment.
Ranked #7 on
Face Alignment
on COFW
no code implementations • 1 Sep 2021 • Tianli Zhao, Qinghao Hu, Xiangyu He, Weixiang Xu, Jiaxing Wang, Cong Leng, Jian Cheng
Acceleration of deep neural networks to meet a specific latency constraint is essential for their deployment on mobile devices.
no code implementations • 7 Jul 2021 • Chengzhi Jiang, Yanzhou Su, Wen Wang, Haiwei Bai, Haijun Liu, Jian Cheng
This method, named IntraLoss, explicitly performs gradient enhancement in the anisotropic region so that the intra-class distribution continues to shrink, resulting in isotropic and more compact intra-class distribution and further margin between identities.
1 code implementation • 6 Jun 2021 • Jian Cheng, Ziyang Liu, Hao Guan, Zhenzhou Wu, Haogang Zhu, Jiyang Jiang, Wei Wen, DaCheng Tao, Tao Liu
In this paper, a novel 3D convolutional network, called two-stage-age-network (TSAN), is proposed to estimate brain age from T1-weighted MRI data.
1 code implementation • 7 Apr 2021 • Xing Lan, Qinghao Hu, Qiang Chen, Jian Xue, Jian Cheng
In particular, our HIH reaches 4. 08 NME (Normalized Mean Error) on WFLW, and 3. 21 on COFW, which exceeds previous methods by a significant margin.
Ranked #4 on
Face Alignment
on WFW (Extra Data)
no code implementations • ICCV 2021 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
Existing methods for skeleton-based action recognition mainly focus on improving the recognition accuracy, whereas the efficiency of the model is rarely considered.
6 code implementations • CVPR 2021 • Qiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun
From the perspective of optimization, we introduce an alternative way to address the problem instead of adopting the complex feature pyramids - {\em utilizing only one-level feature for detection}.
Ranked #145 on
Object Detection
on COCO test-dev
no code implementations • 4 Mar 2021 • Zejian Liu, Gang Li, Jian Cheng
BERT is the most recent Transformer-based model that achieves state-of-the-art performance in various NLP tasks.
no code implementations • 21 Jan 2021 • Xiangyu He, Qinghao Hu, Peisong Wang, Jian Cheng
Convolutional neural networks are able to learn realistic image priors from numerous training samples in low-level image generation and restoration.
1 code implementation • ICCV 2021 • Fanrong Li, Gang Li, Xiangyu He, Jian Cheng
In particular, dynamic dual gating can provide 59. 7% saving in computing of ResNet50 with 76. 41% top-1 accuracy on ImageNet, which has advanced the state-of-the-art.
no code implementations • 21 Dec 2020 • Zhenyu Liu, Jian Cheng
Time series classification problems exist in many fields and have been explored for a couple of decades.
1 code implementation • NeurIPS 2020 • Jiaxing Wang, Haoli Bai, Jiaxiang Wu, Xupeng Shi, Junzhou Huang, Irwin King, Michael Lyu, Jian Cheng
Nevertheless, it is unclear how parameter sharing affects the searching process.
no code implementations • 10 Nov 2020 • Andrey Ignatov, Radu Timofte, Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng, Juewen Peng, Xianrui Luo, Ke Xian, Zijin Wu, Zhiguo Cao, Densen Puthussery, Jiji C V, Hrishikesh P S, Melvin Kuriakose, Saikat Dutta, Sourya Dipta Das, Nisarg A. Shah, Kuldeep Purohit, Praveen Kandula, Maitreya Suin, A. N. Rajagopalan, Saagara M B, Minnu A L, Sanjana A R, Praseeda S, Ge Wu, Xueqin Chen, Tengyao Wang, Max Zheng, Hulk Wong, Jay Zou
This paper reviews the second AIM realistic bokeh effect rendering challenge and provides the description of the proposed solutions and results.
1 code implementation • 4 Nov 2020 • Ming Qian, Congyu Qiao, Jiamin Lin, Zhenyu Guo, Chenghua Li, Cong Leng, Jian Cheng
A photo captured with bokeh effect often means objects in focus are sharp while the out-of-focus areas are all blurred.
1 code implementation • ACM MM 2020 • Xing Lan, Qinghao Hu, Fangzhou Xiong, Cong Leng, Jian Cheng
Face alignment is an important task in the field of multi-media.
Ranked #8 on
Face Alignment
on COFW
(using extra training data)
no code implementations • 2 Oct 2020 • Zhizhe Liu, Xingxing Zhang, Zhenfeng Zhu, Shuai Zheng, Yao Zhao, Jian Cheng
There have been numerous methods proposed for human identification, such as face identification, person re-identification, and gait identification.
2 code implementations • 27 Sep 2020 • Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg A. Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Densen Puthussery, Hrishikesh P. S, Melvin Kuriakose, Jiji C. V, Yu Zhu, Liping Dong, Zhuolong Jiang, Chenghua Li, Cong Leng, Jian Cheng
The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illuminant orientation (i. e., light source position).
3 code implementations • 15 Sep 2020 • Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P. S, Densen Puthussery, Jiji C. V, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Jiangtao Zhang, Xiaotong Luo, Liang Chen, Yanyun Qu, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni
This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results.
1 code implementation • ECCV 2020 • Guan'an Wang, Shaogang Gong, Jian Cheng, Zeng-Guang Hou
In this work, we introduce a new solution for fast ReID by formulating a novel Coarse-to-Fine (CtF) hashing code search strategy, which complementarily uses short and long codes, achieving both faster speed and better accuracy.
1 code implementation • 7 Jul 2020 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
Besides, from the data aspect, we introduce a skeletal data decoupling technique to emphasize the specific characteristics of space/time and different motion scales, resulting in a more comprehensive understanding of the human actions. To test the effectiveness of the proposed method, extensive experiments are conducted on four challenging datasets for skeleton-based gesture and action recognition, namely, SHREC, DHG, NTU-60 and NTU-120, where DSTA-Net achieves state-of-the-art performance on all of them.
Ranked #23 on
Skeleton Based Action Recognition
on NTU RGB+D
no code implementations • 7 Apr 2020 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
The two perspectives are orthogonal and complementary to each other; and by fusing them in a unified framework, our method achieves a more comprehensive understanding of the skeleton data.
no code implementations • 7 Mar 2020 • Wen Wang, Xiaojiang Peng, Yanzhou Su, Yu Qiao, Jian Cheng
Video action anticipation aims to predict future action categories from observed frames.
no code implementations • 10 Feb 2020 • Fuzhen Li, Zhenfeng Zhu, Xingxing Zhang, Jian Cheng, Yao Zhao
In zero-shot learning (ZSL), the samples to be classified are usually projected into side information templates such as attributes.
2 code implementations • 10 Feb 2020 • Guan-An Wang, Tianzhu Zhang. Yang Yang, Jian Cheng, Jianlong Chang, Xu Liang, Zeng-Guang Hou
Second, given cross-modality unpaired-images of a person, our method can generate cross-modality paired images from exchanged images.
1 code implementation • 21 Jan 2020 • Wen Wang, Xiaojiang Peng, Yu Qiao, Jian Cheng
Online action detection (OAD) is a practical yet challenging task, which has attracted increasing attention in recent years.
3 code implementations • 15 Dec 2019 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
Second, the second-order information of the skeleton data, i. e., the length and orientation of the bones, is rarely investigated, which is naturally more informative and discriminative for the human action recognition.
1 code implementation • CVPR 2020 • Shuai Zheng, Zhenfeng Zhu, Xingxing Zhang, Zhizhe Liu, Jian Cheng, Yao Zhao
Graph representation learning aims to encode all nodes of a graph into low-dimensional vectors that will serve as input of many compute vision tasks.
no code implementations • 28 Nov 2019 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
Existing methods exploit the joint positions to extract the body-part features from the activation map of the convolutional networks to assist human action recognition.
no code implementations • 28 Nov 2019 • Wen Wang, Lijun Du, Yinxing Gao, Yanzhou Su, Feng Wang, Jian Cheng
Concretely, for remote sensing image scene classification, we would like to map images from the same scene to feature vectors that are close, and map images from different scenes to feature vectors that are widely separated.
1 code implementation • 13 Nov 2019 • Xiangyu He, Zitao Mo, Qiang Chen, Anda Cheng, Peisong Wang, Jian Cheng
Many successful learning targets such as minimizing dice loss and cross-entropy loss have enabled unprecedented breakthroughs in segmentation tasks.
Ranked #34 on
Semantic Segmentation
on PASCAL Context
no code implementations • 22 Oct 2019 • Zhizhe Liu, Xingxing Zhang, Zhenfeng Zhu, Shuai Zheng, Yao Zhao, Jian Cheng
The key to ZSL is to transfer knowledge from the seen to the unseen classes via auxiliary class attribute vectors.
1 code implementation • 19 Oct 2019 • Qiang Chen, Anda Cheng, Xiangyu He, Peisong Wang, Jian Cheng
Object location is fundamental to panoptic segmentation as it is related to all things and stuff in the image scene.
Ranked #16 on
Panoptic Segmentation
on COCO test-dev
1 code implementation • ICCV 2019 • Guan'an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zeng-Guang Hou
First, it can exploit pixel alignment and feature alignment jointly.
Cross-Modality Person Re-identification
Person Re-Identification
+1
no code implementations • 24 Sep 2019 • Fanrong Li, Zitao Mo, Peisong Wang, Zejian Liu, Jiayun Zhang, Gang Li, Qinghao Hu, Xiangyu He, Cong Leng, Yang Zhang, Jian Cheng
As a case study, we evaluate our object detection system on a real-world surveillance video with input size of 512x512, and it turns out that the system can achieve an inference speed of 18 fps at the cost of 6. 9W (with display) with an mAP of 66. 4 verified on the PASCAL VOC 2012 dataset.
1 code implementation • 23 Jul 2019 • Xiangyu He, Ke Cheng, Qiang Chen, Qinghao Hu, Peisong Wang, Jian Cheng
Long-range dependencies modeling, widely used in capturing spatiotemporal correlation, has shown to be effective in CNN dominated computer vision tasks.
Ranked #212 on
Object Detection
on COCO test-dev
no code implementations • 23 Jul 2019 • Haijun Liu, Jian Cheng
To address these two issues, we propose focusing on enhancing the discriminative feature learning (EDFL) with two extreme simple means from two core aspects, (1) skip-connection for mid-level features incorporation to improve the person features with more discriminability and robustness, and (2) dual-modality triplet loss to guide the training procedures by simultaneously considering the cross-modality discrepancy and intra-modality variations.
Cross-Modality Person Re-identification
Person Re-Identification
no code implementations • 11 Jul 2019 • Shuai Zheng, Zhenfeng Zhu, Jian Cheng, Yandong Guo, Yao Zhao
Non-uniform blur, mainly caused by camera shake and motions of multiple objects, is one of the most common causes of image quality degradation.
1 code implementation • arXiv 2019 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
However, the topology of the graph is set by hand and fixed over all layers, which may be not optimal for the action recognition task and the hierarchical CNN structures.
Ranked #51 on
Skeleton Based Action Recognition
on NTU RGB+D
no code implementations • WS 2019 • Ch, Chelsea ler, Peter W. Foltz, Jian Cheng, Jared C. Bernstein, Elizabeth P. Rosenfeld, Alex S. Cohen, Terje B. Holmlund, Brita Elvev{\aa}g
A final set of three features were used to both predict expert human ratings with a ridge regression model (r = 0. 88) and to differentiate patients from healthy individuals with an ensemble of logistic regression classifiers (accuracy = 76{\%}).
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 30 May 2019 • Haijun Liu, Jian Cheng, Wen Wang, Yanzhou Su
A large amount of loss functions based on pair distances have been presented in the literature for guiding the training of deep metric learning.
no code implementations • 30 May 2019 • Haijun Liu, Jian Cheng, Shiguang Wang, Wen Wang
Unlike existing cross-domain Re-ID methods, leveraging the auxiliary information of those unlabeled target-domain data, we aim at enhancing the model generalization and adaptation by discriminative feature learning, and directly exploiting a pre-trained model to new domains (datasets) without any utilization of the information from target domains.
no code implementations • 25 May 2019 • Jiaxing Wang, Yin Zheng, Xiaoshuang Chen, Junzhou Huang, Jian Cheng
Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain.
no code implementations • 19 Oct 2018 • Wen Wang, Yongjian Wu, Haijun Liu, Shiguang Wang, Jian Cheng
Temporal action detection aims at not only recognizing action category but also detecting start time and end time for each action instance in an untrimmed video.
no code implementations • 4 Sep 2018 • Lu Bai, Yuhang Jiao, Luca Rossi, Lixin Cui, Jian Cheng, Edwin R. Hancock
This paper proposes a new Quantum Spatial Graph Convolutional Neural Network (QSGCNN) model that can directly learn a classification function for graphs of arbitrary sizes.
no code implementations • ECCV 2018 • Qinghao Hu, Gang Li, Peisong Wang, Yifan Zhang, Jian Cheng
In this paper, we propose a novel semi-binary decomposition method which decomposes a matrix into two binary matrices and a diagonal matrix.
no code implementations • ECCV 2018 • Guan'an Wang, Qinghao Hu, Jian Cheng, Zeng-Guang Hou
Secondly, we design novel structure of the generative model and the discriminative model to learn the distribution of triplet-wise information in a semi-supervised way.
no code implementations • 4 Jun 2018 • Hui Zhou, Wanli Ouyang, Jian Cheng, Xiaogang Wang, Hongsheng Li
In addition, inter-object relations are mostly modeled in a symmetric way, which we argue is not an optimal setting.
1 code implementation • CVPR 2018 • Peisong Wang, Qinghao Hu, Yifan Zhang, Chunjie Zhang, Yang Liu, Jian Cheng
In this paper, we propose a simple yet effective Two-Step Quantization (TSQ) framework, by decomposing the network quantization problem into two steps: code learning and transformation function learning based on the learned codes.
4 code implementations • CVPR 2019 • Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu
In addition, the second-order information (the lengths and directions of bones) of the skeleton data, which is naturally more informative and discriminative for action recognition, is rarely investigated in existing methods.
Ranked #3 on
3D Action Recognition
on Assembly101
no code implementations • 12 Apr 2018 • Shiguang Wang, Jian Cheng, Haijun Liu, Ming Tang
To take advantage of the body parts and context information for pedestrian detection, we propose the part and context network (PCN) in this work.
no code implementations • 8 Feb 2018 • Qinghao Hu, Peisong Wang, Jian Cheng
To achieve this goal, we propose a novel approach named BWNH to train Binary Weight Networks via Hashing.
no code implementations • 3 Feb 2018 • Jian Cheng, Peisong Wang, Gang Li, Qinghao Hu, Hanqing Lu
As for hardware implementation of deep neural networks, a batch of accelerators based on FPGA/ASIC have been proposed in recent years.
9 code implementations • 17 Jan 2018 • Feng Wang, Weiyang Liu, Haijun Liu, Jian Cheng
In this work, we introduce a novel additive angular margin for the Softmax loss, which is intuitively appealing and more interpretable than the existing works.
Ranked #2 on
Face Identification
on Trillion Pairs Dataset
no code implementations • ICCV 2017 • Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, Jian Cheng
Gesture is a natural interface in interacting with wearable devices such as VR/AR helmet and glasses.
no code implementations • 6 Jun 2017 • Jian Cheng, Peter J. Basser
2) We propose Orientational Order (OO) and Orientational Dispersion (OD) indices to describe the degree of alignment and dispersion of a spherical function in a single voxel or in a region, respectively; 3) We also show how to construct a local orthogonal coordinate frame in each voxel exhibiting anisotropic diffusion; 4) Finally, we define three indices to describe three types of orientational distortion (splay, bend, and twist) in a local spatial neighborhood, and a total distortion index to describe distortions of all three types.
3 code implementations • 21 Apr 2017 • Feng Wang, Xiang Xiang, Jian Cheng, Alan L. Yuille
We show that both strategies, and small variants, consistently improve performance by between 0. 2% to 0. 4% on the LFW dataset based on two models.
2 code implementations • 22 Feb 2017 • Feng Wang, Xiang Xiang, Chang Liu, Trac. D. Tran, Austin Reiter, Gregory D. Hager, Harry Quon, Jian Cheng, Alan L. Yuille
In this way, the expression intensity regression task can benefit from the rich feature representations trained on a huge amount of data for face verification.
no code implementations • CVPR 2017 • Peisong Wang, Jian Cheng
In recent years, Deep Neural Networks (DNN) based methods have achieved remarkable performance in a wide range of tasks and have been among the most powerful and widely used techniques in computer vision.
1 code implementation • CVPR 2016 • Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng
Recently, convolutional neural networks (CNN) have demonstrated impressive performance in various computer vision tasks.
no code implementations • CVPR 2015 • Cong Leng, Jiaxiang Wu, Jian Cheng, Xiao Bai, Hanqing Lu
Recently, hashing based approximate nearest neighbor (ANN) search has attracted much attention.
no code implementations • CVPR 2014 • Haichuan Yang, Xiao Bai, Jun Zhou, Peng Ren, Zhihong Zhang, Jian Cheng
Hashing is very useful for fast approximate similarity search on large database.
no code implementations • CVPR 2014 • Jian Cheng, Cong Leng, Jiaxiang Wu, Hainan Cui, Hanqing Lu
Image matching is one of the most challenging stages in 3D reconstruction, which usually occupies half of computational cost and inaccurate matching may lead to failure of reconstruction.
no code implementations • 2 Jul 2013 • Jian Cheng, Tianzi Jiang, Rachid Deriche, Dinggang Shen, Pew-Thian Yap
Compared with the start-of-the-art CR-DL and DR-DL methods proposed by Merlet et al. and Bilgic et al., espectively, our work offers the following advantages.