1 code implementation • 9 Mar 2025 • Yuchen Yang, Wei Wang, Yifei Liu, Linfeng Dong, Hao Wu, Mingxin Zhang, Zhihang Zhong, Xiao Sun
This framework aligns with the feature extraction paradigm in RGB-based methods, enabling direct evaluation of RGB-based models on skeleton-based benchmarks.
Group Activity Recognition
Temporal Group Activity Localization
no code implementations • 7 Mar 2025 • Zherui Huang, Xing Gao, Guanjie Zheng, Licheng Wen, Xuemeng Yang, Xiao Sun
Simulating such safety-critical scenarios is nontrivial, however, from log data that are typically regular scenarios, especially in consideration of dynamic adversarial interactions between the future motions of autonomous vehicles and surrounding traffic participants.
2 code implementations • 7 Feb 2025 • Muhammad Imran, Jonathan R. Krebs, Vishal Balaji Sivaraman, Teng Zhang, Amarjeet Kumar, Walker R. Ueland, Michael J. Fassler, Jinlong Huang, Xiao Sun, Lisheng Wang, Pengcheng Shi, Maximilian Rokuss, Michael Baumgartner, Yannick Kirchhof, Klaus H. Maier-Hein, Fabian Isensee, Shuolin Liu, Bing Han, Bong Thanh Nguyen, Dong-Jin Shin, Park Ji-Woo, Mathew Choi, Kwang-Hyun Uhm, Sung-Jea Ko, Chanwoong Lee, Jaehee Chun, Jin Sung Kim, Minghui Zhang, Hanxiao Zhang, Xin You, Yun Gu, Zhaohong Pan, Xuan Liu, Xiaokun Liang, Markus Tiefenthaler, Enrique Almar-Munoz, Matthias Schwab, Mikhail Kotyushev, Rostislav Epifanov, Marek Wodzinski, Henning Muller, Abdul Qayyum, Moona Mazher, Steven A. Niederer, Zhiwei Wang, Kaixiang Yang, Jintao Ren, Stine Sofia Korreman, Yuchong Gao, Hongye Zeng, Haoyu Zheng, Rui Zheng, Jinghua Yue, Fugen Zhou, Bo Liu, Alexander Cosman, Muxuan Liang, Chang Zhao, Gilbert R. Upchurch Jr., Jun Ma, Yuyin Zhou, Michol A. Cooper, Wei Shao
Furthermore, no open-source dataset is currently available to support the development of multi-class aortic segmentation methods.
1 code implementation • 29 Dec 2024 • Yifei Liu, Zhihang Zhong, Yifan Zhan, Sheng Xu, Xiao Sun
While 3D Gaussian Splatting (3DGS) has demonstrated remarkable performance in novel view synthesis and real-time rendering, the high memory consumption due to the use of millions of Gaussians limits its practicality.
no code implementations • 2 Dec 2024 • Hao Wu, Zhihang Zhong, Xiao Sun
However, current methods face two key challenges: (1) image features used for retrieval are often optimized based on ground-truth (GT) captions, which represent the image from a specific perspective and are influenced by annotator biases, and (2) they underutilize the full potential of retrieved text, typically relying on raw captions or parsed objects, which fail to capture the full semantic richness of the data.
no code implementations • 25 Nov 2024 • Wangze Xu, Yifan Zhan, Zhihang Zhong, Xiao Sun
3D human avatars, through the use of canonical radiance fields and per-frame observed warping, enable high-fidelity rendering and animating.
1 code implementation • 20 Nov 2024 • Yuchen Yang, Xuanyi Liu, Xing Gao, Zhihang Zhong, Xiao Sun
Recent unsupervised methods for monocular 3D pose estimation have endeavored to reduce dependence on limited annotated 3D data, but most are solely formulated in 2D space, overlooking the inherent depth ambiguity issue.
1 code implementation • 10 Oct 2024 • Yifan Zhan, Qingtian Zhu, Muyao Niu, Mingze Ma, Jiancheng Zhao, Zhihang Zhong, Xiao Sun, Yu Qiao, Yinqiang Zheng
In this paper, we highlight a critical yet often overlooked factor in most 3D human tasks, namely modeling humans with complex garments.
1 code implementation • 27 Sep 2024 • Chuang Chen, Xiao Sun, Zhi Liu
To the best of our knowledge, this is the first large-scale pretraining framework that integrates psychological theories with contemporary contrastive learning and masked image modeling techniques for emotion analysis across diverse scenarios.
no code implementations • 22 Sep 2024 • Qiu Yang, Xiao Sun, Xin-yu Li, Feng-Qi Cui, Yu-Tong Guo, Shuang-Zhen Hu, Ping Luo, Si-Ying Li
This approach enables the network to learn priors during the training stage while relying solely on low-resolution facial images during the testing stage, thus mitigating the adverse effects of prior estimation inaccuracies.
1 code implementation • 29 Aug 2024 • Xiangchen Yin, Donglin Di, Lei Fan, Hao Li, Wei Chen, Xiaofei Gou, Yang song, Xiao Sun, Xun Yang
In this paper, we propose a framework that delves into the graph relations of pose priors to provide control information for human image generation.
1 code implementation • 19 Aug 2024 • Heng Li, Yuenan Hou, Xiaohan Xing, Xiao Sun, Yanyong Zhang
Inspired by the global modeling and linear computation complexity of the Mamba architecture, we present the first Mamba-based network for semantic occupancy prediction, termed OccMamba.
1 code implementation • 7 Aug 2024 • Ruiqi Wang, Jinyang Huang, Jie Zhang, Xin Liu, Xiang Zhang, Zhi Liu, Peng Zhao, Sigui Chen, Xiao Sun
Depression is a prevalent mental health disorder that significantly impacts individuals' lives and well-being.
1 code implementation • 6 Aug 2024 • Hui Ma, Bo Zhang, Bo Xu, Jian Wang, Hongfei Lin, Xiao Sun
During reinforcement learning training, the proximal policy optimization algorithm is used to fine-tune the policy, enabling the generation of empathetic responses.
no code implementations • 17 Jul 2024 • Kang Shen, Xuxiong Liu, Boyan Wang, Jun Yao, Xin Liu, Yujie Guan, Yu Wang, Gengchen Li, Xiao Sun
In this paper, we present our approach to addressing the challenges of the 7th ABAW competition.
no code implementations • 17 Jul 2024 • Xuxiong Liu, Kang Shen, Jun Yao, Boyan Wang, Minrui Liu, Liuwei An, Zishun Cui, Weijie Feng, Xiao Sun
Compound Expression Recognition (CER) is vital for effective interpersonal interactions.
no code implementations • 16 Jul 2024 • Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang
This paper introduces an innovative deep learning-based method for end-to-end target radial length estimation from HRRP (High Resolution Range Profile) sequences.
1 code implementation • 11 Jul 2024 • Lingfeng Chen, Xiao Sun, Zhiliang Pan, Zehao Wang, Xiaolong Su, Zhen Liu, Panhe Hu
High Resolution Range Profiles (HRRP) have become a key area of focus in the domain of Radar Automatic Target Recognition (RATR).
1 code implementation • 8 Jul 2024 • Jinpeng Hu, Tengteng Dong, Luo Gang, Hui Ma, Peng Zou, Xiao Sun, Dan Guo, Xun Yang, Meng Wang
Additionally, to compare the performance of PsycoLLM with other LLMs, we develop a comprehensive psychological benchmark based on authoritative psychological counseling examinations in China, which includes assessments of professional ethics, theoretical proficiency, and case analysis.
no code implementations • 7 Jun 2024 • Wei Qian, Qi Li, Kun Li, Xinke Wang, Xiao Sun, Meng Wang, Dan Guo
This paper briefly introduces the solutions developed by our team, HFUT-VUT, for Track 1 of self-supervised heart rate measurement in the 3rd Vision-based Remote Physiological Signal Sensing (RePSS) Challenge hosted at IJCAI 2024.
no code implementations • 4 Apr 2024 • Yiming Zhang, Zhe Wang, Xinjie Li, Yunchen Yuan, Chengsong Zhang, Xiao Sun, Zhihang Zhong, Jian Wang
Human body restoration plays a vital role in various applications related to the human body.
no code implementations • CVPR 2024 • Hao Wu, Huabin Liu, Yu Qiao, Xiao Sun
We present Dive Into the BoundarieS (DIBS), a novel pretraining framework for dense video captioning (DVC), that elaborates on improving the quality of the generated event captions and their associated pseudo event boundaries from unlabeled videos.
no code implementations • 28 Mar 2024 • Yutong Chen, Yifan Zhan, Zhihang Zhong, Wei Wang, Xiao Sun, Yu Qiao, Yinqiang Zheng
Neural rendering techniques have significantly advanced 3D human body modeling.
1 code implementation • 14 Dec 2023 • Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada
The standard Neural Radiance Fields (NeRF) paradigm employs a viewer-centered methodology, entangling the aspects of illumination and material reflectance into emission solely from 3D points.
1 code implementation • 12 Dec 2023 • Yuchen Yang, Yu Qiao, Xiao Sun
Automatic estimation of 3D human pose from monocular RGB images is a challenging and unsolved problem in computer vision.
Ranked #5 on
Unsupervised 3D Human Pose Estimation
on Human3.6M
1 code implementation • 14 Nov 2023 • Zhihang Zhong, Xiao Sun, Yu Qiao, Gurunandan Krishnan, Sizhuo Ma, Jian Wang
Existing video frame interpolation (VFI) methods blindly predict where each object is at a specific timestep t ("time indexing"), which struggles to predict precise object movements.
no code implementations • 9 Oct 2023 • Ziyang Zhang, Xiao Sun, Liuwei An, Meng Wang
First, the Adaptive Threshold Learning module generates two thresholds, namely the clean and noisy thresholds, for each category.
Facial Expression Recognition
Facial Expression Recognition (FER)
no code implementations • 13 Sep 2023 • Xiangchen Yin, Zhenda Yu, Xin Gao, Xiao Sun
Low-light image enhancement restores the colors and details of a single image and improves high-level visual tasks.
1 code implementation • 23 Aug 2023 • Chenrui Zhang, Lin Liu, Jinpeng Wang, Chuyuan Wang, Xiao Sun, Hongyu Wang, Mingchen Cai
Moreover, to enhance stability of the prompt effect evaluation, we propose a novel prompt bagging method involving forward and backward thinking, which is superior to majority voting and is beneficial for both feedback and weight calculation in boosting.
3 code implementations • 6 Jun 2023 • Jiaqi Zhai, Zhaojie Gong, Yueming Wang, Xiao Sun, Zheng Yan, Fu Li, Xing Liu
A key component of retrieval is to model (user, item) similarity, which is commonly represented as the dot product of two learned embeddings.
no code implementations • 21 May 2023 • Yanjing Li, Sheng Xu, Mingbao Lin, Xianbin Cao, Chuanjian Liu, Xiao Sun, Baochang Zhang
Vision transformers (ViTs) quantization offers a promising prospect to facilitate deploying large pre-trained networks on resource-limited devices.
1 code implementation • 2 May 2023 • Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao
To this end, we present Long-Term Rhythmic Video Soundtracker (LORIS), a novel framework to synthesize long-term conditional waveforms.
no code implementations • 17 Apr 2023 • Xiao Sun, Bo Zhang, Chenrui Zhang, Han Ren, Mingchen Cai
AUC is a common metric for evaluating the performance of a classifier.
no code implementations • 19 Mar 2023 • Peng Zou, Rui Wang, Kehua Wen, Yasi Peng, Xiao Sun
The in-the-wild affective behavior analysis has been an important study.
1 code implementation • 18 Mar 2023 • Tao Shu, Xinke Wang, Ruotong Wang, Chuang Chen, Yixin Zhang, Xiao Sun
The continuous improvement of human-computer interaction technology makes it possible to compute emotions.
no code implementations • 16 Mar 2023 • Ziyang Zhang, Liuwei An, Zishun Cui, Ao Xu, Tengteng Dong, Yueqi Jiang, Jingyi Shi, Xin Liu, Xiao Sun, Meng Wang
In this paper, we present our solutions for the 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), which includes four sub-challenges of Valence-Arousal (VA) Estimation, Expression (Expr) Classification, Action Unit (AU) Detection and Emotional Reaction Intensity (ERI) Estimation.
1 code implementation • 10 Mar 2023 • Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada
Common capture low-light scenes are challenging for most computer vision techniques, including Neural Radiance Fields (NeRF).
4 code implementations • ICCV 2023 • Huimin Wu, Chenyang Lei, Xiao Sun, Peng-Shuai Wang, Qifeng Chen, Kwang-Ting Cheng, Stephen Lin, Zhirong Wu
Self-supervised representation learning follows a paradigm of withholding some part of the data and tasking the network to predict it from the remaining part.
no code implementations • 7 Nov 2022 • Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang, Xiao Sun, HaoDong Wu, Xuncheng Liu, Weizhan Zhang, Caixia Yan, Haipeng Du, Qinghua Zheng, Qi Wang, Wangdu Chen, Ran Duan, Mengdi Sun, Dan Zhu, Guannan Chen, Hojin Cho, Steve Kim, Shijie Yue, Chenghua Li, Zhengyang Zhuge, Wei Chen, Wenxu Wang, Yufeng Zhou, Xiaochen Cai, Hengxing Cai, Kele Xu, Li Liu, Zehua Cheng, Wenyi Lian, Wenjing Lian
While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices.
no code implementations • 24 Sep 2022 • Haojie Xu, Weifeng Liu, Jingwei Liu, Mingzheng Li, Yu Feng, Yasi Peng, Yunwei Shi, Xiao Sun, Meng Wang
Our experiments demonstrate the effectiveness of our proposed model and hybrid fusion strategy on multimodal fusion, and the AUC of our proposed model on the test set is 0. 8972.
1 code implementation • 5 Aug 2022 • Jia Li, Ziyang Zhang, Junjie Lang, Yueqi Jiang, Liuwei An, Peng Zou, Yangyang Xu, Sheng Gao, Jie Lin, Chunxiao Fan, Xiao Sun, Meng Wang
In this paper, we present our solutions for the Multimodal Sentiment Analysis Challenge (MuSe) 2022, which includes MuSe-Humor, MuSe-Reaction and MuSe-Stress Sub-challenges.
1 code implementation • 20 Jul 2022 • Zhihang Zhong, Xiao Sun, Zhirong Wu, Yinqiang Zheng, Stephen Lin, Imari Sato
Existing solutions to this problem estimate a single image sequence without considering the motion ambiguity for each region.
1 code implementation • 9 Jun 2022 • Zhirong Wu, Zihang Lai, Xiao Sun, Stephen Lin
The paper presents a scalable approach for learning spatially distributed visual representations over individual tokens and a holistic instance representation simultaneously.
no code implementations • 19 Apr 2022 • Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada
We propose an unsupervised method for 3D geometry-aware representation learning of articulated objects, in which no image-pose pairs or foreground masks are used for training.
1 code implementation • 12 Mar 2022 • Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato
In this paper, instead of two consecutive frames, we propose to exploit a pair of images captured by dual RS cameras with reversed RS directions for this highly challenging task.
4 code implementations • CVPR 2022 • Yutong Chen, Fangyun Wei, Xiao Sun, Zhirong Wu, Stephen Lin
Concretely, we pretrain the sign-to-gloss visual network on the general domain of human actions and the within-domain of a sign-to-gloss dataset, and pretrain the gloss-to-text translation network on the general domain of a multilingual corpus and the within-domain of a gloss-to-text corpus.
Ranked #3 on
Sign Language Translation
on CSL-Daily
no code implementations • Multimedia Systems 2022 • Chunxiao Fan, zhenxing Wang, Jia Li, Shanshan Wang, Xiao Sun
In the proposed method, (1) the topological structure information and texture feature of regions of interest (ROIs) are modeled as graphs and processed with graph convolutional network (GCN) to remain the topological features.
Facial Expression Recognition
Facial Expression Recognition (FER)
+1
no code implementations • CVPR 2022 • Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin
Typically in recent work, the pseudo-labels are obtained by training a model on the labeled data, and then using confident predictions from the model to teach itself.
1 code implementation • 22 Nov 2021 • Kenneth Li, Xiao Sun, Zhirong Wu, Fangyun Wei, Stephen Lin
For human action understanding, a popular research direction is to analyze short video clips with unambiguous semantic content, such as jumping and drinking.
no code implementations • 29 Sep 2021 • Kenneth Li, Xiao Sun, Zhirong Wu, Fangyun Wei, Stephen Lin
However, methods for understanding short semantic actions cannot be directly translated to long kinematic sequences such as dancing, where it becomes challenging even to semantically label the human movements.
1 code implementation • 9 Sep 2021 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon
A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.
Ranked #43 on
Human-Object Interaction Detection
on HICO-DET
no code implementations • 27 Aug 2021 • Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei zhang, Zoltán Tüske, Kailash Gopalakrishnan
We investigate the impact of aggressive low-precision representations of weights and activations in two families of large LSTM-based architectures for Automatic Speech Recognition (ASR): hybrid Deep Bidirectional LSTM - Hidden Markov Models (DBLSTM-HMMs) and Recurrent Neural Network - Transducers (RNN-Ts).
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • ICCV 2021 • Ailing Zeng, Xiao Sun, Lei Yang, Nanxuan Zhao, Minhao Liu, Qiang Xu
While the average prediction accuracy has been improved significantly over the years, the performance on hard poses with depth ambiguity, self-occlusion, and complex or rare poses is still far from satisfactory.
Ranked #30 on
Skeleton Based Action Recognition
on NTU RGB+D 120
1 code implementation • 20 May 2021 • Xiao Sun, Bahador Bahmani, Nikolaos N. Vlassis, WaiChing Sun, Yanxun Xu
This paper presents a computational framework that generates ensemble predictive mechanics models with uncertainty quantification (UQ).
no code implementations • NeurIPS 2020 • Chia-Yu Chen, Jiamin Ni, Songtao Lu, Xiaodong Cui, Pin-Yu Chen, Xiao Sun, Naigang Wang, Swagath Venkataramani, Vijayalakshmi Srinivasan, Wei zhang, Kailash Gopalakrishnan
Large-scale distributed training of Deep Neural Networks (DNNs) on state-of-the-art platforms is expected to be severely communication constrained.
1 code implementation • ICCV 2021 • Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada
We present Neural Articulated Radiance Field (NARF), a novel deformable 3D representation for articulated objects learned from images.
no code implementations • NeurIPS 2020 • Xiao Sun, Naigang Wang, Chia-Yu Chen, Jiamin Ni, Ankur Agrawal, Xiaodong Cui, Swagath Venkataramani, Kaoutar El Maghraoui, Vijayalakshmi (Viji) Srinivasan, Kailash Gopalakrishnan
In this paper, we propose a number of novel techniques and numerical representation formats that enable, for the very first time, the precision of training systems to be aggressively scaled from 8-bits to 4-bits.
1 code implementation • ECCV 2020 • Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin
With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference.
Ranked #21 on
Monocular 3D Human Pose Estimation
on Human3.6M
1 code implementation • 17 Jul 2020 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon
A common problem in human-object interaction (HOI) detection task is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.
1 code implementation • ECCV 2020 • Fangyun Wei, Xiao Sun, Hongyang Li, Jingdong Wang, Stephen Lin
A recent approach for object detection and human pose estimation is to regress bounding boxes or human keypoints from a central point on the object or person.
no code implementations • NeurIPS 2019 • Xiao Sun, Jungwook Choi, Chia-Yu Chen, Naigang Wang, Swagath Venkataramani, Vijayalakshmi (Viji) Srinivasan, Xiaodong Cui, Wei zhang, Kailash Gopalakrishnan
Reducing the numerical precision of data and computation is extremely effective in accelerating deep learning training workloads.
no code implementations • 6 Nov 2019 • Xiao Sun, Zhouhui Lian, Jianguo Xiao
Point cloud analysis has drawn broader attentions due to its increasing demands in various fields.
no code implementations • 17 Apr 2019 • Jia Li, Xing Wei, Guoqiang Yang, Xiao Sun, Changliang Li
A multiscale shared convolution structure is adopted in the discriminator network to further supervise training the generator.
no code implementations • 17 Apr 2019 • Jia Li, Xiao Sun, Xing Wei, Changliang Li, Jian-Hua Tao
In recent years, the generation of conversation content based on deep neural networks has attracted many researchers.
no code implementations • 17 Nov 2018 • Xiao Sun, Chuankang Li, Stephen Lin
We present a method for human pose tracking that is based on learning spatiotemporal relationships among joints.
1 code implementation • 17 Sep 2018 • Xiao Sun, Chuankang Li, Stephen Lin
For the ECCV 2018 PoseTrack Challenge, we present a 3D human pose estimation system based mainly on the integral human pose regression method.
Ranked #1 on
3D Human Pose Estimation
on CHALL H80K
no code implementations • EMNLP 2018 • Jingyuan Li, Xiao Sun
Traditional neural language models tend to generate generic replies with poor logic and no emotion.
2 code implementations • ECCV 2018 • Xiao Sun, Bin Xiao, Fangyin Wei, Shuang Liang, Yichen Wei
State-of-the-art human pose estimation methods are based on heat map representation.
Ranked #23 on
Pose Estimation
on MPII Human Pose
6 code implementations • ICCV 2017 • Xingyi Zhou, Qi-Xing Huang, Xiao Sun, xiangyang xue, Yichen Wei
We propose a weakly-supervised transfer learning method that uses mixed 2D and 3D labels in a unified deep neutral network that presents two-stage cascaded structure.
2D Pose Estimation
3D Multi-Person Pose Estimation (absolute)
+4
1 code implementation • ICCV 2017 • Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei
A central problem is that the structural information in the pose is not well exploited in the previous regression methods.
Ranked #36 on
Pose Estimation
on MPII Human Pose
no code implementations • 17 Sep 2016 • Xingyi Zhou, Xiao Sun, Wei zhang, Shuang Liang, Yichen Wei
In this work, we propose to directly embed a kinematic object model into the deep neutral network learning for general articulated object pose estimation.
Ranked #331 on
3D Human Pose Estimation
on Human3.6M
no code implementations • CVPR 2015 • Xiao Sun, Yichen Wei, Shuang Liang, Xiaoou Tang, Jian Sun
We extends the previous 2D cascaded object pose regression work [9] in two aspects so that it works better for 3D articulated objects.
no code implementations • CVPR 2014 • Chen Qian, Xiao Sun, Yichen Wei, Xiaoou Tang, Jian Sun
We present a realtime hand tracking system using a depth sensor.