no code implementations • 4 Dec 2022 • Qihuang Zhong, Liang Ding, Yibing Zhan, Yu Qiao, Yonggang Wen, Li Shen, Juhua Liu, Baosheng Yu, Bo Du, Yixin Chen, Xinbo Gao, Chunyan Miao, Xiaoou Tang, DaCheng Tao
This technical report briefly describes our JDExplore d-team's Vega v2 submission on the SuperGLUE leaderboard.
2 code implementations • 18 May 2020 • Yujun Shen, Ceyuan Yang, Xiaoou Tang, Bolei Zhou
In this work, we propose a framework called InterFaceGAN to interpret the disentangled face representation learned by the state-of-the-art GAN models and study the properties of the facial semantics encoded in the latent space.
4 code implementations • CVPR 2020 • Yujun Shen, Jinjin Gu, Xiaoou Tang, Bolei Zhou
In this work, we propose a novel framework, called InterFaceGAN, for semantic face editing by interpreting the latent semantics learned by GANs.
1 code implementation • 23 Apr 2019 • Ke Yu, Xintao Wang, Chao Dong, Xiaoou Tang, Chen Change Loy
To leverage this, we propose Path-Restore, a multi-path CNN with a pathfinder that can dynamically select an appropriate route for each image region.
1 code implementation • ICCV 2019 • Xingang Pan, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo
Unlike existing works that design normalization techniques for specific tasks, we propose Switchable Whitening (SW), which provides a general form unifying different whitening methods as well as standardization methods.
3 code implementations • 15 Mar 2019 • Tak-Wai Hui, Xiaoou Tang, Chen Change Loy
Over four decades, the majority addresses the problem of optical flow estimation using variational methods.
Ranked #3 on
Optical Flow Estimation
on KITTI 2012
5 code implementations • CVPR 2019 • Yuying Ge, Ruimao Zhang, Lingyun Wu, Xiaogang Wang, Xiaoou Tang, Ping Luo
A strong baseline is proposed, called Match R-CNN, which builds upon Mask R-CNN to solve the above four tasks in an end-to-end manner.
1 code implementation • 5 Dec 2018 • Mengya Gao, Yujun Shen, Quanquan Li, Junjie Yan, Liang Wan, Dahua Lin, Chen Change Loy, Xiaoou Tang
Knowledge Distillation (KD) aims at improving the performance of a low-capacity student model by inheriting knowledge from a high-capacity teacher model.
no code implementations • 4 Dec 2018 • Yujun Shen, Bolei Zhou, Ping Luo, Xiaoou Tang
In the second stage, they compete in the image domain to render photo-realistic images that contain high diversity but preserve identity.
2 code implementations • CVPR 2019 • Xintao Wang, Ke Yu, Chao Dong, Xiaoou Tang, Chen Change Loy
Deep convolutional neural network has demonstrated its capability of learning a deterministic mapping for the desired imagery effect.
no code implementations • 7 Sep 2018 • Yubin Deng, Ke Yu, Dahua Lin, Xiaoou Tang, Chen Change Loy
Most methods in deep-RL achieve good results via the maximization of the reward signal provided by the environment, typically in the form of discounted cumulative returns.
39 code implementations • 1 Sep 2018 • Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Chen Change Loy, Yu Qiao, Xiaoou Tang
To further enhance the visual quality, we thoroughly study three key components of SRGAN - network architecture, adversarial loss and perceptual loss, and improve each of them to derive an Enhanced SRGAN (ESRGAN).
Ranked #2 on
Face Hallucination
on FFHQ 512 x 512 - 16x upscaling
16 code implementations • ECCV 2018 • Xingang Pan, Ping Luo, Jianping Shi, Xiaoou Tang
IBN-Net carefully integrates Instance Normalization (IN) and Batch Normalization (BN) as building blocks, and can be wrapped into many advanced deep networks to improve their performances.
1 code implementation • 1 Jun 2018 • Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang
Data for face analysis often exhibit highly-skewed class distribution, i. e., most data belong to a few majority classes, while the minority classes only contain a scarce amount of instances.
no code implementations • CVPR 2018 • Yujun Shen, Ping Luo, Junjie Yan, Xiaogang Wang, Xiaoou Tang
Existing methods typically formulate GAN as a two-player game, where a discriminator distinguishes face images from the real and synthesized domains, while a generator reduces its discriminativeness by synthesizing a face of photo-realistic quality.
4 code implementations • CVPR 2018 • Tak-Wai Hui, Xiaoou Tang, Chen Change Loy
FlowNet2, the state-of-the-art convolutional neural network (CNN) for optical flow estimation, requires over 160M parameters to achieve accurate flow estimation.
Ranked #7 on
Optical Flow Estimation
on KITTI 2012
1 code implementation • CVPR 2018 • Kaidi Cao, Yu Rong, Cheng Li, Xiaoou Tang, Chen Change Loy
However, many contemporary face recognition models still perform relatively poor in processing profile faces compared to frontal faces.
Ranked #1 on
Face Identification
on IJB-A
8 code implementations • 17 Dec 2017 • Xingang Pan, Jianping Shi, Ping Luo, Xiaogang Wang, Xiaoou Tang
Although CNN has shown strong capability to extract semantics from raw pixels, its capacity to capture spatial relationships of pixels across rows and columns of an image is not fully explored.
Ranked #35 on
Lane Detection
on CULane
no code implementations • 2 Dec 2017 • Xiaohang Zhan, Ziwei Liu, Ping Luo, Xiaoou Tang, Chen Change Loy
The key of this new form of learning is to design a proxy task (e. g. image colorization), from which a discriminative loss can be formulated on unlabeled data.
no code implementations • ICCV 2017 • Yining Li, Chen Huang, Xiaoou Tang, Chen-Change Loy
In particular, each tuple consists of a pair of images and 4. 6 discriminative questions (as positive samples) and 5. 9 non-discriminative questions (as negative samples) on average.
2 code implementations • 7 Aug 2017 • Sijie Yan, Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang
This work addresses unconstrained fashion landmark detection, where clothing bounding boxes are not provided in both training and test.
3 code implementations • 1 Aug 2017 • Xiaoxiao Li, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi, Ping Luo, Xiaoou Tang, Chen Change Loy
Specifically, our Video Object Segmentation with Re-identification (VS-ReID) model includes a mask propagation module and a ReID module.
1 code implementation • ICCV 2017 • Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou Tang
To further increase efficiency and accuracy, we (a): design a scale-forecast network to globally predict potential scales in the image since there is no need to compute maps on all levels of the pyramid.
Ranked #5 on
Face Detection
on Annotated Faces in the Wild
1 code implementation • 17 Jul 2017 • Yubin Deng, Chen Change Loy, Xiaoou Tang
We introduce EnhanceGAN, an adversarial learning based model that performs automatic image enhancement.
1 code implementation • CVPR 2017 • Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang
Person re-identification (ReID) is an important task in video surveillance and has various applications.
no code implementations • 9 Jun 2017 • Shuo Yang, Yuanjun Xiong, Chen Change Loy, Xiaoou Tang
Specifically, our method achieves 76. 4 average precision on the challenging WIDER FACE dataset and 96% recall rate on the FDDB dataset with 7 frames per second (fps) for 900 * 1300 input image.
9 code implementations • 8 May 2017 • Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool
Furthermore, based on the temporal segment networks, we won the video classification track at the ActivityNet challenge 2016 among 24 teams, which demonstrates the effectiveness of TSN and the proposed good practices.
Ranked #19 on
Action Classification
on Moments in Time
(Top 5 Accuracy metric)
15 code implementations • CVPR 2017 • Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang
In this work, we propose "Residual Attention Network", a convolutional neural network using attention mechanism which can incorporate with state-of-art feed forward network architecture in an end-to-end training fashion.
Ranked #555 on
Image Classification
on ImageNet
6 code implementations • ICCV 2017 • Yue Zhao, Yuanjun Xiong, Li-Min Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin
Detecting actions in untrimmed videos is an important yet challenging task.
Ranked #6 on
Action Recognition
on THUMOS’14
1 code implementation • CVPR 2017 • Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang
Third, in comparison to MC, LC is an end-to-end trainable framework, allowing joint learning of all sub-models.
Ranked #22 on
Semantic Segmentation
on PASCAL VOC 2012 test
1 code implementation • 8 Mar 2017 • Yuanjun Xiong, Yue Zhao, Li-Min Wang, Dahua Lin, Xiaoou Tang
Detecting activities in untrimmed videos is an important but challenging task.
Ranked #22 on
Temporal Action Localization
on ActivityNet-1.3
3 code implementations • ICCV 2017 • Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala
We combine the advantages of these two methods by training a deep network that learns to synthesize video frames by flowing pixel values from existing ones, which we call deep voxel flow.
Ranked #2 on
Video Prediction
on DAVIS 2017
no code implementations • 29 Jan 2017 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
We propose a deep convolutional neural network (CNN) for face detection leveraging on facial attributes based supervision.
no code implementations • NeurIPS 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang
Existing deep embedding methods in vision tasks are capable of learning a compact Euclidean space from images, where Euclidean distances correspond to a similarity metric.
Ranked #24 on
Metric Learning
on CUB-200-2011
1 code implementation • 4 Oct 2016 • Yubin Deng, Chen Change Loy, Xiaoou Tang
This survey aims at reviewing recent computer vision techniques used in the assessment of image aesthetic quality.
no code implementations • 21 Sep 2016 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang
Unlike existing models that typically learn from facial expression labels alone, we devise an effective multitask network that is capable of learning from rich auxiliary attributes such as gender, age, and head pose, beyond just facial expression data.
1 code implementation • 7 Sep 2016 • Zhirong Wu, Dahua Lin, Xiaoou Tang
Markov Random Fields (MRFs), a formulation widely used in generative image modeling, have long been plagued by the lack of expressive power.
4 code implementations • 10 Aug 2016 • Ziwei Liu, Sijie Yan, Ping Luo, Xiaogang Wang, Xiaoou Tang
Fashion landmark is also compared to clothing bounding boxes and human joints in two applications, fashion attribute prediction and clothes retrieval, showing that fashion landmark is a more discriminative representation to understand fashion images.
2 code implementations • 9 Aug 2016 • Ke Yu, Chao Dong, Chen Change Loy, Xiaoou Tang
Lossy compression introduces complex compression artifacts, particularly blocking artifacts, ringing effects and blurring.
1 code implementation • 2 Aug 2016 • Yuanjun Xiong, Li-Min Wang, Zhe Wang, Bo-Wen Zhang, Hang Song, Wei Li, Dahua Lin, Yu Qiao, Luc van Gool, Xiaoou Tang
This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016.
19 code implementations • 2 Aug 2016 • Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc van Gool
The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.
Ranked #3 on
Multimodal Activity Recognition
on EV-Action
13 code implementations • 1 Aug 2016 • Chao Dong, Chen Change Loy, Xiaoou Tang
As a successful deep model applied in image super-resolution (SR), the Super-Resolution Convolutional Neural Network (SRCNN) has demonstrated superior performance to the previous hand-crafted models either in speed and restoration quality.
no code implementations • 18 Jul 2016 • Shizhan Zhu, Sifei Liu, Chen Change Loy, Xiaoou Tang
We present a novel framework for hallucinating faces of unconstrained poses and with very low resolution (face size as small as 5pxIOD).
Ranked #5 on
Image Super-Resolution
on VggFace2 - 8x upscaling
no code implementations • 23 Jun 2016 • Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen Change Loy, Xiaoou Tang
Semantic segmentation tasks can be well modeled by Markov Random Field (MRF).
no code implementations • CVPR 2016 • Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang
To demonstrate the advantages of DeepFashion, we propose a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks.
no code implementations • CVPR 2016 • Shizhan Zhu, Cheng Li, Chen-Change Loy, Xiaoou Tang
We present a practical approach to address the problem of unconstrained face alignment for a single image.
Ranked #16 on
Face Alignment
on AFLW-19
no code implementations • CVPR 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang
Attributes offer useful mid-level features to interpret visual data.
no code implementations • CVPR 2016 • Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang
We further demonstrate that more discriminative deep representation can be learned by enforcing a deep network to maintain both inter-cluster and inter-class margins.
no code implementations • CVPR 2016 • Limin Wang, Yu Qiao, Xiaoou Tang, Luc van Gool
Actionness was introduced to quantify the likelihood of containing a generic action instance at a specific location.
Ranked #7 on
Temporal Action Localization
on J-HMDB-21
no code implementations • 3 Feb 2016 • Chen Huang, Chen Change Loy, Xiaoou Tang
These methods further deteriorate on small, imbalanced data that has a large degree of class overlap.
no code implementations • CVPR 2016 • Yi Sun, Xiaogang Wang, Xiaoou Tang
This paper proposes to learn high-performance deep ConvNets with sparse neural connections, referred to as sparse ConvNets, for face recognition.
no code implementations • ICCV 2015 • Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang
Third, each part detector in DeepParts is a strong detector that can detect pedestrian by observing only a part of a proposal.
no code implementations • 20 Nov 2015 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang
The unified framework seamlessly handles different viewpoints and landmark protocols, and it is trained by optimising directly on landmark locations, thus yielding superior results on arbitrary-view face alignment.
1 code implementation • CVPR 2016 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
Face detection is one of the most studied topics in the computer vision community.
Ranked #26 on
Face Detection
on WIDER Face (Medium)
no code implementations • 19 Nov 2015 • Zhirong Wu, Dahua Lin, Xiaoou Tang
This suggests that the semantic structure of a neural network may be manifested through a guided binarization process.
2 code implementations • ICCV 2015 • Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
In this paper, we propose a novel deep convolutional network (DCN) that achieves outstanding performance on FDDB, PASCAL Face, and AFW.
no code implementations • ICCV 2015 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang
Social relation defines the association, e. g, warm, friendliness, and dominance, between two or more people.
no code implementations • ICCV 2015 • Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen Change Loy, Xiaoou Tang
This paper addresses semantic image segmentation by incorporating rich information into Markov Random Field (MRF), including high-order relations and mixture of label contexts.
Ranked #82 on
Semantic Segmentation
on Cityscapes test
3 code implementations • CVPR 2015 • Linjie Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
Updated on 24/09/2015: This update provides preliminary experiment results for fine-grained classification on the surveillance data of CompCars.
Ranked #5 on
Fine-Grained Image Classification
on CompCars
1 code implementation • 14 Jun 2015 • Pan He, Weilin Huang, Yu Qiao, Chen Change Loy, Xiaoou Tang
We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem.
1 code implementation • 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2015 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang
We present a novel face alignment framework based on coarse-to-fine shape searching.
Ranked #17 on
Face Alignment
on AFLW-19
no code implementations • CVPR 2015 • Xiao Sun, Yichen Wei, Shuang Liang, Xiaoou Tang, Jian Sun
We extends the previous 2D cascaded object pose regression work [9] in two aspects so that it works better for 3D articulated objects.
no code implementations • CVPR 2015 • Yuanjun Xiong, Kai Zhu, Dahua Lin, Xiaoou Tang
A considerable portion of web images capture events that occur in our personal lives or social activities.
1 code implementation • CVPR 2015 • Limin Wang, Yu Qiao, Xiaoou Tang
Visual features are of vital importance for human action understanding in videos.
Ranked #2 on
Activity Recognition In Videos
on DogCentric
3 code implementations • ICCV 2015 • Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang
Lossy compression introduces complex compression artifacts, particularly the blocking artifacts, ringing effects and blurring.
2 code implementations • 3 Feb 2015 • Yi Sun, Ding Liang, Xiaogang Wang, Xiaoou Tang
Very deep neural networks recently achieved great success on general object recognition because of their superb learning capacity.
no code implementations • 5 Jan 2015 • Yubin Deng, Ping Luo, Chen Change Loy, Xiaoou Tang
Learning to recognize pedestrian attributes at far distance is a challenging problem in visual surveillance since face and body close-shots are hardly available; instead, only far-view image frames of pedestrian are given.
57 code implementations • 31 Dec 2014 • Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang
We further show that traditional sparse-coding-based SR methods can also be viewed as a deep convolutional network.
Ranked #2 on
Video Super-Resolution
on Xiph HD - 4x upscaling
no code implementations • CVPR 2015 • Wanli Ouyang, Xiaogang Wang, Xingyu Zeng, Shi Qiu, Ping Luo, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Chen-Change Loy, Xiaoou Tang
In this paper, we propose deformable deep convolutional neural networks for generic object detection.
1 code implementation • CVPR 2015 • Yi Sun, Xiaogang Wang, Xiaoou Tang
(2) Its neurons in higher layers are highly selective to identities and identity-related attributes.
Ranked #1 on
Face Verification
on Oulu-CASIA
no code implementations • NeurIPS 2014 • Zhenyao Zhu, Ping Luo, Xiaogang Wang, Xiaoou Tang
Intriguingly, even without accessing 3D data, human not only can recognize face identity, but can also imagine face images of a person under different viewpoints given a single 2D image, making face perception in the brain robust to view changes.
no code implementations • NeurIPS 2014 • Yuanjun Xiong, Wei Liu, Deli Zhao, Xiaoou Tang
Selecting a small informative subset from a given dataset, also called column sampling, has drawn much attention in machine learning.
no code implementations • CVPR 2015 • Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang
Rather than expensively annotating scene attributes, we transfer attributes information from existing scene segmentation datasets to the pedestrian dataset, by proposing a novel deep model to learn high-level features from multiple tasks and multiple data sources.
Ranked #28 on
Pedestrian Detection
on Caltech
1 code implementation • ICCV 2015 • Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
LNet is pre-trained by massive general object categories for face localization, while ANet is pre-trained by massive face identities for attribute prediction.
no code implementations • 11 Sep 2014 • Wanli Ouyang, Ping Luo, Xingyu Zeng, Shi Qiu, Yonglong Tian, Hongsheng Li, Shuo Yang, Zhe Wang, Yuanjun Xiong, Chen Qian, Zhenyao Zhu, Ruohui Wang, Chen-Change Loy, Xiaogang Wang, Xiaoou Tang
In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty.
no code implementations • 2 Sep 2014 • Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang
We show extensive results on combining various popular databases (LFW, AFLW, LFPW, HELEN) for improved cross-dataset and unseen data alignment.
no code implementations • 18 Aug 2014 • Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang
In this study, we show that landmark detection or face alignment task is not a single and independent problem.
Ranked #13 on
Unsupervised Facial Landmark Detection
on MAFL
no code implementations • 5 Jul 2014 • Deli Zhao, Xiaoou Tang
Clustering is indispensable for data analysis in many scientific disciplines.
no code implementations • 26 Jun 2014 • Zhenyao Zhu, Ping Luo, Xiaogang Wang, Xiaoou Tang
Intriguingly, even without accessing 3D data, human not only can recognize face identity, but can also imagine face images of a person under different viewpoints given a single 2D image, making face perception in the brain robust to view changes.
no code implementations • CVPR 2015 • Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, Jianxiong Xiao
Our model, 3D ShapeNets, learns the distribution of complex 3D shapes across different object categories and arbitrary poses from raw CAD data, and discovers hierarchical compositional part representations automatically.
Ranked #30 on
3D Point Cloud Classification
on ModelNet40
(Mean Accuracy metric)
2 code implementations • NeurIPS 2014 • Yi Sun, Xiaogang Wang, Xiaoou Tang
The learned DeepID2 features can be well generalized to new identities unseen in the training data.
no code implementations • CVPR 2014 • Ping Luo, Yonglong Tian, Xiaogang Wang, Xiaoou Tang
In this paper, we propose a Switchable Deep Network (SDN) for pedestrian detection.
no code implementations • CVPR 2014 • Chen Qian, Xiao Sun, Yichen Wei, Xiaoou Tang, Jian Sun
We present a realtime hand tracking system using a depth sensor.
no code implementations • Conference 2014 • Yi Sun, Xiaogang Wang, Xiaoou Tang
When learned as classifiers to recognize about 10, 000 face identities in the training set and configured to keep reducing the neuron numbers along the feature extraction hierarchy, these deep ConvNets gradually form compact identity-related features in the top layers with only a small number of hidden neurons.
no code implementations • CVPR 2014 • Jianzhou Yan, Stephen Lin, Sing Bing Kang, Xiaoou Tang
We present a machine-learned ranking approach for automatically enhancing the color of a photograph.
no code implementations • 15 Apr 2014 • Chaochao Lu, Xiaoou Tang
For the first time, the human-level performance in face verification (97. 53%) on LFW is surpassed.
no code implementations • 14 Apr 2014 • Zhenyao Zhu, Ping Luo, Xiaogang Wang, Xiaoou Tang
Face images in the wild undergo large intra-personal variations, such as poses, illuminations, occlusions, and low resolutions, which cause great challenges to face-related applications.
3 code implementations • 1 Jan 2014 • Yi Sun, Xiaogang Wang, Xiaoou Tang
When learned as classifiers to recognize about 10, 000 face identities in the training set and configured to keep reducing the neuron numbers along the feature extraction hierarchy, these deep ConvNets gradually form compact identity-related features in the top layers with only a small number of hidden neurons.
Ranked #6 on
Face Verification
on Labeled Faces in the Wild
no code implementations • CVPR 2013 • Jianzhou Yan, Stephen Lin, Sing Bing Kang, Xiaoou Tang
Image cropping is a common operation used to improve the visual quality of photographs.
no code implementations • CVPR 2013 • Yi Sun, Xiaogang Wang, Xiaoou Tang
At each level, the outputs of multiple networks are fused for robust and accurate estimation.
no code implementations • CVPR 2013 • Li-Min Wang, Yu Qiao, Xiaoou Tang
We postulate three key properties of motionlet for action recognition: high motion saliency, multiple scale representation, and representative-discriminative ability.
no code implementations • CVPR 2013 • Bolei Zhou, Xiaoou Tang, Xiaogang Wang
Collective motions are common in crowd systems and have attracted a great deal of attention in a variety of multidisciplinary fields.
2 code implementations • 25 Aug 2012 • Wei Zhang, Xiaogang Wang, Deli Zhao, Xiaoou Tang
We explore the different roles of two fundamental concepts in graph theory, indegree and outdegree, in the context of clustering.
Ranked #1 on
Image Clustering
on Coil-20
(Accuracy metric)
1 code implementation • IEEE Transactions on Pattern Analysis and Machine Intelligence 2010 • Kaiming He, Jian Sun, Xiaoou Tang
The dark channel prior is a kind of statistics of outdoor haze-free images.
Ranked #1 on
Single Image Haze Removal
on RESIDE
no code implementations • NeurIPS 2008 • Deli Zhao, Xiaoou Tang
A mathematical tool, Zeta function of a graph, is introduced for the integration of all cycles, leading to a structural descriptor of the cluster in determinantal form.