1 code implementation • 22 Nov 2023 • Yuxi Wang, Haibin Ling, Bingyao Huang
Full projector compensation is a practical task of projector-camera systems.
no code implementations • 26 Oct 2023 • Jie Wei, Weicong Feng, Erik Blasch, Erika Ardiles-Cruz, Haibin Ling
It is important to quantify Damage Assessment (DA) for Human Assistance and Disaster Response (HADR) applications.
no code implementations • 23 Oct 2023 • Weimin Lyu, Songzhu Zheng, Lu Pang, Haibin Ling, Chao Chen
Recent studies have revealed that \textit{Backdoor Attacks} can threaten the safety of natural language processing (NLP) models.
1 code implementation • 15 Sep 2023 • Gongyang Li, Zhen Bai, Zhi Liu, Xinpeng Zhang, Haibin Ling
KTM models the contextual correlation knowledge of two middle-level features of different scales based on the self-attention mechanism, and transfers the knowledge to the raw features to generate more discriminative features.
1 code implementation • 13 Sep 2023 • Kalyan Garigapati, Erik Blasch, Jie Wei, Haibin Ling
However, with the existing fusion techniques, the addition of new features causes a change in the latent space making it impossible to incorporate transparency awareness on trackers with fixed latent spaces.
no code implementations • 13 Sep 2023 • Yunfan Li, Himanshu Gupta, Haibin Ling, IV Ramakrishnan, Prateek Prasanna, Georgios Georgakis, Aaron Sasson
Compared with classical open cholecystectomy, laparoscopic cholecystectomy (LC) is associated with significantly shorter recovery period, and hence is the preferred method.
no code implementations • 8 Sep 2023 • Xi Yu, Huan-Hsin Tseng, Shinjae Yoo, Haibin Ling, Yuewei Lin
Specifically, we first propose an information theory inspired loss function to ensure the disentangled class-relevant features contain sufficient class label information and the other disentangled auxiliary feature has sufficient domain information.
1 code implementation • 19 Jul 2023 • Mingzhe Guo, Zhipeng Zhang, Liping Jing, Haibin Ling, Heng Fan
To thoroughly evidence the effectiveness of our method, we integrate the proposed framework on three tracking methods with different designs, i. e., the CNN-based SiamCAR, the Transformer-based OSTrack, and the hybrid structure TransT.
no code implementations • 13 Jul 2023 • Haoran Wang, Qinghua Cheng, Baosheng Yu, Yibing Zhan, Dapeng Tao, Liang Ding, Haibin Ling
We evaluated our method on three popular egocentric action recognition datasets, Something-Something V2, H2O, and EPIC-KITCHENS-100, and the experimental results demonstrate the effectiveness of the proposed method for handling data scarcity problems, including long-tailed and few-shot egocentric action recognition.
no code implementations • 4 Apr 2023 • Peiyao Wang, Haibin Ling
Fully supervised action segmentation works on frame-wise action recognition with dense annotations and often suffers from the over-segmentation issue.
1 code implementation • ICCV 2023 • Ruyi Lian, Haibin Ling
Firstly, CheckerPose densely samples 3D keypoints from the surface of the 3D object and finds their 2D correspondences progressively in the 2D image.
1 code implementation • 27 Mar 2023 • Tao Sun, Lu Pang, Chao Chen, Haibin Ling
It detects possible triggers in the token space using image structural similarity and label consistency between the test image and MAE restorations.
1 code implementation • CVPR 2023 • Caixia Zhou, Yaping Huang, Mengyang Pu, Qingji Guan, Li Huang, Haibin Ling
Deep learning-based edge detectors heavily rely on pixel-wise labels which are often provided by multiple annotators.
1 code implementation • 19 Mar 2023 • Srikar Yellapragada, Zhenghong Li, Kevin Bhadresh Doshi, Purva Makarand Mhasakar, Heng Fan, Jie Wei, Erik Blasch, Bin Zhang, Haibin Ling
In this paper, we present a meticulously crafted and annotated benchmark, called \textbf{CCTV-Gun}, which addresses the challenges of detecting handguns in real-world CCTV images.
1 code implementation • 17 Mar 2023 • Gongpei Zhao, Tao Wang, Yidong Li, Yi Jin, Congyan Lang, Haibin Ling
Backpropagation algorithm has been widely used as a mainstream learning procedure for neural networks in the past decade, and has played a significant role in the development of deep learning.
no code implementations • 1 Jan 2023 • Libo Zhang, Wenzhang Zhou, Heng Fan, Tiejian Luo, Haibin Ling
To reduce discrepancy in feature distributions between two domains, recent approaches achieve domain adaption through feature alignment in different granularities via adversarial learning.
1 code implementation • CVPR 2023 • Lu Pang, Tao Sun, Haibin Ling, Chao Chen
In experiments, we show that our method, trained without labels, is on-par with state-of-the-art defense methods trained using labels.
1 code implementation • ICCV 2023 • Tao Sun, Cheng Lu, Haibin Ling
In this paper, we propose a Local context-aware ADA framework, named LADA, to address this issue.
1 code implementation • 26 Aug 2022 • Tao Sun, Cheng Lu, Haibin Ling
We show that this strategy is more efficient and better correlated with the objective of boosting prediction confidence than adversarial training on input images or intermediate features, as used in previous works.
no code implementations • 9 Aug 2022 • Weimin Lyu, Songzhu Zheng, Tengfei Ma, Haibin Ling, Chao Chen
Trojan attacks pose a severe threat to AI systems.
2 code implementations • 4 Aug 2022 • Bolin Ni, Houwen Peng, Minghao Chen, Songyang Zhang, Gaofeng Meng, Jianlong Fu, Shiming Xiang, Haibin Ling
Extensive experiments demonstrate that our approach is effective and can be generalized to different video recognition scenarios.
Ranked #7 on
Zero-Shot Action Recognition
on Kinetics
1 code implementation • 18 Jul 2022 • Tao Sun, Cheng Lu, Haibin Ling
We propose a general rectification module that uses such prior knowledge to refine model generated pseudo labels.
no code implementations • 14 Jun 2022 • Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I. V. Ramakrishnan, Haibin Ling, Himanshu Gupta
Automatic recognition of surgical phases in surgical videos is a fundamental task in surgical workflow analysis.
1 code implementation • CVPR 2022 • Tao Sun, Cheng Lu, Tianshuo Zhang, Haibin Ling
Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich source domain to solve tasks on a related unlabeled target domain.
1 code implementation • 14 Apr 2022 • Dihan Zheng, Chenglong Bao, Zuoqiang Shi, Haibin Ling, Kaisheng Ma
The Chan-Vese (CV) model is a classic region-based method in image segmentation.
1 code implementation • 25 Mar 2022 • Gongyang Li, Zhi Liu, Dan Zeng, Weisi Lin, Haibin Ling
As the key component of ACCoNet, ACCoM activates the salient regions of output features of the encoder and transmits them to the decoder.
1 code implementation • CVPR 2022 • Mengyang Pu, Yaping Huang, Yuming Liu, Qingji Guan, Haibin Ling
In Stage I, a global transformer encoder is used to capture long-range global context on coarse-grained image patches.
no code implementations • CVPR 2022 • Jiaxiang Ren, Kicheon Park, Yingtian Pan, Haibin Ling
With the structural information and appearance feature from noisy image as references, our model can remove larger BMA and produce better visualizing result.
no code implementations • 10 Feb 2022 • Tao Zhou, Huazhu Fu, Chen Gong, Ling Shao, Fatih Porikli, Haibin Ling, Jianbing Shen
Besides, a novel constraint based on the Hilbert Schmidt Independence Criterion (HSIC) is introduced to ensure the diversity of multi-level subspace representations, which enables the complementarity of multi-level representations to be explored to boost the transfer learning performance.
no code implementations • 5 Jan 2022 • He Liu, Tao Wang, Yidong Li, Congyan Lang, Songhe Feng, Haibin Ling
Most previous learning-based graph matching algorithms solve the \textit{quadratic assignment problem} (QAP) by dropping one or more of the matching constraints and adopting a relaxed assignment solver to obtain sub-optimal correspondences.
1 code implementation • CVPR 2022 • Mingzhen Huang, Supreeth Narasimhaswamy, Saif Vazir, Haibin Ling, Minh Hoai
The first stage is Forward Propagation, where the features from frame t-1 are propagated to frame t based on previously detected hands and their estimated motion.
Ranked #1 on
Multiple Object Tracking
on YouTube-Hands
(using extra training data)
1 code implementation • 2 Dec 2021 • Liting Lin, Heng Fan, Zhipeng Zhang, Yong Xu, Haibin Ling
The potential of Transformer in representation learning remains under-explored.
Ranked #6 on
Visual Object Tracking
on TrackingNet
1 code implementation • 2 Dec 2021 • Gongyang Li, Zhi Liu, Weisi Lin, Haibin Ling
In this paper, we propose a novel Multi-Content Complementation Network (MCCNet) to explore the complementarity of multiple content for RSI-SOD.
1 code implementation • NeurIPS 2021 • Minghao Chen, Kan Wu, Bolin Ni, Houwen Peng, Bei Liu, Jianlong Fu, Hongyang Chao, Haibin Ling
Vision Transformer has shown great visual representation power in substantial vision tasks such as recognition and detection, and thus been attracting fast-growing efforts on manually designing more effective architectures.
no code implementations • 19 Oct 2021 • Heng Fan, Jiaxiang Ren, Jie Yang, Yi-Xian Qin, Haibin Ling
The aim of this study was to investigate whether a deep convolutional neural network (CNN) with an attention module can detect osteoporosis on panoramic radiographs.
no code implementations • 4 Oct 2021 • Gautham Ramajayam, Tao Sun, Chiu C. Tan, Lannan Luo, Haibin Ling
Many critical applications rely on cameras to capture video footage for analytical purposes.
2 code implementations • 1 Sep 2021 • He Liu, Tao Wang, Yidong Li, Congyan Lang, Yi Jin, Haibin Ling
In this paper, we propose a joint \emph{graph learning and matching} network, named GLAM, to explore reliable graph structures for boosting graph matching.
2 code implementations • International Joint Conference on Artificial Intelligence 2021 • Jingwei Qu, Haibin Ling, Chenrui Zhang, Xiaoqing Lyu, Zhi Tang
To explore the potential of edges, EAGM learns edge attention on the assignment graph to 1) reveal the impact of each edge on graph matching, as well as 2) adjust the learning of edge representations adaptively.
Ranked #7 on
Graph Matching
on PASCAL VOC
(matching accuracy metric)
1 code implementation • ICCV 2021 • Hong Wang, Yuefan Deng, Shinjae Yoo, Haibin Ling, Yuewei Lin
The attention knowledge is obtained from a weight-fixed model trained on a clean dataset, referred to as a teacher model, and transferred to a model that is under training on adversarial examples (AEs), referred to as a student model.
1 code implementation • ICCV 2021 • Mengyang Pu, Yaping Huang, Qingji Guan, Haibin Ling
Taking into consideration the distinct attributes of each type of edges and the relationship between them, RINDNet learns effective representations for each of them and works in three stages.
1 code implementation • 19 Jul 2021 • Dawei Du, Longyin Wen, Pengfei Zhu, Heng Fan, QinGhua Hu, Haibin Ling, Mubarak Shah, Junwen Pan, Ali Al-Ali, Amr Mohamed, Bakour Imene, Bin Dong, Binyu Zhang, Bouchali Hadia Nesma, Chenfeng Xu, Chenzhen Duan, Ciro Castiello, Corrado Mencar, Dingkang Liang, Florian Krüger, Gennaro Vessio, Giovanna Castellano, Jieru Wang, Junyu Gao, Khalid Abualsaud, Laihui Ding, Lei Zhao, Marco Cianciotta, Muhammad Saqib, Noor Almaadeed, Omar Elharrouss, Pei Lyu, Qi Wang, Shidong Liu, Shuang Qiu, Siyang Pan, Somaya Al-Maadeed, Sultan Daud Khan, Tamer Khattab, Tao Han, Thomas Golda, Wei Xu, Xiang Bai, Xiaoqing Xu, Xuelong Li, Yanyun Zhao, Ye Tian, Yingnan Lin, Yongchao Xu, Yuehan Yao, Zhenyu Xu, Zhijian Zhao, Zhipeng Luo, Zhiwei Wei, Zhiyuan Zhao
Crowd counting on the drone platform is an interesting topic in computer vision, which brings new challenges such as small object inference, background clutter and wide viewpoint.
2 code implementations • ICCV 2021 • Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling
Specifically, the performance of these subnets with weights inherited from the supernet is comparable to those retrained from scratch.
Ranked #1 on
Fine-Grained Image Classification
on Oxford 102 Flowers
(Top 1 Accuracy metric)
5 code implementations • 1 Jul 2021 • TingTing Liang, Xiaojie Chu, Yudong Liu, Yongtao Wang, Zhi Tang, Wei Chu, Jingdong Chen, Haibin Ling
With multi-scale testing, we push the current best single model result to a new record of 60. 1% box AP and 52. 3% mask AP without using extra training data.
Ranked #6 on
Object Detection
on COCO-O
(using extra training data)
no code implementations • 7 Jun 2021 • Yifeng Ding, Shuwei Dong, Yujun Tong, Zhanyu Ma, Bo Xiao, Haibin Ling
Classifying the sub-categories of an object from the same super-category (e. g., bird) in a fine-grained visual classification (FGVC) task highly relies on mining multiple discriminative features.
no code implementations • 28 May 2021 • Xinyi Li, Haibin Ling
Camera pose estimation or camera relocalization is the centerpiece in numerous computer vision tasks such as visual odometry, structure from motion (SfM) and SLAM.
1 code implementation • CVPR 2021 • Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling
In this paper, we propose a one-shot neural ensemble architecture search (NEAS) solution that addresses the two challenges.
no code implementations • 1 Apr 2021 • Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu
TransMOT effectively models the interactions of a large number of objects by arranging the trajectories of the tracked objects as a set of sparse weighted graphs, and constructing a spatial graph transformer encoder layer, a temporal transformer encoder layer, and a spatial graph transformer decoder layer based on the graphs.
Ranked #4 on
Multi-Object Tracking
on MOT16
(using extra training data)
1 code implementation • CVPR 2021 • TingTing Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling
Encouraged by the success, we propose a novel One-Shot Path Aggregation Network Architecture Search (OPANAS) algorithm, which significantly improves both searching efficiency and detection accuracy.
no code implementations • 9 Feb 2021 • Xinyi Li, Haibin Ling
Rotation averaging is a synchronization process on single or multiple rotation groups, and is a fundamental problem in many computer vision tasks such as multi-view structure from motion (SfM).
1 code implementation • 22 Jan 2021 • Gongyang Li, Zhi Liu, Ran Shi, Zheng Hu, Weijie Wei, Yong Wu, Mengke Huang, Haibin Ling
In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction.
1 code implementation • ICCV 2021 • Xinyi Li, Haibin Ling
Accurate camera pose estimation or global camera re-localization is a core component in Structure-from-Motion (SfM) and SLAM systems.
1 code implementation • ICCV 2021 • Xiaowei Liao, Yong Xu, Haibin Ling
Specifically, given two hypergraphs to be matched, we first construct an association hypergraph over them and convert the hypergraph matching problem into a node classification problem on the association hypergraph.
Ranked #9 on
Graph Matching
on Willow Object Class
1 code implementation • 22 Dec 2020 • Bingyao Huang, Ruyi Lian, Dimitris Samaras, Haibin Ling
Mail privacy protection aims to prevent unauthorized access to hidden content within an envelope since normal paper envelopes are not as safe as we think.
no code implementations • 19 Dec 2020 • Xuan Qin, Meizhu Liu, Yifan Hu, Christina Moo, Christian M. Riblet, Changwei Hu, Kevin Yen, Haibin Ling
In this paper, we propose a method that efficiently utilizes appearance features and text vectors to accurately classify political posters from other similar political images.
1 code implementation • 10 Dec 2020 • Bingyao Huang, Haibin Ling
Light-based adversarial attacks use spatial augmented reality (SAR) techniques to fool image classifiers by altering the physical light condition with a controllable light source, e. g., a projector.
no code implementations • 25 Nov 2020 • Heng Fan, Haibin Ling
The key is to bridge box regression and classification via an alignment step, which leads to more accurate features for proposal classification with improved robustness.
1 code implementation • CVPR 2021 • Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling
Multiple Object Tracking (MOT) has witnessed remarkable advances in recent years.
no code implementations • ICCV 2021 • Heng Fan, Halady Akhilesha Miththanthaya, Harshit, Siranjiv Ramana Rajan, Xiaoqiong Liu, Zhilin Zou, Yuewei Lin, Haibin Ling
To the best of our knowledge, TOTB is the first benchmark dedicated to transparent object tracking.
no code implementations • 2 Nov 2020 • Xinyi Li, Lin Yuan, Longin Jan Latecki, Haibin Ling
As an essential part of structure from motion (SfM) and Simultaneous Localization and Mapping (SLAM) systems, motion averaging has been extensively studied in the past years and continues to attract surging research attention.
1 code implementation • 8 Sep 2020 • Heng Fan, Hexin Bai, Liting Lin, Fan Yang, Peng Chu, Ge Deng, Sijia Yu, Harshit, Mingzhen Huang, Juehuan Liu, Yong Xu, Chunyuan Liao, Lin Yuan, Haibin Ling
The average video length of LaSOT is around 2, 500 frames, where each video contains various challenge factors that exist in real world video footage, such as the targets disappearing and re-appearing.
no code implementations • ECCV 2020 • Peng Chu, Xiao Bian, Shaopeng Liu, Haibin Ling
Real-world data often follow a long-tailed distribution as the frequency of each class is typically different.
Ranked #24 on
Long-tail Learning
on Places-LT
1 code implementation • 30 Jul 2020 • Bingyao Huang, Tao Sun, Haibin Ling
Full projector compensation aims to modify a projector input image to compensate for both geometric and photometric disturbance of the projection surface.
1 code implementation • 21 Jul 2020 • Jinxiu Liang, Jingwen Wang, Yuhui Quan, Tianyi Chen, Jiaying Liu, Haibin Ling, Yong Xu
REG produces progressively and efficiently intermediate images corresponding to various exposure settings, and such pseudo-exposures are then fused by MED to detect faces across different lighting conditions.
2 code implementations • ECCV 2020 • Gongyang Li, Zhi Liu, Linwei Ye, Yang Wang, Haibin Ling
In this paper, we propose a novel Cross-Modal Weighting (CMW) strategy to encourage comprehensive interactions between RGB and depth channels for RGB-D SOD.
Ranked #9 on
RGB-D Salient Object Detection
on NJU2K
no code implementations • 4 Jul 2020 • Jinxiu Liang, Yong Xu, Yuhui Quan, Jingwen Wang, Haibin Ling, Hui Ji
Low-light images, i. e. the images captured in low-light conditions, suffer from very poor visibility caused by low contrast, color distortion and significant measurement noise.
3 code implementations • 18 Jun 2020 • Hongyuan Yu, Houwen Peng, Yan Huang, Jianlong Fu, Hao Du, Liang Wang, Haibin Ling
First, the search network generates an initial architecture for evaluation, and the weights of the evaluation network are optimized.
Ranked #16 on
Neural Architecture Search
on NAS-Bench-201, CIFAR-10
no code implementations • 27 May 2020 • Hexin Bai, Peng Chu, Jeng-Yuan Tsai, Nathan Wilson, Xiaofeng Qian, Qimin Yan, Haibin Ling
Development of next-generation electronic devices for applications call for the discovery of quantum materials hosting novel electronic, magnetic, and topological properties.
1 code implementation • 27 May 2020 • Zhuoying Wang, Yongtao Wang, Zhi Tang, Yangyan Li, Ying Chen, Haibin Ling, Weisi Lin
Existing CNN-based methods for pixel labeling heavily depend on multi-scale features to meet the requirements of both semantic comprehension and detail preservation.
1 code implementation • 30 Mar 2020 • Junyi Feng, Songyuan Li, Xi Li, Fei Wu, Qi Tian, Ming-Hsuan Yang, Haibin Ling
Real-time semantic video segmentation is a challenging task due to the strict requirements of inference speed.
1 code implementation • CVPR 2020 • Tianfei Zhou, Wenguan Wang, Siyuan Qi, Haibin Ling, Jianbing Shen
The interaction recognition network has two crucial parts: a relation ranking module for high-quality HOI proposal selection and a triple-stream classifier for relation prediction.
no code implementations • 6 Mar 2020 • Bingyao Huang, Haibin Ling
In this paper, we propose a novel end-to-end trainable model named DeProCams to explicitly learn the photometric and geometric mappings of ProCams, and once trained, DeProCams can be applied simultaneously to the three tasks.
no code implementations • 9 Feb 2020 • Yifeng Ding, Shaoguo Wen, Jiyang Xie, Dongliang Chang, Zhanyu Ma, Zhongwei Si, Haibin Ling
Classifying the sub-categories of an object from the same super-category (e. g. bird species, car and aircraft models) in fine-grained visual classification (FGVC) highly relies on discriminative feature representation and accurate region localization.
1 code implementation • ICCV 2019 • Ziyi Shen, Wenguan Wang, Xiankai Lu, Jianbing Shen, Haibin Ling, Tingfa Xu, Ling Shao
This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG).
2 code implementations • 16 Jan 2020 • Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Heng Fan, QinGhua Hu, Haibin Ling
We provide a large-scale drone captured dataset, VisDrone, which includes four tracks, i. e., (1) image object detection, (2) video object detection, (3) single object tracking, and (4) multi-object tracking.
no code implementations • 20 Dec 2019 • Ting-Ting Liang, Yongtao Wang, Qijie Zhao, huan zhang, Zhi Tang, Haibin Ling
Feature pyramids are widely exploited in many detectors to solve the scale variation problem for object detection.
no code implementations • 15 Dec 2019 • Jianqing Jia, Semir Elezovikj, Heng Fan, Shuojin Yang, Jing Liu, Wei Guo, Chiu C. Tan, Haibin Ling
Our solution encodes the constraints for placing labels in an optimization problem to obtain the final label layout, and the labels will be placed in appropriate positions to reduce the chances of overlaying important real-world objects in street view AR scenarios.
1 code implementation • 8 Dec 2019 • Fan Yang, Cheng Lu, Yandong Guo, Longin Jan Latecki, Haibin Ling
Feature pyramid architecture has been broadly adopted in object detection and segmentation to deal with multi-scale problem.
1 code implementation • 26 Nov 2019 • Yang Yang, Xiaojie Guo, Jiayi Ma, Lin Ma, Haibin Ling
It is challenging to inpaint face images in the wild, due to the large variation of appearance, such as different poses, expressions and occlusions.
no code implementations • 18 Nov 2019 • Heng Fan, Fan Yang, Peng Chu, Lin Yuan, Haibin Ling
For the analysis component, given the tracking results on all sequences, it investigates the behavior of the tracker under each individual factor and generates the report automatically.
no code implementations • 7 Nov 2019 • Yu Pang, Xinyi Li, Lin Yuan, Haibin Ling
We then use different techniques to smooth the trajectories at certain degree.
6 code implementations • 9 Sep 2019 • Yudong Liu, Yongtao Wang, Siwei Wang, Ting-Ting Liang, Qijie Zhao, Zhi Tang, Haibin Ling
In existing CNN based detectors, the backbone network is a very important component for basic feature extraction, and the performance of the detectors highly depends on it.
Ranked #44 on
Instance Segmentation
on COCO test-dev
1 code implementation • ICCV 2019 • Bingyao Huang, Haibin Ling
In this paper, we propose the first end-to-end solution, named CompenNet++, to solve the two problems jointly.
no code implementations • 5 Aug 2019 • Xinyi Li, Haibin Ling
This paper presents a hybrid real-time camera pose estimation framework with a novel partitioning scheme and introduces motion averaging to monocular Simultaneous Localization and Mapping (SLAM) systems.
no code implementations • CVPR 2019 2019 • Jinzhan Su, Zhe Wang, Chunyuan Liao, Haibin Ling
In particular, for a given image, our algorithm first estimates its global facial shape through a global regression network (GRegNet) and then using cascaded local refinement networks (LRefNet) to sequentially improve the alignment result.
Ranked #13 on
Face Alignment
on 300W
no code implementations • 14 May 2019 • Penghui Sun, Jingwei Qu, Xiaoqing Lyu, Haibin Ling, Zhi Tang
Graph convolutional neural networks (GCNNs) have been attracting increasing research attention due to its great potential in inference over graph structures.
1 code implementation • 19 Apr 2019 • Wenguan Wang, Qiuxia Lai, Huazhu Fu, Jianbing Shen, Haibin Ling, Ruigang Yang
As an essential problem in computer vision, salient object detection (SOD) has attracted an increasing amount of research attention over the years.
1 code implementation • ICCV 2019 • Fan Yang, Heng Fan, Peng Chu, Erik Blasch, Haibin Ling
The key components in ClusDet include a cluster proposal sub-network (CPNet), a scale estimation sub-network (ScaleNet), and a dedicated detection network (DetecNet).
no code implementations • ICCV 2019 • Peng Chu, Haibin Ling
Data association-based multiple object tracking (MOT) involves multiple separated modules processed or optimized differently, which results in complex method design and requires non-trivial tuning of parameters.
1 code implementation • CVPR 2019 • Bingyao Huang, Haibin Ling
Such benchmark is not previously available, to our best knowledge, due to the fact that conventional evaluation requests the hardware system to actually project the final results.
no code implementations • 4 Apr 2019 • Minye Wu, Haibin Ling, Ning Bi, Shenghua Gao, Hao Sheng, Jingyi Yu
A natural solution to these challenges is to use multiple cameras with multiview inputs, though existing systems are mostly limited to specific targets (e. g. human), static cameras, and/or camera calibration.
18 code implementations • 28 Feb 2019 • Xiaojie Guo, Siyuan Li, Jinke Yu, Jiawan Zhang, Jiayi Ma, Lin Ma, Wei Liu, Haibin Ling
Being accurate, efficient, and compact is essential to a facial landmark detector for practical use.
1 code implementation • 26 Feb 2019 • Runsheng Zhang, Yaping Huang, Mengyang Pu, Jian Zhang, Qingji Guan, Qi Zou, Haibin Ling
To tackle this problem, we propose a simple but effective pattern mining-based method, called Object Location Mining (OLM), which exploits the advantages of data mining and feature representation of pre-trained convolutional neural networks (CNNs).
no code implementations • 21 Feb 2019 • Peng Chu, Heng Fan, Chiu C. Tan, Haibin Ling
To address this issue, in this paper we propose an instance-aware tracker to integrate SOT techniques for MOT by encoding awareness both within and between target models.
1 code implementation • 18 Jan 2019 • Fan Yang, Lei Zhang, Sijia Yu, Danil Prokhorov, Xue Mei, Haibin Ling
To demonstrate the superiority and generality of the proposed method, we evaluate the proposed method on five crack datasets and compare it with state-of-the-art crack detection, edge detection, semantic segmentation methods.
no code implementations • CVPR 2019 • Heng Fan, Haibin Ling
C-RPN is trained end-to-end with the multi-task loss function.
10 code implementations • 12 Nov 2018 • Qijie Zhao, Tao Sheng, Yongtao Wang, Zhi Tang, Ying Chen, Ling Cai, Haibin Ling
Finally, we gather up the decoder layers with equivalent scales (sizes) to develop a feature pyramid for object detection, in which every feature map consists of the layers (features) from multiple levels.
Ranked #148 on
Object Detection
on COCO test-dev
no code implementations • 9 Nov 2018 • Heng Fan, Peng Chu, Longin Jan Latecki, Haibin Ling
Recurrent neural networks (RNNs) have shown the ability to improve scene parsing through capturing long-range dependencies among image units.
no code implementations • 16 Oct 2018 • Danping Zou, Yuanxin Wu, Ling Pei, Haibin Ling, Wenxian Yu
Instead of using Manhattan world assumption, we use Atlanta world model to describe such regularity.
Robotics
1 code implementation • CVPR 2019 • Heng Fan, Liting Lin, Fan Yang, Peng Chu, Ge Deng, Sijia Yu, Hexin Bai, Yong Xu, Chunyuan Liao, Haibin Ling
In this paper, we present LaSOT, a high-quality benchmark for Large-scale Single Object Tracking.
no code implementations • 23 Jun 2018 • Yifan Wu, Fan Yang, Haibin Ling
In this paper, we propose a new framework called Privacy-Protective-GAN (PP-GAN) that adapts GAN with novel verificator and regulator modules specially designed for the face de-identification problem to ensure generating de-identified output with retained structure similarity according to a single input.
no code implementations • 15 May 2018 • Qin Zhou, Heng Fan, Hua Yang, Hang Su, Shibao Zheng, Shuang Wu, Haibin Ling
To address this problem, in this paper, we present a robust and efficient graph correspondence transfer (REGCT) approach for explicit spatial alignment in Re-ID.
1 code implementation • 4 May 2018 • Xin Liu, Zhikai Hu, Haibin Ling, Yiu-ming Cheung
More specifically, MTFH exploits an efficient objective function to flexibly learn the modality-specific hash codes with different length settings, while synchronously learning two semantic correlation matrices to semantically correlate the different hash representations for heterogeneous data comparable.
no code implementations • 20 Apr 2018 • Pengfei Zhu, Longyin Wen, Xiao Bian, Haibin Ling, QinGhua Hu
In this paper we present a large-scale visual object detection and tracking benchmark, named VisDrone2018, aiming at advancing visual understanding tasks on the drone platform.
no code implementations • 1 Apr 2018 • Qin Zhou, Heng Fan, Shibao Zheng, Hang Su, Xinzhe Li, Shuang Wu, Haibin Ling
In this paper, we propose a graph correspondence transfer (GCT) approach for person re-identification.
1 code implementation • 24 Mar 2018 • Bingyao Huang, Samed Ozdemir, Ying Tang, Chunyuan Liao, Haibin Ling
Existing camera-projector calibration methods typically warp feature points from a camera image to a projector image using estimated homographies, and often suffer from errors in camera parameters and noise due to imperfect planarity of the calibration target.
no code implementations • 22 Mar 2018 • Zhigang Chang, Qin Zhou, Heng Fan, Hang Su, Hua Yang, Shibao Zheng, Haibin Ling
Meanwhile, a weighting scheme is applied on the bilinear coding to adaptively adjust the weights of local features at different locations based on their importance in recognition, further improving the discriminability of feature aggregation.
no code implementations • 30 Jan 2018 • Heng Fan, Haibin Ling
Being intensively studied, visual object tracking has witnessed great advances in either speed (e. g., with correlation filters) or accuracy (e. g., with deep features).
no code implementations • 21 Jan 2018 • Heng Fan, Haibin Ling
Recently recurrent neural networks (RNNs) have demonstrated the ability to improve scene labeling through capturing long-range dependencies among image units.
1 code implementation • ICCV 2017 • Lei Zhu, Haibin Ling, Jin Wu, Huiping Deng, Jin Liu
We show that the linear combination of structured labels can well model the saliency distribution in local regions.
no code implementations • ICCV 2017 • Heng Fan, Haibin Ling
In this paper we study the problem from a new perspective and present a novel parallel tracking and verifying (PTAV) framework, by taking advantage of the ubiquity of multi-thread techniques and borrowing from the success of parallel tracking and mapping in visual SLAM.
no code implementations • 27 Mar 2017 • Yunlong Yu, Zhong Ji, Xi Li, Jichang Guo, Zhongfei Zhang, Haibin Ling, Fei Wu
As an important and challenging problem in computer vision, zero-shot learning (ZSL) aims at automatically recognizing the instances from unseen object classes without training data.
no code implementations • 23 Mar 2017 • Pengpeng Liang, Yifan Wu, Hu Lu, Liming Wang, Chunyuan Liao, Haibin Ling
In this paper, we present a carefully designed planar object tracking benchmark containing 210 videos of 30 planar objects sampled in the natural environment.
2 code implementations • IEEE TIP 2016 • Xiaojie Guo, Yu Li, Haibin Ling
When one captures images in low-light conditions, the images often suffer from low visibility.
Ranked #1 on
Low-Light Image Enhancement
on 10 Monkey Species
(using extra training data)
no code implementations • 21 Nov 2016 • Heng Fan, Haibin Ling
Convolutional neural network (CNN) has drawn increasing interest in visual tracking owing to its powerfulness in feature extraction.
no code implementations • 8 Jul 2016 • Heng Fan, Xue Mei, Danil Prokhorov, Haibin Ling
Context in image is crucial for scene labeling while existing methods only exploit local context generated from a small surrounding area of an image patch or a pixel, by contrast long-range and global contextual information is ignored.
no code implementations • CVPR 2016 • Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang
Due to its wide range of applications, matching between two graphs has been extensively studied and remains an active topic.
2 code implementations • 23 Mar 2016 • Dangwei Li, Zhang Zhang, Xiaotang Chen, Haibin Ling, Kaiqi Huang
RAP has in total 41, 585 pedestrian samples, each of which is annotated with 72 attributes as well as viewpoints, occlusions, body parts information.
no code implementations • 18 Jan 2016 • Ying Huang, Hong Zheng, Haibin Ling, Erik Blasch, Hao Yang
Bird strikes present a huge risk for aircraft, especially since traditional airport bird surveillance is mainly dependent on inefficient human observation.
no code implementations • ICCV 2015 • Peiyi Li, Haibin Ling, Xi Li, Chunyuan Liao
In this paper, we propose a real-time 3D hand pose estimation algorithm using the randomized decision forest framework.
no code implementations • 19 Oct 2015 • Xi Li, Liming Zhao, Lina Wei, Ming-Hsuan Yang, Fei Wu, Yueting Zhuang, Haibin Ling, Jingdong Wang
A key problem in salient object detection is how to effectively model the semantic properties of salient objects in a data-driven manner.
no code implementations • CVPR 2015 • Liang Du, Haibin Ling
As shown in our experiments, the algorithm effectively balances feature sharing and feature exclusion between the two tasks; and, for face verification, the algorithm effectively removes distracting features used in age verification.
no code implementations • 5 Jan 2015 • Pengpeng Liang, Chunyuan Liao, Xue Mei, Haibin Ling
Noting that the way we integrate objectness in visual tracking is generic and straightforward, we expect even more improvement by using tracker-specific objectness.
no code implementations • CVPR 2014 • Erkang Cheng, Yu Pang, Ying Zhu, Jingyi Yu, Haibin Ling
Robust tracking of deformable object like catheter or vascular structures in X-ray images is an important technique used in image guided medical interventions for effective motion compensation and dynamic multi-modality image fusion.
no code implementations • CVPR 2014 • Nianyi Li, Jinwei Ye, Yu Ji, Haibin Ling, Jingyi Yu
Existing saliency detection approaches use images as inputs and are sensitive to foreground/background similarities, complex background textures, and occlusions.
no code implementations • CVPR 2014 • Xinchu Shi, Haibin Ling, Weiming Hu, Chunfeng Yuan, Junliang Xing
In this paper, we model interactions between neighbor targets by pair-wise motion context, and further encode such context into the global association optimization.
no code implementations • CVPR 2013 • Chunfeng Yuan, Xi Li, Weiming Hu, Haibin Ling, Stephen Maybank
In this paper, we propose a new global feature to capture the detailed geometrical distribution of interest points.
no code implementations • CVPR 2013 • Xinchu Shi, Haibin Ling, Junling Xing, Weiming Hu
In this paper we formulate multi-target tracking (MTT) as a rank-1 tensor approximation problem and propose an 1 norm tensor power iteration solution.