no code implementations • 21 Mar 2023 • Zhenyu Wang, YaLi Li, Xi Chen, Ser-Nam Lim, Antonio Torralba, Hengshuang Zhao, Shengjin Wang
In this paper, we formally address universal object detection, which aims to detect every scene and predict every category.
no code implementations • 7 Nov 2022 • Zhongdao Wang, Zhaopeng Dou, Jingwei Zhang, Liang Zheng, Yifan Sun, YaLi Li, Shengjin Wang
In this paper, we are interested in learning a generalizable person re-identification (re-ID) representation from unlabeled videos.
1 code implementation • 24 Oct 2022 • Zhaopeng Dou, Zhongdao Wang, Weihua Chen, YaLi Li, Shengjin Wang
(3) the data uncertainty and the model uncertainty are jointly learned in a unified network, and they serve as two fundamental criteria for the reliability assessment: if a probe is high-quality (low data uncertainty) and the model is confident in the prediction of the probe (low model uncertainty), the final ranking will be assessed as reliable.
1 code implementation • 20 Oct 2022 • Xin Liu, Zhongdao Wang, YaLi Li, Shengjin Wang
To cope with this issue, we propose Maximum Entropy Coding (MEC), a more principled objective that explicitly optimizes on the structure of the representation, so that the learned representation is less biased and thus generalizes better to unseen downstream tasks.
1 code implementation • 19 Oct 2022 • Xin Liu, Xiaofei Shao, Bo wang, YaLi Li, Shengjin Wang
First, unlike previous methods, we leverage convolution neural networks as well as graph neural networks in a complementary way for geometric representation learning.
no code implementations • 27 Jul 2022 • Yixuan Fan, Zhaopeng Dou, YaLi Li, Shengjin Wang
Furthermore, we focus on representation learning for portrait interpretation and propose a baseline that reflects our systematic perspective.
1 code implementation • 22 Jun 2022 • Yuhao Lu, Beixing Deng, Zhenyu Wang, Peiyuan Zhi, YaLi Li, Shengjin Wang
6-DoF grasp pose detection of multi-grasp and multi-object is a challenge task in the field of intelligent robot.
no code implementations • 2 Apr 2022 • Ya-Li Li, Shengjin Wang
In this paper, we propose a novel approach to combine decision trees and deep neural networks in an end-to-end learning manner for object detection.
1 code implementation • CVPR 2022 • Zhenyu Wang, YaLi Li, Shengjin Wang
We construct a framework for semi-supervised instance segmentation by assigning pixel-level pseudo labels.
no code implementations • CVPR 2022 • Dongchen Lu, Dongmei Li, YaLi Li, Shengjin Wang
By proposing the orientation-sensitive heatmap, OSKDet could learn the shape and direction of rotated target implicitly and has stronger modeling capabilities for rotated representation, which improves the localization accuracy and acquires high quality detection results.
no code implementations • CVPR 2022 • YaLi Li, Shengjin Wang
In this paper, we propose a novel approach to combine decision trees and deep neural networks in an end-to-end learning manner for object detection.
1 code implementation • 28 Dec 2021 • Jian Han, Ya-Li Li, Shengjin Wang
With the uncertainty-guided alternative optimization, we balance between the exploration of target domain data and the negative effects of noisy labeling.
Domain Adaptive Person Re-Identification
Person Re-Identification
no code implementations • 14 Dec 2021 • Yunzhong Hou, Zhongdao Wang, Shengjin Wang, Liang Zheng
In this paper, we design experiments to verify such misfit between global re-ID feature distances and local matching in tracking, and propose a simple yet effective approach to adapt affinity estimations to corresponding matching scopes in MTMCT.
no code implementations • NeurIPS 2021 • Zhenyu Wang, YaLi Li, Ye Guo, Shengjin Wang
To combat the noisy labeling, we propose noise-resistant semi-supervised learning by quantifying the region uncertainty.
1 code implementation • 15 Oct 2021 • Yaping Zhao, Mengqi Ji, Ruqi Huang, Bin Wang, Shengjin Wang
In this paper, we consider the problem of reference-based video super-resolution(RefVSR), i. e., how to utilize a high-resolution (HR) reference frame to super-resolve a low-resolution (LR) video sequence.
Reference-based Video Super-Resolution
Video Super-Resolution
no code implementations • ICCV 2021 • Takashi Isobe, Dong Li, Lu Tian, Weihua Chen, Yi Shan, Shengjin Wang
We observe that these proposed schemes are capable of facilitating the learning of discriminative feature representations.
1 code implementation • NeurIPS 2021 • Zhongdao Wang, Hengshuang Zhao, Ya-Li Li, Shengjin Wang, Philip H. S. Torr, Luca Bertinetto
We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered.
Ranked #2 on
Video Object Segmentation
on DAVIS 2017
(mIoU metric)
Multi-Object Tracking
Multi-Object Tracking and Segmentation
+10
no code implementations • 19 Jun 2021 • Jingtao Xu, YaLi Li, Shengjin Wang
In this paper, we propose a novel Adaptive Zoom (AdaZoom) network as a selective magnifier with flexible shape and focal length to adaptively zoom the focus regions for object detection.
no code implementations • CVPR 2021 • Miao Hu, YaLi Li, Lu Fang, Shengjin Wang
Learning pyramidal feature representations is crucial for recognizing object instances at different scales.
no code implementations • CVPR 2021 • Takashi Isobe, Xu Jia, Shuaijun Chen, Jianzhong He, Yongjie Shi, Jianzhuang Liu, Huchuan Lu, Shengjin Wang
To obtain a single model that works across multiple target domains, we propose to simultaneously learn a student model which is trained to not only imitate the output of each expert on the corresponding target domain, but also to pull different expert close to each other with regularization on their weights.
Ranked #3 on
Domain Adaptation
on GTAV to Cityscapes+Mapillary
no code implementations • 7 May 2021 • Miao Hu, YaLi Li, Lu Fang, Shengjin Wang
Learning pyramidal feature representations is crucial for recognizing object instances at different scales.
no code implementations • CVPR 2021 • Zhenyu Wang, YaLi Li, Ye Guo, Lu Fang, Shengjin Wang
In this paper, we delve into semi-supervised object detection where unlabeled images are leveraged to break through the upper bound of fully-supervised object detection models.
no code implementations • ICCV 2021 • Xuege Hou, YaLi Li, Shengjin Wang
For quantitative measure of the degree of disentanglement, we verify that mutual information can represent as metric.
no code implementations • ICCV 2021 • Jiahe Shi, YaLi Li, Shengjin Wang
Human-oriented image captioning with both high diversity and accuracy is a challenging task in vision+language modeling.
2 code implementations • 13 Aug 2020 • Takashi Isobe, Fang Zhu, Xu Jia, Shengjin Wang
Video super-resolution plays an important role in surveillance video analysis and ultra-high-definition video display, which has drawn much attention in both the research and industrial communities.
Ranked #1 on
Video Super-Resolution
on SPMCS - 4x upscaling
2 code implementations • ECCV 2020 • Takashi Isobe, Xu Jia, Shuhang Gu, Songjiang Li, Shengjin Wang, Qi Tian
Most video super-resolution methods super-resolve a single reference frame with the help of neighboring frames in a temporal sliding window.
1 code implementation • CVPR 2020 • Takashi Isobe, Songjiang Li, Xu Jia, Shanxin Yuan, Gregory Slabaugh, Chunjing Xu, Ya-Li Li, Shengjin Wang, Qi Tian
Video super-resolution, which aims at producing a high-resolution video from its corresponding low-resolution version, has recently drawn increasing attention.
no code implementations • ECCV 2020 • Zhongdao Wang, Jingwei Zhang, Liang Zheng, Yixuan Liu, Yifan Sun, Ya-Li Li, Shengjin Wang
This paper proposes a self-supervised learning method for the person re-identification (re-ID) problem, where existing unsupervised methods usually rely on pseudo labels, such as those from video tracklets or clustering.
1 code implementation • 27 Nov 2019 • Yunzhong Hou, Liang Zheng, Zhongdao Wang, Shengjin Wang
Due to the continuity of target trajectories, tracking systems usually restrict their data association within a local neighborhood.
no code implementations • IJCNLP 2019 • Zeynep Akkalyoncu Yilmaz, Shengjin Wang, Wei Yang, Haotian Zhang, Jimmy Lin
We present Birch, a system that applies BERT to document retrieval via integration with the open-source Anserini information retrieval toolkit to demonstrate end-to-end search over large document collections.
12 code implementations • ECCV 2020 • Zhongdao Wang, Liang Zheng, Yixuan Liu, Ya-Li Li, Shengjin Wang
In this paper, we propose an MOT system that allows target detection and appearance embedding to be learned in a shared model.
Ranked #4 on
Multi-Object Tracking
on HiEve
no code implementations • 4 Aug 2019 • Lanqing He, Zhongdao Wang, Ya-Li Li, Shengjin Wang
The softmax loss and its variants are widely used as objectives for embedding learning, especially in applications like face recognition.
no code implementations • 4 Aug 2019 • Yixuan Liu, Yuwang Wang, Shengjin Wang
To this end, we first design a differentiable depth map warping operation, which is end-to-end trainable, and then propose a pose generator to generate novel views for a given image in an adversarial manner.
no code implementations • 30 May 2019 • Ye Guo, Ya-Li Li, Shengjin Wang
Generic object detection is one of the most fundamental problems in computer vision, yet it is difficult to provide all the bounding-box-level annotations aiming at large-scale object detection for thousands of categories.
no code implementations • 5 May 2019 • Takashi Isobe, Jian Han, Fang Zhu, Ya-Li Li, Shengjin Wang
Video-based person re-identification has drawn massive attention in recent years due to its extensive applications in video surveillance.
no code implementations • 25 Apr 2019 • Ya-Li Li, Shengjin Wang
First, we present the modules of spatial attention, channel attention and aligned attention for single-stage object detection.
1 code implementation • CVPR 2019 • Yifan Sun, Qin Xu, Ya-Li Li, Chi Zhang, Yikang Li, Shengjin Wang, Jian Sun
The visibility awareness allows VPM to extract region-level features and compare two images with focus on their shared regions (which are visible on both images).
Ranked #14 on
Person Re-Identification
on Market-1501-C
4 code implementations • CVPR 2019 • Zhongdao Wang, Liang Zheng, Ya-Li Li, Shengjin Wang
The key idea is that we find the local context in the feature space around an instance (face) contains rich information about the linkage relationship between this instance and its neighbors.
no code implementations • CVPR 2019 • Yue Zheng, Ya-Li Li, Shengjin Wang
In this paper, we propose a novel approach for generating image captions with guiding objects (CGO).
no code implementations • 31 Oct 2018 • Zhongdao Wang, Liang Zheng, Shengjin Wang
That is to say, for some queries, a feature may be neither discriminative nor complementary to existing ones, while for other queries, the feature suffices.
no code implementations • 27 Nov 2017 • Lingxiao Wang, Ya-Li Li, Shengjin Wang
Comprehensive experiments demonstrate that our proposed method can handle various blur kenels and achieve state-of-the-art results for small size blurry face images restoration.
20 code implementations • ECCV 2018 • Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, Shengjin Wang
RPP re-assigns these outliers to the parts they are closest to, resulting in refined parts with enhanced within-part consistency.
Ranked #3 on
Person Re-Identification
on UAV-Human
1 code implementation • 12 Oct 2017 • Dong Li, Jia-Bin Huang, Ya-Li Li, Shengjin Wang, Ming-Hsuan Yang
In classification adaptation, we transfer a pre-trained network to a multi-label classification task for recognizing the presence of a certain object in an image.
no code implementations • ICCV 2017 • Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, Xiaogang Wang
In our vehicle ReID framework, an orientation invariant feature embedding module and a spatial-temporal regularization module are proposed.
1 code implementation • ICCV 2017 • Jingchun Cheng, Yi-Hsuan Tsai, Shengjin Wang, Ming-Hsuan Yang
This paper proposes an end-to-end trainable network, SegFlow, for simultaneously predicting pixel-wise object segmentation and optical flow in videos.
Ranked #63 on
Semi-Supervised Video Object Segmentation
on DAVIS 2016
no code implementations • 14 Sep 2017 • Jingchun Cheng, Sifei Liu, Yi-Hsuan Tsai, Wei-Chih Hung, Shalini De Mello, Jinwei Gu, Jan Kautz, Shengjin Wang, Ming-Hsuan Yang
In addition, we apply a filter on the refined score map that aims to recognize the best connected region using spatial and temporal consistencies in the video.
no code implementations • 5 Jun 2017 • Dong Li, Hsin-Ying Lee, Jia-Bin Huang, Shengjin Wang, Ming-Hsuan Yang
First, we exploit the discriminative constraints to capture the intra- and inter-class relationships of image embeddings.
no code implementations • 8 Apr 2017 • Lu Tian, Shengjin Wang
Person re-identification is generally divided into two part: first how to represent a pedestrian by discriminative visual descriptors and second how to compare them by suitable distance metrics.
no code implementations • ICCV 2017 • Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang
This paper proposes the SVDNet for retrieval problems, with focus on the application of person re-identification (re-ID).
Ranked #14 on
Person Re-Identification
on CUHK03 detected
no code implementations • CVPR 2016 • Dong Li, Jia-Bin Huang, Ya-Li Li, Shengjin Wang, Ming-Hsuan Yang
In this paper, we address this problem by progressive domain adaptation with two main steps: classification adaptation and detection adaptation.
no code implementations • 1 Apr 2016 • Liang Zheng, Yali Zhao, Shengjin Wang, Jingdong Wang, Qi Tian
The objective of this paper is the effective transfer of the Convolutional Neural Network (CNN) feature in image search and classification.
no code implementations • 23 Jan 2016 • Taiqing Wang, Shaogang Gong, Xiatian Zhu, Shengjin Wang
Current person re-identification (ReID) methods typically rely on single-frame imagery features, whilst ignoring space-time information from image sequences often available in the practical surveillance scenarios.
no code implementations • ICCV 2015 • Xu Zhang, Felix X. Yu, Ruiqi Guo, Sanjiv Kumar, Shengjin Wang, Shi-Fu Chang
We propose a family of structured matrices to speed up orthogonal projections for high-dimensional data commonly seen in computer vision applications.
no code implementations • ICCV 2015 • Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Qi Tian
As a minor contribution, inspired by recent advances in large-scale image search, this paper proposes an unsupervised Bag-of-Words descriptor.
Ranked #88 on
Person Re-Identification
on DukeMTMC-reID
no code implementations • CVPR 2015 • Liang Zheng, Shengjin Wang, Lu Tian, Fei He, Ziqiong Liu, Qi Tian
However, in a more realistic situation, one does not know in advance whether a feature is effective or not for a given query.
no code implementations • 2 Mar 2015 • Xu Zhang, Felix Xinnan Yu, Shih-Fu Chang, Shengjin Wang
In this paper, we propose a new domain adaptation framework named Deep Transfer Network (DTN), where the highly flexible deep neural networks are used to implement such a distribution matching process.
no code implementations • 7 Feb 2015 • Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jiahao Bu, Qi Tian
In the light of recent advances in image search, this paper proposes to treat person re-identification as an image search problem.
no code implementations • 4 Jun 2014 • Fei He, Shengjin Wang
To improve the performance of boundary detection, a Learning-based Boundary Metric (LBM) is proposed to replace $\chi^2$ difference adopted by the classical algorithm mPb.
no code implementations • 3 Jun 2014 • Ziqiong Liu, Shengjin Wang, Liang Zheng, Qi Tian
This paper introduces an improved reranking method for the Bag-of-Words (BoW) based image search.
no code implementations • 1 Jun 2014 • Liang Zheng, Shengjin Wang, Fei He, Qi Tian
Specifically, the Convolutional Neural Network (CNN) is employed to extract features from regional and global patches, leading to the so-called "Deep Embedding" framework.
no code implementations • CVPR 2014 • Liang Zheng, Shengjin Wang, Wengang Zhou, Qi Tian
Albeit simple, Bayes merging can be well applied in various merging tasks, and consistently improves the baselines on multi-vocabulary merging.
no code implementations • CVPR 2014 • Liang Zheng, Shengjin Wang, Ziqiong Liu, Qi Tian
Specifically, we exploit the fusion of local color feature into c-MI.
no code implementations • CVPR 2013 • Liang Zheng, Shengjin Wang, Ziqiong Liu, Qi Tian
Further, by counting for the term-frequency in each image, the proposed L p -norm IDF helps to alleviate the visual word burstiness phenomenon.