1 code implementation • ECCV 2020 • Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, Luc van Gool
This way, in an effort to fix localization errors, our loss provides an extra supervisory signal that helps the model to better discriminate between similar classes.
no code implementations • 15 Mar 2023 • Ye Huang, Di Kang, Shenghua Gao, Wen Li, Lixin Duan
One crucial design of the HFG is to protect the high-level features from being contaminated by using proper stop-gradient operations so that the backbone does not update according to the noisy gradient from the upsampler.
1 code implementation • 28 Feb 2023 • Wen Li, Cheng Zou, Meng Wang, Furong Xu, Jianan Zhao, Ruobing Zheng, Yuan Cheng, Wei Chu
In this paper, we propose a Diverse and Compact Transformer (DC-Former) that can achieve a similar effect by splitting embedding space into multiple diverse and compact subspaces.
no code implementations • 2 Feb 2023 • Zhean Shao, Wen Li, Ying Tan
When the model does not work well, it is triggered as the variable to tune to improve the performance.
1 code implementation • CVPR 2023 • Jinhong Deng, Dongli Xu, Wen Li, Lixin Duan
Self-training approaches recently achieved promising results in cross-domain object detection, where people iteratively generate pseudo labels for unlabeled target domain samples with a model, and select high-confidence samples to refine the model.
no code implementations • CVPR 2023 • Wen Li, Shangshu Yu, Cheng Wang, Guosheng Hu, Siqi Shen, Chenglu Wen
In this work, we propose a novel LiDAR localization framework, SGLoc, which decouples the pose estimation to point cloud correspondence regression and pose estimation via this correspondence.
1 code implementation • CVPR 2023 • Anqi Zhao, Tong Chu, Yahao Liu, Wen Li, Jingjing Li, Lixin Duan
On the algorithmic side, we derive a new algorithm for black-box targeted attacks based on our theoretical analysis, in which we additionally minimize the maximum model discrepancy(M3D) of the substitute models when training the generator to generate adversarial examples.
no code implementations • 21 Nov 2022 • Shaohua Zhi, Yinghui Wang, Haonan Xiao, Ti Bai, Hong Ge, Bing Li, Chenyang Liu, Wen Li, Tian Li, Jing Cai
Four-dimensional magnetic resonance imaging (4D-MRI) is an emerging technique for tumor motion management in image-guided radiation therapy (IGRT).
no code implementations • 29 Sep 2022 • Borun Xu, Biao Wang, Jinhong Deng, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
Motion transfer aims to transfer the motion of a driving video to a source image.
1 code implementation • 28 Sep 2022 • Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
Image animation aims to animate a source image by using motion learned from a driving video.
1 code implementation • CVPR 2022 • Dongli Xu, Jinhong Deng, Wen Li
However, a deep understanding of how AP loss affects the detector from a pairwise ranking perspective has not yet been developed. In this work, we revisit the average precision (AP)loss and reveal that the crucial element is that of selecting the ranking pairs between positive and negative samples. Based on this observation, we propose two strategies to improve the AP loss.
no code implementations • 4 Jul 2022 • Cheng Zou, Furong Xu, Meng Wang, Wen Li, Yuan Cheng
Automatic snake species recognition is important because it has vast potential to help lower deaths and disabilities caused by snakebites.
1 code implementation • 2 Jun 2022 • Chengyin Hu, Yilong Wang, Kalibinuer Tiliwalidi, Wen Li
It realizes robust and covert physical attack by using low-cost laser equipment.
no code implementations • 1 Jun 2022 • Jinhong Deng, Xiaoyue Zhang, Wen Li, Lixin Duan
In particular, we take advantage of the characteristics of cross-attention as used in detection transformer and propose the spatial-aware token alignment (SpaTA) and the semantic-aware token alignment (SemTA) strategies to guide the token alignment across domains.
1 code implementation • CVPR 2022 • Yawei Li, Kamil Adamczewski, Wen Li, Shuhang Gu, Radu Timofte, Luc van Gool
The proposed approach provides a new way to compare different methods, namely how well they behave compared with random pruning.
1 code implementation • CVPR 2022 • Yahao Liu, Jinhong Deng, Jiale Tao, Tong Chu, Lixin Duan, Wen Li
Existing works typically treat cross-domain semantic segmentation (CDSS) as a data distribution mismatch problem and focus on aligning the marginal distribution or conditional distribution.
1 code implementation • CVPR 2022 • Jiale Tao, Biao Wang, Borun Xu, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
Specifically, inspired by the known deformable part model (DPM), our DAM introduces two types of anchors or keypoints: i) a number of motion anchors that capture both appearance and motion information from the source image and driving video; ii) a latent root anchor, which is linked to the motion anchors to facilitate better learning of the representations of the object structure information.
no code implementations • CVPR 2022 • Fanyue Wei, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
To this end, we propose to learn pixel-level distinctions to improve the video highlight detection.
no code implementations • 2 Apr 2022 • Chengyin Hu, Weiwen Shi, Wen Li
Moreover, we validate the robustness of our approach by successfully attacking advanced DNNs with a success rate of over 75% in all cases.
1 code implementation • CVPR 2022 • Duo Peng, Yinjie Lei, Munawar Hayat, Yulan Guo, Wen Li
In this paper, we address domain generalized semantic segmentation, where a segmentation model is trained to be domain-invariant without using any target domain data.
no code implementations • 1 Apr 2022 • Yan Zhang, Changyu Li, Ivor W. Tsang, Hui Xu, Lixin Duan, Hongzhi Yin, Wen Li, Jie Shao
Motivated by the idea of meta-augmentation, in this paper, by treating a user's preference over items as a task, we propose a so-called Diverse Preference Augmentation framework with multiple source domains based on meta-learning (referred to as MetaDPA) to i) generate diverse ratings in a new domain of interest (known as target domain) to handle overfitting on the case of sparse interactions, and to ii) learn a preference model in the target domain via a meta-learning scheme to alleviate cold-start issues.
no code implementations • 13 Mar 2022 • Feiyu Wang, Qin Wang, Wen Li, Dong Xu, Luc van Gool
Benefited from this new perspective, we first propose a new deep semi-supervised learning framework called Semi-supervised Learning by Empirical Distribution Alignment (SLEDA), in which existing technologies from the domain adaptation community can be readily used to address the semi-supervised learning problem through reducing the empirical distribution distance between labeled and unlabeled data.
1 code implementation • CVPR 2022 • Hao Ni, Jingkuan Song, Xiaopeng Luo, Feng Zheng, Wen Li, Heng Tao Shen
Domain Generalizable (DG) person ReID is a challenging task which trains a model on source domains yet generalizes well on target domains.
Domain Generalization
Generalizable Person Re-identification
+1
1 code implementation • 19 Dec 2021 • Borun Xu, Biao Wang, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
Creative image animations are attractive in e-commerce applications, where motion transfer is one of the import ways to generate animations from static images.
no code implementations • 9 Dec 2021 • Wen Li, Furong Xu, Jianan Zhao, Ruobing Zheng, Cheng Zou, Meng Wang, Yuan Cheng
Triplet loss is a widely adopted loss function in ReID task which pulls the hardest positive pairs close and pushes the hardest negative pairs far away.
1 code implementation • ACM International Conference on Multimedia 2021 • Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan
By doing that, our proposed CDN method can better recognize unseen action instances by debiasing the effect of appearances.
no code implementations • SEMEVAL 2021 • Liang Zou, Wen Li
And then we construct Bidirectional Long Short Term Memory-Conditional Random Field (Bi-LSTM-CRF) model by Baidu research to predict whether each word in the sentence is toxic or not.
1 code implementation • ICCV 2021 • Duo Peng, Yinjie Lei, Wen Li, Pingping Zhang, Yulan Guo
Domain adaptation is critical for success when confronting with the lack of annotations in a new domain.
1 code implementation • CVPR 2021 • Weichen Zhang, Wen Li, Dong Xu
In this work, we propose a new cross-dataset 3D object detection method named Scale-aware and Range-aware Domain Adaptation Network (SRDAN).
no code implementations • 17 May 2021 • Andrey Ignatov, Andres Romero, Heewon Kim, Radu Timofte, Chiu Man Ho, Zibo Meng, Kyoung Mu Lee, Yuxiang Chen, Yutong Wang, Zeyu Long, Chenhao Wang, Yifei Chen, Boshen Xu, Shuhang Gu, Lixin Duan, Wen Li, Wang Bofei, Zhang Diankai, Zheng Chengjian, Liu Shaoli, Gao Si, Zhang Xiaofeng, Lu Kaidi, Xu Tianyu, Zheng Hui, Xinbo Gao, Xiumei Wang, Jiaming Guo, Xueyi Zhou, Hao Jia, Youliang Yan
Video super-resolution has recently become one of the most important mobile-related problems due to the rise of video communication and streaming services.
no code implementations • 26 Mar 2021 • Jiayi Tian, Jing Zhang, Wen Li, Dong Xu
On the other hand, we also design an effective distribution alignment method to reduce the distribution divergence between the virtual domain and the target domain by gradually improving the compactness of the target domain distribution through model learning.
1 code implementation • ICCV 2021 • Yahao Liu, Jinhong Deng, Xinchen Gao, Wen Li, Lixin Duan
By integrating the boundary adaptation and prototype alignment, we are able to train a discriminative and domain-invariant model for cross-domain semantic segmentation.
no code implementations • CVPR 2021 • Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc van Gool
Open compound domain adaptation (OCDA) is a domain adaptation setting, where target domain is modeled as a compound of multiple unknown homogeneous domains, which brings the advantage of improved generalization to unseen domains.
no code implementations • ICCV 2021 • Rui Gong, Dengxin Dai, Yuhua Chen, Wen Li, Luc van Gool
One challenge of object recognition is to generalize to new domains, to more classes and/or to new modalities.
1 code implementation • 8 Sep 2020 • Zhiyu Xue, Lixin Duan, Wen Li, Lin Chen, Jiebo Luo
For that, in this work, we propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works as in a neural network as well as to find out specific regions that are related to each other in images coming from the query and support sets.
Ranked #32 on
Few-Shot Image Classification
on CIFAR-FS 5-way (5-shot)
no code implementations • 7 Aug 2020 • Tao Wu, Ellie Ka-In Chio, Heng-Tze Cheng, Yu Du, Steffen Rendle, Dima Kuzmin, Ritesh Agarwal, Li Zhang, John Anderson, Sarvjeet Singh, Tushar Chandra, Ed H. Chi, Wen Li, Ankit Kumar, Xiang Ma, Alex Soares, Nitin Jindal, Pei Cao
In light of these problems, we observed that most online content platforms have both a search and a recommender system that, while having heterogeneous input spaces, can be connected through their common output item space and a shared semantic representation.
1 code implementation • ECCV 2020 • Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink
In this paper, we introduce a new reinforcement learning (RL) based neural architecture search (NAS) methodology for effective and efficient generative adversarial network (GAN) architecture search.
Ranked #12 on
Image Generation
on STL-10
1 code implementation • CVPR 2021 • Yawei Li, Wen Li, Martin Danelljan, Kai Zhang, Shuhang Gu, Luc van Gool, Radu Timofte
Based on that, we articulate the heterogeneity hypothesis: with the same training protocol, there exists a layer-wise differentiated network architecture (LW-DNA) that can outperform the original network with regular channel configurations but with a lower level of model complexity.
no code implementations • 28 Jun 2020 • Rui Gong, Dengxin Dai, Yu-Hua Chen, Wen Li, Luc van Gool
AIT achieves this zero-shot image translation capability by coupling a supervised training scheme in the synthetic domain, a cycle consistency strategy in the real domain, an adversarial training scheme between the two domains, and a novel network design.
no code implementations • 5 Apr 2020 • Minghao Fu, Zhenshan Xie, Wen Li, Lixin Duan
Cross-domain object detection has recently attracted more and more attention for real-world applications, since it helps build robust detectors adapting well to new environments.
1 code implementation • CVPR 2021 • Jinhong Deng, Wen Li, Yu-Hua Chen, Lixin Duan
We reveal that there often exists a considerable model bias for the simple mean teacher (MT) model in cross-domain scenarios, and eliminate the model bias with several simple yet highly effective strategies.
no code implementations • WS 2019 • Hai Hu, Wen Li, He Zhou, Zuoyu Tian, Yiwen Zhang, Liang Zou
This paper describes the IUCL system at VarDial 2019 evaluation campaign for the task of discriminating between Mainland and Taiwan variation of mandarin Chinese.
no code implementations • SEMEVAL 2019 • Ping Liu, Wen Li, Liang Zou
Transfer learning and domain adaptive learning have been applied to various fields including computer vision (e. g., image recognition) and natural language processing (e. g., text classification).
1 code implementation • ICCV 2019 • Qin Wang, Wen Li, Luc van Gool
We reveal that an essential sampling bias exists in semi-supervised learning due to the limited number of labeled samples, which often leads to a considerable empirical distribution mismatch between labeled data and unlabeled data.
1 code implementation • 3 May 2019 • Qing Lian, Wen Li, Lin Chen, Lixin Duan
Particularly, in open set domain adaptation, we allow the classes from the source and target domains to be partially overlapped.
1 code implementation • CVPR 2019 • Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool
In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.
Ranked #1 on
Video Generation
on TrailerFaces
1 code implementation • CVPR 2019 • Rui Gong, Wen Li, Yu-Hua Chen, Luc van Gool
In this work, we present a domain flow generation(DLOW) model to bridge two different domains by generating a continuous sequence of intermediate domains flowing from one domain to the other.
no code implementations • CVPR 2019 • Yuhua Chen, Wen Li, Xiaoran Chen, Luc van Gool
In this work, we take the advantage of additional geometric information from synthetic data, a powerful yet largely neglected cue, to bridge the domain gap.
no code implementations • ECCV 2018 • Dongang Wang, Wanli Ouyang, Wen Li, Dong Xu
We then train view-specific action classifiers based on the view-specific representation for each view and a view classifier based on the shared representation at lower layers.
1 code implementation • CVPR 2018 • Weichen Zhang, Wanli Ouyang, Wen Li, Dong Xu
In this paper, we propose a new unsupervised domain adaptation approach called Collaborative and Adversarial Network (CAN) through domain-collaborative and domain-adversarial training of neural networks.
no code implementations • WS 2018 • Hai Hu, Wen Li, Sandra Kübler
We present a machine learning approach to distinguish texts translated to Chinese (by humans) from texts originally written in Chinese, with a focus on a wide range of syntactic features.
no code implementations • 6 Apr 2018 • Dengxin Dai, Wen Li, Till Kroeger, Luc van Gool
We mitigate this by introducing ensemble manifold segmentation (EMS).
8 code implementations • CVPR 2018 • Yuhua Chen, Wen Li, Christos Sakaridis, Dengxin Dai, Luc van Gool
The results demonstrate the effectiveness of our proposed approach for robust object detection in various domain shift scenarios.
no code implementations • CVPR 2018 • Yuhua Chen, Wen Li, Luc van Gool
To this end, we propose a new reality oriented adaptation approach for urban scene semantic segmentation by learning from synthetic data.
1 code implementation • CVPR 2018 • Limin Wang, Wei Li, Wen Li, Luc van Gool
Specifically, SMART blocks decouple the spatiotemporal learning module into an appearance branch for spatial modeling and a relation branch for temporal modeling.
Ranked #50 on
Action Recognition
on UCF101
no code implementations • RANLP 2017 • Wen Li, Markus Dickinson
The goal of this project is to predict the gender of users based on their posts on Weibo, a Chinese micro-blogging platform.
no code implementations • WS 2017 • Wen Li, Liang Zou
We also report the performance of each feature type, as well as the best features of a type.
no code implementations • 9 Aug 2017 • Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Luc van Gool
Our new WebVision database and relevant studies in this work would benefit the advance of learning state-of-the-art visual models with minimum supervision based on web data.
no code implementations • 13 Jul 2017 • Yifei Wang, Wen Li, Dengxin Dai, Luc van Gool
Our work builds on the recently proposed Deep CORAL method, which proposed to train a convolutional neural network and simultaneously minimize the Euclidean distance of convariance matrices between the source and target domains.
1 code implementation • 8 Jun 2017 • Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc van Gool
In generative modeling, the Wasserstein distance (WD) has emerged as a useful metric to measure the discrepancy between generated and real data distributions.
no code implementations • 16 May 2017 • Wen Li, Li-Min Wang, Wei Li, Eirikur Agustsson, Jesse Berent, Abhinav Gupta, Rahul Sukthankar, Luc van Gool
The 2017 WebVision challenge consists of two tracks, the image classification task on WebVision test set, and the transfer learning task on PASCAL VOC 2012 dataset.
3 code implementations • 8 Oct 2016 • Wen Li, Ying Zhang, Yifang Sun, Wei Wang, Wenjie Zhang, Xuemin Lin
Approximate Nearest neighbor search (ANNS) is fundamental and essential operation in applications from many domains, such as databases, machine learning, multimedia, and computer vision.
Databases
2 code implementations • 12 Jul 2016 • Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi, Wen Li
In this paper, we propose a novel unsupervised domain adaptation algorithm based on deep learning for visual object recognition.
no code implementations • CVPR 2016 • Wen Li, Dengxin Dai, Mingkui Tan, Dong Xu, Luc van Gool
The SVM+ approach has shown excellent performance in visual recognition tasks for exploiting privileged information in the training data.
no code implementations • ICCV 2015 • Li Niu, Wen Li, Dong Xu
Considering the recent works show the domain generalization capability can be enhanced by fusing multiple SVM classifiers, we build upon exemplar SVMs to learn a set of SVM classifiers by using one positive sample and all negative samples in the source domain each time.
no code implementations • CVPR 2015 • Shijie Xiao, Wen Li, Dong Xu, DaCheng Tao
In this paper, we develop a fast LRR solver called FaLRR, by reformulating LRR as a new optimization problem with regard to factorized data (which is obtained by skinny SVD of the original data matrix).
no code implementations • CVPR 2015 • Li Niu, Wen Li, Dong Xu
In this work, we formulate a new weakly supervised domain generalization problem for the visual recognition task by using loosely labeled web images/videos as training data.
no code implementations • CVPR 2014 • Lin Chen, Wen Li, Dong Xu
In this work, we propose a new framework for recognizing RGB images captured by the conventional cameras by leveraging a set of labeled RGB-D data, in which the depth features can be additionally extracted from the depth images.
no code implementations • CVPR 2013 • Zhen Cui, Wen Li, Dong Xu, Shiguang Shan, Xilin Chen
Spatial-Temporal Face Region Descriptor, STFRD) for images (resp.