3 code implementations • CVPR 2019 • Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang
In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.
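As a rough illustration of the two requirements named above, the sketch below measures the spread and the minimum of per-filter norms for one convolutional layer. It is a minimal sketch, not the authors' code: the function name `filter_norm_stats`, the weight layout `(out_channels, in_channels, kH, kW)`, and the use of the standard deviation as the "norm deviation" are all assumptions made for illustration.

```python
import numpy as np

def filter_norm_stats(weights, p=2):
    """Return per-filter p-norms, their standard deviation, and the minimum norm.

    `weights` is assumed to have shape (out_channels, in_channels, kH, kW);
    each output channel is treated as one filter.
    """
    flat = weights.reshape(weights.shape[0], -1)   # one row per filter
    norms = np.linalg.norm(flat, ord=p, axis=1)    # vector p-norm of each row
    return norms, float(norms.std()), float(norms.min())

# Hypothetical layer: 8 filters of size 3x3 over 3 input channels.
rng = np.random.default_rng(0)
weights = rng.normal(size=(8, 3, 3, 3))
norms, deviation, smallest = filter_norm_stats(weights)
print("norm deviation:", deviation, "minimum norm:", smallest)
```

A small deviation or a large minimum norm would suggest, under these assumptions, that ranking filters by norm alone separates them poorly.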
18 code implementations • 16 Aug 2017 • Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, Yi Yang
In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN).
Ranked #4 on Image Classification on Fashion-MNIST
4 code implementations • 2 Apr 2019 • Yi Yang, Baile Xu, Furao Shen, Jian Zhao
Many deep models are proposed to automatically learn high-order feature interactions.
12 code implementations • CVPR 2019 • Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz
To this end, we propose a joint learning framework that couples re-id learning and data generation end-to-end.
Ranked #1 on Person Re-Identification on UAV-Human
Image-to-Image Translation Unsupervised Domain Adaptation +1
3 code implementations • 14 Apr 2020 • Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, Tao Mei
This stage relaxes the full alignment between the training and testing domains, as it is agnostic to the target vehicle domain.
Ranked #1 on Vehicle Re-Identification on VehicleID
2 code implementations • CVPR 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao
To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image.
1 code implementation • 11 May 2023 • Yangming Cheng, Liulei Li, Yuanyou Xu, Xiaodi Li, Zongxin Yang, Wenguan Wang, Yi Yang
This report presents a framework called Segment And Track Anything (SAMTrack) that allows users to precisely and effectively segment and track any object in a video.
4 code implementations • NeurIPS 2019 • Xuanyi Dong, Yi Yang
The maximum probability for the size in each distribution serves as the width and depth of the pruned network, whose parameters are learned by knowledge transfer, e.g., knowledge distillation, from the original networks.
Ranked #1 on Network Pruning on CIFAR-10
6 code implementations • CVPR 2019 • Xuanyi Dong, Yi Yang
To avoid traversing all the possibilities of the sub-graphs, we develop a differentiable sampler over the DAG.
Ranked #18 on Neural Architecture Search on CIFAR-10
4 code implementations • ICCV 2019 • Xuanyi Dong, Yi Yang
In this paper, we propose a Self-Evaluated Template Network (SETN) to improve the quality of the architecture candidates for evaluation so that it is more likely to cover competitive candidates.
Ranked #18 on Neural Architecture Search on NAS-Bench-201, ImageNet-16-120 (Accuracy (Val) metric)
4 code implementations • ICLR 2020 • Xuanyi Dong, Yi Yang
A variety of algorithms search architectures under different search spaces.
3 code implementations • ICCV 2019 • Ruijie Quan, Xuanyi Dong, Yu Wu, Linchao Zhu, Yi Yang
We propose to automatically search for a CNN architecture that is specifically suitable for the reID task.
Ranked #9 on Person Re-Identification on CUHK03 detected
2 code implementations • ECCV 2020 • Zongxin Yang, Yunchao Wei, Yi Yang
This paper investigates the principles of embedding learning to tackle the challenging semi-supervised video object segmentation.
Ranked #8 on Video Object Segmentation on YouTube-VOS 2019
1 code implementation • CVPR 2020 • Linchao Zhu, Yi Yang
In this paper, we introduce ActBERT for self-supervised learning of joint video-text representations from unlabeled data.
Ranked #8 on Action Segmentation on COIN
1 code implementation • CVPR 2021 • Xiaohan Wang, Linchao Zhu, Yi Yang
Moreover, a global alignment method is proposed to provide a global cross-modal measurement that is complementary to the local perspective.
3 code implementations • 7 Nov 2022 • Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang
Generic motion understanding from video involves not only tracking objects, but also perceiving how their surfaces deform and move.
1 code implementation • ICCV 2023 • Carl Doersch, Yi Yang, Mel Vecerik, Dilara Gokay, Ankush Gupta, Yusuf Aytar, Joao Carreira, Andrew Zisserman
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried point on any physical surface throughout a video sequence.
Ranked #1 on Visual Tracking on Kinetics
2 code implementations • 1 Feb 2024 • Carl Doersch, Yi Yang, Dilara Gokay, Pauline Luc, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ross Goroshin, João Carreira, Andrew Zisserman
To endow models with greater understanding of physics and motion, it is useful to enable them to perceive how solid surfaces move and deform in real scenes.
2 code implementations • 22 Oct 2019 • Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang
To tackle the problem of learning with label noises, this work introduces a purification strategy, called Self-Correction for Human Parsing (SCHP), to progressively promote the reliability of the supervised labels as well as the learned models.
Ranked #2 on Human Part Segmentation on PASCAL-Part
1 code implementation • CVPR 2018 • Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang
In this work, we propose a style-aggregated approach to deal with the large intrinsic variance of image styles for facial landmark detection.
Ranked #2 on Facial Landmark Detection on AFLW-Front (Mean NME metric)
2 code implementations • ICCV 2019 • Xuanyi Dong, Yi Yang
A typical approach is to (1) train a detector on the labeled images; (2) generate new training samples using this detector's prediction as pseudo labels of unlabeled images; (3) retrain the detector on the labeled samples and partial pseudo labeled samples.
Ranked #1 on Facial Landmark Detection on 300W (Full) (using extra training data)
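The three-step pseudo-labeling loop described in the entry above can be sketched generically. This is a minimal sketch under stated assumptions: a nearest-centroid classifier stands in for the landmark detector, the margin between the two closest centroids stands in for prediction confidence, and the names `fit_centroids`, `predict_with_confidence`, `self_train`, and `keep_fraction` are hypothetical.

```python
import numpy as np

def fit_centroids(X, y, n_classes):
    """Train the stand-in model: one mean vector per class."""
    return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])

def predict_with_confidence(centroids, X):
    """Predict labels and a confidence score (gap between the two nearest centroids)."""
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)  # (n, C)
    labels = d.argmin(axis=1)
    sorted_d = np.sort(d, axis=1)
    margin = sorted_d[:, 1] - sorted_d[:, 0]
    return labels, margin

def self_train(X_lab, y_lab, X_unlab, n_classes, keep_fraction=0.5):
    # (1) train on the labeled samples
    centroids = fit_centroids(X_lab, y_lab, n_classes)
    # (2) use the model's predictions on unlabeled samples as pseudo labels
    pseudo, margin = predict_with_confidence(centroids, X_unlab)
    # (3) retrain on labeled samples plus the most confident pseudo-labeled part
    k = max(1, int(keep_fraction * len(X_unlab)))
    keep = np.argsort(-margin)[:k]
    X_new = np.concatenate([X_lab, X_unlab[keep]])
    y_new = np.concatenate([y_lab, pseudo[keep]])
    return fit_centroids(X_new, y_new, n_classes)
```

Keeping only a fraction of the pseudo-labeled samples mirrors the "partial pseudo labeled samples" wording; how that subset is chosen here (largest margin) is an illustrative choice.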
1 code implementation • 25 Jan 2021 • Xuanyi Dong, Yi Yang, Shih-En Wei, Xinshuo Weng, Yaser Sheikh, Shoou-I Yu
End-to-end training is made possible by differentiable registration and 3D triangulation modules.
1 code implementation • CVPR 2018 • Xuanyi Dong, Shoou-I Yu, Xinshuo Weng, Shih-En Wei, Yi Yang, Yaser Sheikh
In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video.
Ranked #1 on Facial Landmark Detection on 300-VW (C)
1 code implementation • ICCV 2021 • Yanbin Liu, Juho Lee, Linchao Zhu, Ling Chen, Humphrey Shi, Yi Yang
Most existing few-shot classification methods only consider generalization on one dataset (i.e., single-domain), failing to transfer across various seen and unseen domains.
2 code implementations • 22 Sep 2018 • Chao Yu, Zuxin Liu, Xinjun Liu, Fugui Xie, Yi Yang, Qi Wei, Qiao Fei
It is one of the state-of-the-art SLAM systems in high-dynamic environments.
Robotics
2 code implementations • CVPR 2022 • Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang
In this paper, we propose an episodic linear probing (ELP) classifier to reflect the generalization of visual representations in an online manner.
Ranked #13 on Fine-Grained Image Classification on CUB-200-2011
2 code implementations • NeurIPS 2021 • Zongxin Yang, Yunchao Wei, Yi Yang
The state-of-the-art methods learn to decode features with a single positive object and thus have to match and segment each target separately under multi-object scenarios, consuming several times the computing resources.
Ranked #2 on Video Object Segmentation on DAVIS 2017 (test-dev) (using extra training data)
2 code implementations • 22 Mar 2022 • Zongxin Yang, Jiaxu Miao, Yunchao Wei, Wenguan Wang, Xiaohan Wang, Yi Yang
This paper delves into the challenges of achieving scalable and effective multi-object modeling for semi-supervised Video Object Segmentation (VOS).
2 code implementations • 18 Oct 2022 • Zongxin Yang, Yi Yang
To solve such a problem and further facilitate the learning of visual embeddings, this paper proposes a Decoupling Features in Hierarchical Propagation (DeAOT) approach.
Ranked #1 on Semi-Supervised Video Object Segmentation on VOT2020
2 code implementations • 8 May 2023 • Yuanyou Xu, Zongxin Yang, Yi Yang
Considering the challenges in panoptic VOS, we propose a strong baseline method named panoptic object association with transformers (PAOT), which uses panoptic identification to associate objects with a pyramid architecture on multiple scales.
1 code implementation • 15 Jun 2020 • Yi Yang, Mark Christopher Siy UY, Allen Huang
Contextual pretrained language models, such as BERT (Devlin et al., 2019), have made significant breakthroughs in various NLP tasks by training on large-scale unlabeled text resources. The financial sector also accumulates a large amount of financial communication text. However, there are no pretrained finance-specific language models available.
29 code implementations • ECCV 2018 • Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, Shengjin Wang
RPP re-assigns these outliers to the parts they are closest to, resulting in refined parts with enhanced within-part consistency.
Ranked #3 on Person Re-Identification on UAV-Human
3 code implementations • 27 Feb 2020 • Zhedong Zheng, Yunchao Wei, Yi Yang
To our knowledge, University-1652 is the first drone-based geo-localization dataset and enables two new tasks, i.e., drone-view target localization and drone navigation.
Ranked #6 on Drone navigation on University-1652
2 code implementations • 18 Apr 2022 • Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Chenggang Yan, Yi Yang, Tat-Seng Chua
This task is mostly regarded as an image retrieval problem.
2 code implementations • 24 Dec 2019 • Zhedong Zheng, Yi Yang
We consider the unsupervised scene adaptation problem of learning from both labeled source data and unlabeled target data.
Ranked #1 on Domain Adaptation on SYNTHIA-to-Cityscapes Labels
3 code implementations • 8 Mar 2020 • Zhedong Zheng, Yi Yang
This paper focuses on the unsupervised domain adaptation of transferring the knowledge from the source domain to the target domain in the context of semantic segmentation.
6 code implementations • 21 Aug 2018 • Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, Yi Yang
Therefore, the network trained by our method has a larger model capacity to learn from the training data.
1 code implementation • 8 Feb 2024 • Dewei Zhou, You Li, Fan Ma, Xiaoting Zhang, Yi Yang
Lastly, we aggregate all the shaded instances to provide the necessary information for accurately generating multiple instances in stable diffusion (SD).
Ranked #1 on Conditional Text-to-Image Synthesis on COCO-MIG
1 code implementation • 13 Oct 2020 • Zongxin Yang, Yunchao Wei, Yi Yang
This paper investigates the principles of embedding learning to tackle the challenging semi-supervised video object segmentation.
3 code implementations • CVPR 2022 • Liulei Li, Tianfei Zhou, Wenguan Wang, Jianwu Li, Yi Yang
In this paper, we instead address hierarchical semantic segmentation (HSS), which aims at structured, pixel-wise description of visual observation in terms of a class hierarchy.
8 code implementations • ICCV 2017 • Zhedong Zheng, Liang Zheng, Yi Yang
We verify the proposed method on a practical problem: person re-identification (re-ID).
Ranked #4 on Person Re-Identification on CUHK03
Fine-Grained Image Classification Generative Adversarial Network +2
2 code implementations • CVPR 2019 • Guoliang Kang, Lu Jiang, Yi Yang, Alexander G. Hauptmann
Unsupervised Domain Adaptation (UDA) makes predictions for the target domain data while manual annotations are only available in the source domain.
Ranked #7 on Domain Adaptation on Office-31
2 code implementations • CVPR 2019 • Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, Yi Yang
To achieve this goal, an exemplar memory is introduced to store features of the target domain and accommodate the three invariance properties.
Domain Adaptive Person Re-Identification Person Re-Identification +1
1 code implementation • CVPR 2019 • Yawei Luo, Liang Zheng, Tao Guan, Junqing Yu, Yi Yang
We consider the problem of unsupervised domain adaptation in semantic segmentation.
Ranked #8 on Semantic Segmentation on DADA-seg
10 code implementations • CVPR 2018 • Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, Yi Yang
In this paper, we explicitly consider this challenge by introducing camera style (CamStyle) adaptation.
Ranked #71 on Person Re-Identification on DukeMTMC-reID
2 code implementations • 15 Nov 2017 • Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, Mingliang Xu, Yi-Dong Shen
In this paper, we propose a new system to discriminatively embed the image and text to a shared visual-textual space.
Ranked #1 on Cross-Modal Retrieval on CUHK-PEDES
4 code implementations • 17 Nov 2016 • Zhedong Zheng, Liang Zheng, Yi Yang
We revisit two popular convolutional neural networks (CNN) in person re-identification (re-ID), i.e., verification and classification models.
Ranked #1 on Person Re-Identification on Market-1501+500k
1 code implementation • 8 Jun 2020 • Zhedong Zheng, Nenggan Zheng, Yi Yang
To our knowledge, we are among the first attempts to conduct person re-identification in the 3D space.
3 code implementations • 5 Oct 2022 • Chen Liang, Wenguan Wang, Jiaxu Miao, Yi Yang
Going beyond this, we propose GMMSeg, a new family of segmentation models that rely on a dense generative classifier for the joint distribution p(pixel feature, class).
2 code implementations • ICLR 2019 • Yanbin Liu, Juho Lee, Minseop Park, Saehoon Kim, Eunho Yang, Sung Ju Hwang, Yi Yang
The goal of few-shot learning is to learn a classifier that generalizes well even when trained with a limited number of training instances per class.
1 code implementation • 3 Jul 2017 • Zhedong Zheng, Liang Zheng, Yi Yang
This task aims to search a query person in a large image pool.
Ranked #1 on Person Re-Identification on CUHK03 (detected)
1 code implementation • 30 May 2017 • Hehe Fan, Liang Zheng, Yi Yang
Progressively, pedestrian clustering and the CNN model are improved simultaneously until algorithm convergence.
Ranked #12 on Unsupervised Person Re-Identification on DukeMTMC-reID
2 code implementations • 29 Jan 2024 • Qingwen Zhang, Yi Yang, Heng Fang, Ruoyu Geng, Patric Jensfelt
Scene flow estimation determines a scene's 3D motion field, by predicting the motion of points in the scene, especially for aiding tasks in autonomous driving.
Ranked #1 on Scene Flow Estimation on Argoverse 2
1 code implementation • ECCV 2018 • Yawei Luo, Zhedong Zheng, Liang Zheng, Tao Guan, Junqing Yu, Yi Yang
To address the two kinds of inconsistencies, this paper proposes the Macro-Micro Adversarial Net (MMAN).
Ranked #12 on Semantic Segmentation on LIP val
1 code implementation • ICCV 2023 • Yuan Gan, Zongxin Yang, Xihang Yue, Lingyun Sun, Yi Yang
Audio-driven talking-head synthesis is a popular research topic for virtual human-related applications.
5 code implementations • CVPR 2023 • Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang
In this paper, we propose a novel framework called BIKE, which utilizes the cross-modal bridge to explore bidirectional knowledge: i) We introduce the Video Attribute Association mechanism, which leverages the Video-to-Text knowledge to generate textual auxiliary attributes for complementing video recognition.
Ranked #1 on Zero-Shot Action Recognition on ActivityNet
4 code implementations • CVPR 2019 • Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang
If the initial image is not well initialized, the following processes can hardly refine the image to a satisfactory quality.
Ranked #6 on Text-to-Image Generation on CUB (Inception score metric)
1 code implementation • 29 Nov 2022 • Shuangkang Fang, Weixin Xu, Heng Wang, Yi Yang, Yufeng Wang, Shuchang Zhou
In this paper, we propose Progressive Volume Distillation (PVD), a systematic distillation method that allows any-to-any conversions between different architectures, including MLP, sparse or low-rank tensors, hashtables and their compositions.
Ranked #1 on Novel View Synthesis on NeRF (Average PSNR metric)
1 code implementation • 8 Apr 2023 • Shuangkang Fang, Yufeng Wang, Yi Yang, Weixin Xu, Heng Wang, Wenrui Ding, Shuchang Zhou
To address this limitation and maximize the potential of each architecture, we propose Progressive Volume Distillation with Active Learning (PVD-AL), a systematic distillation method that enables any-to-any conversions between different architectures.
1 code implementation • NeurIPS 2023 • Guangyan Chen, Meiling Wang, Yi Yang, Kai Yu, Li Yuan, Yufeng Yue
Large language models (LLMs) based on the generative pre-training transformer (GPT) have demonstrated remarkable effectiveness across a diverse range of downstream tasks.
1 code implementation • CVPR 2021 • Hehe Fan, Yi Yang, Mohan Kankanhalli
To capture the dynamics in point cloud videos, point tracking is usually employed.
Ranked #4 on 3D Action Recognition on NTU RGB+D
1 code implementation • DeepMind 2022 • Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Skanda Koppula, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
We propose a novel multimodal benchmark – the Perception Test – that aims to extensively evaluate perception and reasoning skills of multimodal models.
2 code implementations • NeurIPS 2023 • Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
We propose a novel multimodal video benchmark - the Perception Test - to evaluate the perception and reasoning skills of pre-trained multimodal models (e.g., Flamingo, SeViLA, or GPT-4).
1 code implementation • ECCV 2018 • Xiaolin Zhang, Yunchao Wei, Guoliang Kang, Yi Yang, Thomas Huang
A stagewise approach is proposed to incorporate high confident object regions to learn the SPG masks.
Ranked #1 on Weakly-Supervised Object Localization on ILSVRC 2015
2 code implementations • 14 May 2017 • Shuchang Zhou, Taihong Xiao, Yi Yang, Dieqiao Feng, Qinyao He, Weiran He
In this work, we propose a model that can learn object transfiguration from two unpaired sets of images: one set containing images that "have" that kind of object, and the other set being the opposite, with the mild constraint that the objects be located approximately at the same place.
2 code implementations • 18 Oct 2019 • Hehe Fan, Yi Yang
We apply PointRNN, PointGRU and PointLSTM to moving point cloud prediction, which aims to predict the future trajectories of points in a set given their history movements.
2 code implementations • 8 Apr 2024 • Yufeng Yue, Yinan Deng, Jiahui Wang, Yi Yang
Implicit reconstruction of ESDF (Euclidean Signed Distance Field) involves training a neural network to regress the signed distance from any point to the nearest obstacle, which has the advantages of lightweight storage and continuous querying.
1 code implementation • ICCV 2023 • Liangqi Li, Jiaxu Miao, Dahu Shi, Wenming Tan, Ye Ren, Yi Yang, ShiLiang Pu
Current methods for open-vocabulary object detection (OVOD) rely on a pre-trained vision-language model (VLM) to acquire the recognition ability.
1 code implementation • 13 Nov 2021 • Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang
In this paper, a data-driven and local-verification (D²LV) approach is proposed to compete in the Image Similarity Challenge: Matching Track at NeurIPS'21.
1 code implementation • 4 Feb 2020 • Tong Liu, Zhaowei Chen, Yi Yang, Zehao Wu, Haowei Li
Nowadays, deep learning techniques are widely used for lane detection, but application in low-light conditions remains a challenge to this day.
1 code implementation • ECCV 2018 • Zhun Zhong, Liang Zheng, Shaozi Li, Yi Yang
Person re-identification (re-ID) poses unique challenges for unsupervised domain adaptation (UDA) in that classes in the source and target sets (domains) are entirely different and that image variations are largely caused by cameras.
1 code implementation • 17 May 2023 • Dewei Zhou, Zongxin Yang, Yi Yang
Recovering noise-covered details from low-light images is challenging, and the results given by previous methods leave room for improvement.
Ranked #6 on Low-Light Image Enhancement on LOL
1 code implementation • 8 Oct 2018 • Yang Wang, Zhenheng Yang, Peng Wang, Yi Yang, Chenxu Luo, Wei Xu
Then the whole scene is decomposed into moving foreground and static background by comparing the estimated optical flow and rigid flow derived from the depth and ego-motion.
3 code implementations • CVPR 2020 • Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang
This lightweight layer incorporates a simple l2 normalization, making our transformation unit applicable at the operator level without much increase in parameters.
1 code implementation • CVPR 2022 • Jiaxu Miao, Xiaohan Wang, Yu Wu, Wei Li, Xu Zhang, Yunchao Wei, Yi Yang
In contrast, our large-scale VIdeo Panoptic Segmentation in the Wild (VIPSeg) dataset provides 3,536 videos and 84,750 frames with pixel-level panoptic annotations, covering a wide range of real-world scenarios and categories.
1 code implementation • 2 May 2022 • Shuai Zhao, Linchao Zhu, Xiaohan Wang, Yi Yang
In this paper, to reduce the number of redundant video tokens, we design a multi-segment token clustering algorithm to find the most representative tokens and drop the non-essential ones.
Ranked #11 on Video Retrieval on MSVD (using extra training data)
1 code implementation • ACL 2019 • Yu Qin, Yi Yang
Prior research has shown that textual information in a firm's financial statement can be used to predict its stock's risk level.
1 code implementation • CVPR 2022 • Chen Liang, Wenguan Wang, Tianfei Zhou, Yi Yang
In this paper, we propose a new task and dataset, Visual Abductive Reasoning (VAR), for examining abductive reasoning ability of machine intelligence in everyday visual situations.
1 code implementation • 22 Oct 2018 • Xiaolin Zhang, Yunchao Wei, Yi Yang, Thomas Huang
In this way, the possibilities embedded in the produced similarity maps can be adapted to guide the process of segmenting objects.
Ranked #89 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)
1 code implementation • ICCV 2015 • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille
In particular, we propose a transposed weight sharing scheme, which not only improves performance on image captioning, but also makes the model more suitable for the novel concept learning task.
2 code implementations • 20 Dec 2014 • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille
In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions.
1 code implementation • CVPR 2021 • Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang
To better exploit the intrinsic structure of the target domain, we propose Domain Consensus Clustering (DCC), which exploits the domain consensus knowledge to discover discriminative clusters on both common samples and private ones.
Ranked #4 on Partial Domain Adaptation on Office-31
1 code implementation • 6 Mar 2023 • Wei Li, Linchao Zhu, Longyin Wen, Yi Yang
This decoder is both data-efficient and computation-efficient: 1) it only requires the text data for training, easing the burden on the collection of paired data.
1 code implementation • 13 Nov 2021 • Wenhao Wang, Weipu Zhang, Yifan Sun, Yi Yang
In this paper, a bag of tricks and a strong baseline are proposed for image copy detection.
1 code implementation • 3 Mar 2022 • Yongxing Dai, Yifan Sun, Jun Liu, Zekun Tong, Yi Yang, Ling-Yu Duan
Instead of directly aligning the source and target domains against each other, we propose to align the source and target domains against their intermediate domains for a smooth knowledge transfer.
1 code implementation • 23 Dec 2023 • MingWei Li, Jiachen Tao, Zongxin Yang, Yi Yang
In this paper, we introduce Human101, a novel framework adept at producing high-fidelity dynamic 3D human reconstructions from 1-view videos by training 3D Gaussians in 100 seconds and rendering in 100+ FPS.
1 code implementation • ECCV 2020 • Guangrui Li, Guoliang Kang, Wu Liu, Yunchao Wei, Yi Yang
The target of CCM is to acquire those synthetic images that share similar distribution with the real ones in the target domain, so that the domain gap can be naturally alleviated by employing the content-consistent synthetic images for training.
Ranked #12 on Semantic Segmentation on GTAV-to-Cityscapes Labels
1 code implementation • 31 May 2021 • Shuai Bai, Zhedong Zheng, Xiaohan Wang, Junyang Lin, Zhu Zhang, Chang Zhou, Yi Yang, Hongxia Yang
In this paper, we apply one new modality, i.e., the language description, to search the vehicle of interest and explore the potential of this task in the real-world scenario.
1 code implementation • CVPR 2022 • Xuanmeng Zhang, Zhedong Zheng, Daiheng Gao, Bang Zhang, Pan Pan, Yi Yang
To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints.
1 code implementation • 26 Jun 2017 • Xuanyi Dong, Liang Zheng, Fan Ma, Yi Yang, Deyu Meng
Experiments on PASCAL VOC'07, MS COCO'14, and ILSVRC'13 indicate that by using as few as three or four samples selected for each category, our method produces very competitive results when compared to the state-of-the-art weakly-supervised approaches using a large number of image-level labels.
Ranked #1 on Weakly Supervised Object Detection on MS COCO
1 code implementation • NeurIPS 2020 • Guoliang Kang, Yunchao Wei, Yi Yang, Yueting Zhuang, Alexander G. Hauptmann
The conventional solution to this task is to minimize the discrepancy between source and target to enable effective knowledge transfer.
Ranked #25 on Synthetic-to-Real Translation on SYNTHIA-to-Cityscapes
1 code implementation • NeurIPS 2023 • Zechuan Zhang, Li Sun, Zongxin Yang, Ling Chen, Yi Yang
Reconstructing 3D clothed human avatars from single images is a challenging task, especially when encountering complex poses and loose clothing.
1 code implementation • ICLR 2021 • Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli
Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.
Ranked #3 on 3D Action Recognition on NTU RGB+D
1 code implementation • CVPR 2023 • Xiaohan Wang, Wenguan Wang, Jiayi Shao, Yi Yang
Recently, visual-language navigation (VLN) -- entailing robot agents to follow navigation instructions -- has shown great advance.
1 code implementation • ICCV 2019 • Zongxin Yang, Jian Dong, Ping Liu, Yi Yang, Shuicheng Yan
The second challenge is how to maintain high quality in generated results, especially for multi-step generations in which generated regions are spatially far away from the initial input.
1 code implementation • 14 Nov 2022 • Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua
In an attempt to fill this gap, we propose a unified pixel- and patch-wise self-supervised learning framework, called PiPa, for domain adaptive semantic segmentation that facilitates intra-image pixel-wise correlations and patch-wise semantic consistency against different contexts.
Ranked #1 on Semantic Segmentation on SYNTHIA-to-Cityscapes
1 code implementation • 14 Dec 2020 • Xuanmeng Zhang, Minyue Jiang, Zhedong Zheng, Xiao Tan, Errui Ding, Yi Yang
We argue that the first phase equals building the k-nearest neighbor graph, while the second phase can be viewed as spreading the message within the graph.
Ranked #1 on Image Retrieval on Oxford5k
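The two-phase view in the entry above (build a k-nearest-neighbor graph, then spread messages within it) can be sketched as follows. This is a minimal sketch, not the paper's implementation: cosine similarity, uniform edge weights, the mixing weight `alpha`, and the function names `knn_graph` and `propagate` are assumptions made for illustration.

```python
import numpy as np

def knn_graph(features, k):
    """Phase 1: row-normalized adjacency of the k-nearest-neighbor graph."""
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    sim = f @ f.T                              # cosine similarity between all pairs
    np.fill_diagonal(sim, -np.inf)             # exclude self-matches
    idx = np.argsort(-sim, axis=1)[:, :k]      # k most similar items per row
    adj = np.zeros_like(sim)
    rows = np.arange(len(f))[:, None]
    adj[rows, idx] = 1.0
    return adj / adj.sum(axis=1, keepdims=True)

def propagate(features, adj, steps=2, alpha=0.5):
    """Phase 2: mix each feature with the mean of its neighbors, repeated `steps` times."""
    out = features.copy()
    for _ in range(steps):
        out = (1 - alpha) * out + alpha * (adj @ out)
    return out

# Hypothetical gallery of 6 items with 4-dimensional features.
rng = np.random.default_rng(0)
features = rng.normal(size=(6, 4))
adj = knn_graph(features, k=2)
refined = propagate(features, adj)
```

Re-ranking would then compare the refined features instead of the original ones; that final step is left out here.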
1 code implementation • 18 Oct 2017 • John Peurifoy, Yichen Shen, Li Jing, Yi Yang, Fidel Cano-Renteria, Brendan Delacy, Max Tegmark, John D. Joannopoulos, Marin Soljacic
We propose a method to use artificial neural networks to approximate light scattering by multilayer nanoparticles.
Computational Physics Applied Physics Optics
1 code implementation • CVPR 2021 • Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc van Gool
To address the challenging task of instance-aware human part parsing, a new bottom-up regime is proposed to learn category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner.
1 code implementation • ECCV 2020 • Fan Ma, Linchao Zhu, Yi Yang, Shengxin Zha, Gourab Kundu, Matt Feiszli, Zheng Shou
To obtain the single-frame supervision, the annotators are asked to identify only a single frame within the temporal window of an action.
Ranked #5 on Weakly Supervised Action Localization on BEOID
2 code implementations • NeurIPS 2021 • Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei
Directly performing cross-attention may aggregate these features from support to query and bias the query features.
Ranked #52 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
1 code implementation • NeurIPS 2020 • Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang
We aim at the problem named One-Shot Unsupervised Domain Adaptation.
domain classification One-shot Unsupervised Domain Adaptation +2
1 code implementation • 26 Aug 2020 • Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, Yi Yang
Existing methods usually concentrate on mining the fine-grained feature of the geographic target in the image center, but underestimate the contextual information in neighbor areas.
Ranked #3 on Drone navigation on University-1652
1 code implementation • 10 Mar 2024 • Wenhao Wang, Yifan Sun, Yi Yang
However, Sora, along with other text-to-video diffusion models, is highly reliant on prompts, and there is no publicly available dataset that features a study of text-to-video prompts.
1 code implementation • IJCNLP 2019 • Hugh Perkins, Yi Yang
We introduce the dialog intent induction task and present a novel deep multi-view clustering approach to tackle the problem.
1 code implementation • 1 Feb 2024 • Chao Liang, Fan Ma, Linchao Zhu, Yingying Deng, Yi Yang
Moreover, we introduce the 3D facial prior to equip our model with control over the human head in a flexible and 3D-consistent manner.
1 code implementation • 27 Apr 2022 • Zhedong Zheng, Jiayin Zhu, Wei Ji, Yi Yang, Tat-Seng Chua
This research aims to study a self-supervised 3D clothing reconstruction method, which recovers the geometry shape and texture of human clothing from a single image.
Ranked #1 on Single-View 3D Reconstruction on CUB-200-2011
1 code implementation • 30 Mar 2017 • Zhichao Li, Yi Yang, Xiao Liu, Feng Zhou, Shilei Wen, Wei Xu
We propose a dynamic computational time model to accelerate the average processing time for recurrent visual attention (RAM).
1 code implementation • 21 Dec 2016 • Xingzhong Du, Hongzhi Yin, Ling Chen, Yang Wang, Yi Yang, Xiaofang Zhou
In the existing video recommender systems, the models make the recommendations based on the user-video interactions and single specific content features.
2 code implementations • NAACL 2021 • Derek Chen, Howard Chen, Yi Yang, Alex Lin, Zhou Yu
Existing goal-oriented dialogue datasets focus mainly on identifying slots and values.
1 code implementation • 23 May 2023 • Shuai Zhao, Xiaohan Wang, Linchao Zhu, Ruijie Quan, Yi Yang
With such merits, we transform CLIP into a scene text reader and introduce CLIP4STR, a simple yet effective STR method built upon image and text encoders of CLIP.
Ranked #1 on Scene Text Recognition on WOST (using extra training data)
1 code implementation • 15 Sep 2023 • Yi Yang, Yixuan Tang, Kar Yan Tam
We present a new financial domain large language model, InvestLM, tuned on LLaMA-65B (Touvron et al., 2023), using a carefully curated instruction dataset related to financial investment.
1 code implementation • CVPR 2023 • Xiaolong Shen, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
However, a single kind of modeling structure struggles to balance the learning of short-term and long-term temporal correlations and may bias the network toward one of them, leading to undesirable predictions such as global location shift, temporal inconsistency, and insufficient local details.
Ranked #46 on 3D Human Pose Estimation on 3DPW
1 code implementation • EMNLP 2020 • Yi Yang, Arzoo Katiyar
We present a simple few-shot named entity recognition (NER) system based on nearest neighbor learning and structured inference.
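The nearest neighbor idea in the entry above can be sketched as follows. This is a minimal illustration, not the authors' system: the function names, the toy 2-D token vectors, and the Euclidean distance are assumptions made for the example, and the structured inference step mentioned in the abstract is omitted entirely.

```python
import math

def euclidean(a, b):
    # Plain Euclidean distance between two equal-length vectors.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_neighbor_tags(support, query):
    # support: list of (token_vector, tag) pairs from the few labeled examples.
    # query: list of token vectors to label.
    # Each query token copies the tag of its closest support token.
    tags = []
    for q in query:
        _, tag = min(support, key=lambda item: euclidean(item[0], q))
        tags.append(tag)
    return tags

# Hypothetical toy data: one labeled token per tag.
support = [([0.0, 0.0], "O"), ([1.0, 1.0], "PER"), ([5.0, 5.0], "LOC")]
query = [[0.9, 1.2], [4.8, 5.1], [0.1, -0.1]]
print(nearest_neighbor_tags(support, query))  # ['PER', 'LOC', 'O']
```

In practice the token vectors would come from a trained encoder, and a sequence-level decoder would replace the independent per-token decision shown here.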
2 code implementations • CVPR 2023 • Wei Shang, Dongwei Ren, Yi Yang, Hongzhi Zhang, Kede Ma, WangMeng Zuo
Moreover, on the seemingly implausible x16 interpolation task, our method outperforms existing methods by more than 1.5 dB in terms of PSNR.
2 code implementations • EMNLP 2018 • Yi Yang
We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters.
Ranked #11 on Sentiment Analysis on SST-5 Fine-grained classification
1 code implementation • CVPR 2022 • Fan Ma, Mike Zheng Shou, Linchao Zhu, Haoqi Fan, Yilei Xu, Yi Yang, Zhicheng Yan
Although UniTrack demonstrates that a shared appearance model with multiple heads can be used to tackle individual tracking tasks, it fails to exploit the large-scale tracking datasets for training and performs poorly on single object tracking.
1 code implementation • TACL 2017 • Yi Yang, Jacob Eisenstein
Variation in language is ubiquitous, particularly in newer forms of writing such as social media.
2 code implementations • 7 Sep 2018 • Zhedong Zheng, Liang Zheng, Yi Yang, Fei Wu
Opposite-Direction Feature Attack (ODFA) effectively exploits feature-level adversarial gradients and takes advantage of feature distance in the representation space.
1 code implementation • 8 Jan 2024 • Chuyang Zhao, Yifan Sun, Wenhao Wang, Qiang Chen, Errui Ding, Yi Yang, Jingdong Wang
The traditional training procedure using one-to-one supervision in the original DETR lacks direct supervision for the object detection candidates.
1 code implementation • 16 Oct 2015 • Linnan Wang, Wei Wu, Jianxiong Xiao, Yi Yang
Basic Linear Algebra Subprograms (BLAS) are a set of low-level linear algebra kernels widely adopted by applications in deep learning and scientific computing.
Distributed, Parallel, and Cluster Computing
1 code implementation • 18 Mar 2022 • Chen Liang, Wenguan Wang, Tianfei Zhou, Jiaxu Miao, Yawei Luo, Yi Yang
We explore the task of language-guided video segmentation (LVS).
Ranked #7 on Referring Expression Segmentation on A2D Sentences
Referring Expression Segmentation Referring Video Object Segmentation +5
2 code implementations • 29 Mar 2021 • Zhedong Zheng, Yi Yang
Domain adaptation transfers the shared knowledge learned from the source domain to a new environment, i.e., the target domain.
1 code implementation • ICLR 2020 • Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng
Black-box attack methods aim to infer suitable attack patterns to targeted DNN models by only using output feedback of the models and the corresponding input queries.
1 code implementation • 13 Sep 2021 • Yi Yang, Daoye Zhu, Tengteng Qu, Qiangyu Wang, Fuhu Ren, Chengqi Cheng
In the experiments, the proposed method is applied to ResNet and UNet, and the adjusted networks are verified on three very diverse benchmark data sets (i.e., Houston2018 data, Berlin data, and MUUFL data).
1 code implementation • 27 Jan 2024 • Yixuan Tang, Yi Yang
We hope MultiHop-RAG will be a valuable resource for the community in developing effective RAG systems, thereby facilitating greater adoption of LLMs in practice.
1 code implementation • 14 Mar 2024 • Yinan Deng, Jiahui Wang, Jingyu Zhao, Xinyu Tian, Guangyan Chen, Yi Yang, Yufeng Yue
In this work, we propose OpenGraph, the first open-vocabulary hierarchical graph representation designed for large-scale outdoor environments.
1 code implementation • 15 Dec 2019 • Yanyan Wei, Zhao Zhang, Yang Wang, Mingliang Xu, Yi Yang, Shuicheng Yan, Meng Wang
However, in practice it is rather common to have no un-paired images in real deraining task, in such cases how to remove the rain streaks in an unsupervised way will be a very challenging task due to lack of constraints between images and hence suffering from low-quality recovery results.
2 code implementations • 13 Oct 2022 • Jian-Wei Zhang, Yifan Sun, Yi Yang, Wei Chen
With a rethink of recent advances, we find that the current FSS framework has deviated far from the supervised segmentation framework: Given the deep features, FSS methods typically use an intricate decoder to perform sophisticated pixel-wise matching, while the supervised segmentation methods use a simple linear classification head.
2 code implementations • 16 Sep 2015 • Lichao Huang, Yi Yang, Yafeng Deng, Yinan Yu
How can a single fully convolutional neural network (FCN) perform on object detection?
1 code implementation • ICCV 2021 • Yuhang Ding, Xin Yu, Yi Yang
In this work, we propose a Region-aware Fusion Network (RFNet) that is able to exploit different combinations of multi-modal data adaptively and effectively for tumor segmentation.
Ranked #69 on Semantic Segmentation on NYU Depth v2
1 code implementation • ICCV 2019 • Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang
In this paper, we exploit the semantic structure of open set data from two aspects: 1) Semantic Categorical Alignment, which aims to achieve good separability of target known classes by categorically aligning the centroid of target with the source.
1 code implementation • 16 Jan 2024 • Zongxin Yang, Guikun Chen, Xiaodi Li, Wenguan Wang, Yi Yang
Recent LLM-driven visual agents mainly focus on solving image-based tasks, which limits their ability to understand dynamic scenes, making it far from real-life applications like guiding students in laboratory experiments and identifying their mistakes.
1 code implementation • 14 Dec 2014 • Yi Yang, Jacob Eisenstein
Representation learning is the dominant technique for unsupervised domain adaptation, but existing approaches often require the specification of "pivot features" that generalize across domains, which are selected by task-specific heuristics.
1 code implementation • ICCV 2021 • Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao
In the low-bit quantization field, training Binary Neural Networks (BNNs) is the extreme solution to ease the deployment of deep models on resource-constrained devices, having the lowest storage cost and significantly cheaper bit-wise operations compared to 32-bit floating-point counterparts.
1 code implementation • 9 Jul 2019 • Yuhang Ding, Hehe Fan, Mingliang Xu, Yi Yang
However, a problem of the adaptive selection is that, when an image has too many neighborhoods, it is more likely to attract other images as its neighborhoods.
1 code implementation • ECCV 2020 • Xiaolin Zhang, Yunchao Wei, Yi Yang
We learn a feature center for each category and realize the global feature consistency by forcing the object features to approach class-specific centers.
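The class-center consistency described in the entry above can be illustrated with a small sketch. It is an assumption-laden toy, not the paper's method: the centers here are computed as plain per-class means rather than learned, and the names `class_centers` and `center_loss` and the sample data are hypothetical.

```python
def class_centers(features, labels):
    # Average the feature vectors of each class to get one center per class.
    # (Assumption: the paper learns its centers; a mean is the simplest stand-in.)
    sums, counts = {}, {}
    for f, y in zip(features, labels):
        if y not in sums:
            sums[y] = [0.0] * len(f)
            counts[y] = 0
        sums[y] = [s + v for s, v in zip(sums[y], f)]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in sums}

def center_loss(features, labels, centers):
    # Mean squared distance between each feature and its class center.
    # Minimizing this pulls object features toward class-specific centers.
    total = 0.0
    for f, y in zip(features, labels):
        total += sum((v - c) ** 2 for v, c in zip(f, centers[y]))
    return total / len(features)

# Hypothetical toy features for two categories.
features = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0]]
labels = ["cat", "cat", "dog"]
centers = class_centers(features, labels)
loss = center_loss(features, labels, centers)
print(centers, loss)
```

A loss of this shape is typically added to the main classification objective with a small weight, so that consistency across images does not override discriminability.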
1 code implementation • 1 Jul 2022 • Naiyuan Liu, Xiaohan Wang, Xiaobo Li, Yi Yang, Yueting Zhuang
In this report, we present the ReLER@ZJU-Alibaba submission to the Ego4D Natural Language Queries (NLQ) Challenge in CVPR 2022.
Ranked #3 on Natural Language Queries on Ego4D
1 code implementation • ICCV 2023 • Jiahao Li, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
Our method includes an encoder-decoder transformer architecture to fuse 2D and 3D representations for achieving 2D$\&$3D aligned results in a coarse-to-fine manner and a novel 3D joint contrastive learning approach for adding explicitly global supervision for the 3D feature space.
1 code implementation • CVPR 2022 • Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang
To systematically measure the compositional generalizability of temporal grounding models, we introduce a new Compositional Temporal Grounding task and construct two new dataset splits, i.e., Charades-CG and ActivityNet-CG.
1 code implementation • CVPR 2022 • Yuanzhi Liang, Qianyu Feng, Linchao Zhu, Li Hu, Pan Pan, Yi Yang
Talking gesture generation is a practical yet challenging task which aims to synthesize gestures in line with speech.
Ranked #6 on Gesture Generation on TED Gesture Dataset
1 code implementation • 29 May 2023 • Shuai Zhao, Xiaohan Wang, Linchao Zhu, Yi Yang
Given a single test sample, the VLM is forced to maximize the CLIP reward between the input and sampled results from the VLM output distribution.
1 code implementation • 18 Sep 2023 • Kexin Li, Zongxin Yang, Lei Chen, Yi Yang, Jun Xiao
However, existing methods exhibit two limitations: 1) they address video temporal features and audio-visual interactive features separately, disregarding the inherent spatial-temporal dependence of combined audio and video, and 2) they inadequately introduce audio constraints and object-level information during the decoding stage, resulting in segmentation outcomes that fail to comply with audio directives.
1 code implementation • 30 Apr 2021 • Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang
It has been shown that deep neural networks are prone to overfitting on biased training data.
1 code implementation • CVPR 2021 • Ruijie Quan, Xin Yu, Yuanzhi Liang, Yi Yang
First, we propose a complementary cascaded network architecture, namely CCN, to remove rain streaks and raindrops in a unified framework.
1 code implementation • ICCV 2021 • Aming Wu, Rui Liu, Yahong Han, Linchao Zhu, Yi Yang
Secondly, domain-specific representations are introduced as the differences between the input and domain-invariant representations.
1 code implementation • 26 Jul 2022 • Wenhao Wang, Yifan Sun, Zongxin Yang, Yi Yang
While model ensemble is common, we show that combining the vision models and vision-language models brings particular benefits from their complementarity and is a key factor to our superiority.
1 code implementation • CVPR 2023 • Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou
To build Video Question Answering (VideoQA) systems capable of assisting humans in daily activities, seeking answers from long-form videos with diverse and complex events is a must.
Ranked #2 on Video Question Answering on AGQA 2.0 balanced
1 code implementation • 16 Apr 2022 • Yulei Lu, Yawei Luo, Li Zhang, Zheyang Li, Yi Yang, Jun Xiao
A thriving trend in domain adaptive segmentation is to generate high-quality pseudo labels for the target domain and retrain the segmentor on them.
1 code implementation • NAACL 2018 • Yi Yang, Ozan İrsoy, Kazi Shefaet Rahman
To the best of our knowledge, our work is the first one that employs the structured gradient tree boosting (SGTB) algorithm for collective entity disambiguation.
1 code implementation • 26 Aug 2021 • Wuyang Chen, Xinyu Gong, Junru Wu, Yunchao Wei, Humphrey Shi, Zhicheng Yan, Yi Yang, Zhangyang Wang
This work targets designing a principled and unified training-free framework for Neural Architecture Search (NAS), with high performance, low cost, and in-depth interpretation.
1 code implementation • CVPR 2022 • Yunqiu Xu, Yifan Sun, Zongxin Yang, Jiaxu Miao, Yi Yang
How to align the source and target domains is critical to the CDWSOD accuracy.
Ranked #1 on Weakly Supervised Object Detection on Clipart1k
1 code implementation • CVPR 2023 • Chao Wang, Zhedong Zheng, Ruijie Quan, Yifan Sun, Yi Yang
(2) The conventional paradigm usually focuses on mining the abnormal pattern of a superimposed image to separate the noise, which de facto conflicts with the primary image restoration task.
1 code implementation • 5 Feb 2024 • Sheng Luo, Wei Chen, Wanxin Tian, Rui Liu, Luanxuan Hou, Xiubao Zhang, Haifeng Shen, Ruiqi Wu, Shuyi Geng, Yi Zhou, Ling Shao, Yi Yang, Bojun Gao, Qun Li, Guobin Wu
Foundation models have indeed made a profound impact on various fields, emerging as pivotal components that significantly shape the capabilities of intelligent systems.
1 code implementation • ACL 2021 • James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T. Greg McKelvey, Hui Dai, Yi Yang, David Sontag
In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes.
1 code implementation • CVPR 2022 • Changlin Li, Bohan Zhuang, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang
First, we develop a strong manual baseline for progressive learning of ViTs, by introducing momentum growth (MoGrow) to bridge the gap brought by model growth.
1 code implementation • 20 Oct 2022 • Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang, Mingyi Chen, Zhaoyang Lian, YingYing Li, Lei Cheng, Huajun Chen
In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents.
1 code implementation • 4 Sep 2023 • Yunhong Lou, Linchao Zhu, Yaxiong Wang, Xiaohan Wang, Yi Yang
We present DiverseMotion, a new approach for synthesizing high-quality human motions conditioned on textual descriptions while preserving motion diversity. Despite the recent significant progress in text-based human motion generation, existing methods often prioritize fitting training motions at the expense of action diversity.
Ranked #3 on Motion Synthesis on HumanML3D (using extra training data)
1 code implementation • ICCV 2021 • Aming Wu, Yahong Han, Linchao Zhu, Yi Yang
Thus, we develop a new framework of few-shot object detection with universal prototypes (FSOD$^{up}$) that owns the merit of feature generalization towards novel objects.
Ranked #23 on Few-Shot Object Detection on MS-COCO (10-shot)
1 code implementation • IEEE Transactions on Image Processing (TIP) 2022 • Jinliang Lin, Zhedong Zheng, Zhun Zhong, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe
Inspired by the human visual system for mining local patterns, we propose a new framework called RK-Net to jointly learn the discriminative Representation and detect salient Keypoints with a single Network.
Ranked #2 on Drone navigation on University-1652
1 code implementation • 22 May 2023 • Kezhou Lin, Xiaohan Wang, Linchao Zhu, Ke Sun, Bang Zhang, Yi Yang
In this paper, we tackle the problem of sign language translation (SLT) without gloss annotations.
1 code implementation • ICCV 2023 • Tuo Feng, Wenguan Wang, Xiaohan Wang, Yi Yang, Qinghua Zheng
The mined patterns are, in turn, used to repaint the embedding space, so as to respect the underlying distribution of the entire training dataset and improve the robustness to the variations.
1 code implementation • 1 Nov 2021 • Weixin Xu, Zipeng Feng, Shuangkang Fang, Song Yuan, Yi Yang, Shuchang Zhou
For example, Transformer Networks do not have native support on many popular chips, and hence are difficult to deploy.
1 code implementation • IEEE Transactions on Neural Networks and Learning Systems 2022 • Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang
Second, we instantiate the loss function and provide a strong baseline for FGVC, where the performance of a naive backbone can be boosted and be comparable with recent methods.
Ranked #27 on Fine-Grained Image Classification on CUB-200-2011
Fine-Grained Image Classification Fine-Grained Visual Recognition
2 code implementations • 21 Mar 2017 • Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu, Zhilan Hu, Chenggang Yan, Yi Yang
Person re-identification (re-ID) and attribute recognition share a common target at learning pedestrian descriptions.
Ranked #75 on Person Re-Identification on DukeMTMC-reID
1 code implementation • 29 Mar 2022 • Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang
By contrast, pixel-level optimization is more explicit, however, it is sensitive to the visual quality of training data and is not robust to object deformation.
1 code implementation • 24 May 2022 • Wenhao Wang, Yifan Sun, Yi Yang
Moreover, this paper further reveals a unique difficulty for solving the hard negative problem in ICD, i.e., there is a fundamental conflict between current metric learning and ICD.
2 code implementations • 22 Aug 2018 • Yang He, Xuanyi Dong, Guoliang Kang, Yanwei Fu, Chenggang Yan, Yi Yang
With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable.
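The asymptotic pruning idea in the entry above can be sketched roughly as below. This is an illustration under stated assumptions rather than the authors' procedure: the linear ramp in `prune_rate`, the L2-norm ranking, and the zero-but-keep ("soft") treatment of pruned filters are choices made for the example, and all names and numbers are hypothetical.

```python
import math

def prune_rate(epoch, total_epochs, final_rate):
    # Hypothetical schedule: ramp the pruning rate linearly up to final_rate,
    # so few filters are pruned early and more are pruned later.
    return final_rate * min(1.0, (epoch + 1) / total_epochs)

def soft_prune(filters, rate):
    # Zero the fraction `rate` of filters with the smallest L2 norm.
    # Zeroed filters stay in the list, so later training could update them again.
    n_prune = int(len(filters) * rate)
    norms = [math.sqrt(sum(w * w for w in f)) for f in filters]
    order = sorted(range(len(filters)), key=lambda i: norms[i])
    pruned = set(order[:n_prune])
    return [[0.0] * len(f) if i in pruned else list(f)
            for i, f in enumerate(filters)]

# Hypothetical layer with four 2-weight filters.
filters = [[3.0, 4.0], [0.1, 0.1], [1.0, 0.0], [2.0, 2.0]]
for epoch in range(4):
    rate = prune_rate(epoch, total_epochs=4, final_rate=0.5)
    print(epoch, rate, soft_prune(filters, rate))
```

In a real training loop, each pruning step would be followed by an epoch of training, so the surviving filters gradually absorb the information carried by the zeroed ones.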
1 code implementation • 1 Sep 2021 • Chao Sun, Zhedong Zheng, Xiaohan Wang, Mingliang Xu, Yi Yang
Albeit simple, the pre-trained encoder can capture the key points of an unseen point cloud and surpasses the encoder trained from scratch on downstream tasks.
Ranked #43 on 3D Part Segmentation on ShapeNet-Part
1 code implementation • 22 May 2023 • Jinliang Deng, Xiusi Chen, Renhe Jiang, Du Yin, Yi Yang, Xuan Song, Ivor W. Tsang
The core issue in MTS forecasting is how to effectively model complex spatial-temporal patterns.
Ranked #1 on Time Series Forecasting on Weather (96)
2 code implementations • 12 Jun 2018 • Yaming Wang, Xiao Tan, Yi Yang, Xiao Liu, Errui Ding, Feng Zhou, Larry S. Davis
The new dataset is available at www.umiacs.umd.edu/~wym/3dpose.html
2 code implementations • 19 Oct 2018 • Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S. Davis
Existing 3D pose datasets of object categories are limited to generic object types and lack fine-grained information.
1 code implementation • 27 Mar 2022 • Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, Yi Yang
Our target is to learn visual correspondence from unlabeled videos.
1 code implementation • 6 Apr 2023 • Jiancan Wu, Yi Yang, Yuchun Qian, Yongduo Sui, Xiang Wang, Xiangnan He
Then, we identify why the traditional influence function fails for graph unlearning, and devise Graph Influence Function (GIF), a model-agnostic unlearning method that can efficiently and accurately estimate parameter changes in response to an $\epsilon$-mass perturbation in deleted data.
1 code implementation • 26 Apr 2023 • Bingqian Lin, Zicong Chen, Mingjie Li, Haokun Lin, Hang Xu, Yi Zhu, Jianzhuang Liu, Wenjia Cai, Lei Yang, Shen Zhao, Chenfei Wu, Ling Chen, Xiaojun Chang, Yi Yang, Lei Xing, Xiaodan Liang
In MOTOR, we combine two kinds of basic medical knowledge, i.e., general and specific knowledge, in a complementary manner to boost the general pretraining process.
1 code implementation • 19 Jan 2024 • Xiangpeng Yang, Linchao Zhu, Xiaohan Wang, Yi Yang
(2) Equipping the visual and text encoder with separated prompts failed to mitigate the visual-text modality gap.
1 code implementation • 22 Mar 2024 • Tuo Feng, Wenguan Wang, Fan Ma, Yi Yang
Consequently, it is essential to develop LiDAR perception methods that are both efficient and effective.
1 code implementation • 13 Jul 2017 • Linchao Zhu, Yanbin Liu, Yi Yang
In this paper, we present our solution to Google YouTube-8M Video Classification Challenge 2017.
1 code implementation • NeurIPS 2019 • Aming Wu, Linchao Zhu, Yahong Han, Yi Yang
Inspired by this idea, towards VCR, we propose a connective cognition network (CCN) to dynamically reorganize the visual neuron connectivity that is contextualized by the meaning of questions and answers.
1 code implementation • 5 Aug 2022 • Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei
In this work, we propose a new online VIS paradigm named Instance As Identity (IAI), which models temporal information for both detection and tracking in an efficient way.
2 code implementations • 20 Apr 2023 • Wenhao Wang, Yifan Sun, Yi Yang
Video Copy Detection (VCD) has been developed to identify instances of unauthorized or duplicated video content.
1 code implementation • 28 May 2023 • Wenjie Zhuo, Yifan Sun, Xiaohan Wang, Linchao Zhu, Yi Yang
Consequently, using multiple positive samples with enhanced diversity further improves contrastive learning due to better alignment.
1 code implementation • 16 Sep 2023 • Yi Yang, Qingwen Zhang, Thomas Gilles, Nazre Batool, John Folkesson
As the pretraining technique is growing in popularity, little work has been done on pretrained learning-based motion prediction methods in autonomous driving.
1 code implementation • 19 Oct 2023 • Barrett Martin Lattimer, Patrick Chen, Xinyuan Zhang, Yi Yang
We introduce SCALE (Source Chunking Approach for Large-scale inconsistency Evaluation), a task-agnostic model for detecting factual inconsistencies using a novel chunking strategy.
1 code implementation • 5 Sep 2019 • Yanbin Liu, Makoto Yamada, Yao-Hung Hubert Tsai, Tam Le, Ruslan Salakhutdinov, Yi Yang
To estimate the mutual information from data, a common practice is preparing a set of paired samples $\{(\mathbf{x}_i,\mathbf{y}_i)\}_{i=1}^n \stackrel{\mathrm{i. i. d.
2 code implementations • NAACL 2022 • Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan
To deal with this issue, in this paper, we propose a new strategy to perform textual backdoor attacks which do not require an external trigger, and the poisoned samples are correctly labeled.
1 code implementation • ICCV 2023 • Yuanyou Xu, Zongxin Yang, Yi Yang
Tracking any given object(s) spatially and temporally is a common purpose in Visual Object Tracking (VOT) and Video Object Segmentation (VOS).
Ranked #10 on Visual Object Tracking on LaSOT
1 code implementation • 10 Jul 2023 • Meng Li, Yahan Yu, Yi Yang, Guanghao Ren, Jian Wang
In this paper, we propose a deep learning-based character stroke extraction method that takes semantic features and prior information of strokes into consideration.
1 code implementation • ICCV 2023 • Lin Li, Guikun Chen, Jun Xiao, Yi Yang, Chunping Wang, Long Chen
Specifically, we first decompose each relation triplet feature into two components: intrinsic feature and extrinsic feature, which correspond to the intrinsic characteristics and extrinsic contexts of a relation triplet, respectively.
1 code implementation • 20 Nov 2023 • Zhiyuan Min, Yawei Luo, Wei Yang, Yuesong Wang, Yi Yang
Different from existing methods that consider cross-view and along-epipolar information independently, EVE-NeRF conducts the view-epipolar feature aggregation in an entangled manner by injecting the scene-invariant appearance continuity and geometry consistency priors to the aggregation process.
Ranked #1 on Generalizable Novel View Synthesis on Shiny dataset
2 code implementations • CVPR 2018 • Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas Huang
With such an adversarial learning, the two parallel-classifiers are forced to leverage complementary object regions for classification and can finally generate integral object localization together.
Ranked #2 on Weakly-Supervised Object Localization on ILSVRC 2016
1 code implementation • 30 Jan 2018 • Qingji Guan, Yaping Huang, Zhun Zhong, Zhedong Zheng, Liang Zheng, Yi Yang
This paper considers the task of thorax disease classification on chest X-ray images.
1 code implementation • CVPR 2016 • Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu
While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image.
1 code implementation • ECCV 2020 • Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang
Most previous work on adversarial attacks focuses on image models, while the vulnerability of video models is less explored.
1 code implementation • 31 May 2021 • Yuan Gan, Yawei Luo, Xin Yu, Bang Zhang, Yi Yang
In this paper, we investigate the task of hallucinating an authentic high-resolution (HR) human face from multiple low-resolution (LR) video snapshots.
1 code implementation • 9 Jun 2022 • Yi Yang, Yanqiao Zhu, Hejie Cui, Xuan Kan, Lifang He, Ying Guo, Carl Yang
Specifically, we propose to meta-train the model on datasets of large sample sizes and transfer the knowledge to small datasets.
1 code implementation • NeurIPS 2023 • Wenhao Wang, Yifan Sun, Wei Li, Yi Yang
This paper explores a hierarchical prompting mechanism for the hierarchical image classification (HIC) task.
1 code implementation • 20 May 2023 • Yi Yang, Hejie Cui, Carl Yang
The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways.
1 code implementation • 3 Jul 2023 • Chao Liang, Zongxin Yang, Linchao Zhu, Yi Yang
In real-world scenarios, collected and annotated data often exhibit the characteristics of multiple classes and long-tailed distribution.