Search Results for author: Yi Yang

Found 511 papers, 249 papers with code

Complex Event Detection via Multi-source Video Attributes

no code implementations • CVPR 2013 • Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, Nicu Sebe, Alexander G. Hauptmann

Compared to complex event videos, these external videos contain simple contents such as objects, scenes and actions which are the basic elements of complex events.

Event Detection

Harry Potter's Marauder's Map: Localizing and Tracking Multiple Persons-of-Interest by Nonnegative Discretization

no code implementations • CVPR 2013 • Shoou-I Yu, Yi Yang, Alexander Hauptmann

A device just like Harry Potter's Marauder's Map, which pinpoints the location of each person-of-interest at all times, provides invaluable information for analysis of surveillance videos.

Face Recognition Human Detection

Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text

no code implementations • NAACL 2013 • Yi Yang, Alexander Yates, Doug Downey

A Log-Linear Model for Unsupervised Text Normalization

no code implementations • EMNLP 2013 • Yi Yang, Jacob Eisenstein

Ranked #4 on Lexical Normalization on LexNorm

Language Modelling Lexical Normalization

Learning Representations for Weakly Supervised Natural Language Processing Tasks

no code implementations • CL 2014 • Fei Huang, Arun Ahuja, Doug Downey, Yi Yang, Yuhong Guo, Alexander Yates

Active Learning with Constrained Topic Model

no code implementations • WS 2014 • Yi Yang, Shimei Pan, Doug Downey, Kunpeng Zhang

Active Learning Topic Models

Decomposable Nonlocal Tensor Dictionary Learning for Multispectral Image Denoising

no code implementations • CVPR 2014 • Yi Peng, Deyu Meng, Zongben Xu, Chenqiang Gao, Yi Yang, Biao Zhang

As compared to the conventional RGB or gray-scale images, multispectral images (MSI) can deliver more faithful representation for real scenes, and enhance the performance of many computer vision tasks.

Dictionary Learning Image Denoising

Parsing Occluded People

no code implementations • CVPR 2014 • Golnaz Ghiasi, Yi Yang, Deva Ramanan, Charless C. Fowlkes

Occlusion poses a significant difficulty for object recognition due to the combinatorial diversity of possible occlusion patterns.

Object Recognition Pose Estimation

Event Detection using Multi-Level Relevance Labels and Multiple Features

no code implementations • CVPR 2014 • Zhongwen Xu, Ivor W. Tsang, Yi Yang, Zhigang Ma, Alexander G. Hauptmann

We address the challenging problem of utilizing related exemplars for complex event detection while multiple features are available.

Event Detection

Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout

no code implementations • ACL 2014 • Yi Yang, Jacob Eisenstein

Denoising Part-Of-Speech Tagging +2

Explain Images with Multimodal Recurrent Neural Networks

no code implementations • 4 Oct 2014 • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan L. Yuille

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel sentence descriptions to explain the content of images.

8k Retrieval +1

A Discriminative CNN Video Representation for Event Detection

no code implementations • CVPR 2015 • Zhongwen Xu, Yi Yang, Alexander G. Hauptmann

In this paper, we propose a discriminative video representation for event detection over a large scale video dataset when only limited hardware resources are available.

Event Detection

A Convex Formulation for Spectral Shrunk Clustering

no code implementations • 23 Nov 2014 • Xiaojun Chang, Feiping Nie, Zhigang Ma, Yi Yang, Xiaofang Zhou

Thus, applying manifold information obtained from the original space to the clustering process in a low-dimensional subspace is prone to inferior performance.

Clustering Dimensionality Reduction

Semi-supervised Feature Analysis by Mining Correlations among Multiple Tasks

no code implementations • 23 Nov 2014 • Xiaojun Chang, Yi Yang

In this paper, we propose a novel semi-supervised feature selection framework by mining correlations among multiple tasks and apply it to different multimedia applications.

feature selection

Balanced k-Means and Min-Cut Clustering

no code implementations • 23 Nov 2014 • Xiaojun Chang, Feiping Nie, Zhigang Ma, Yi Yang

Clustering is an effective technique in data mining to generate groups that are the matter of interest.

Clustering

A Convex Sparse PCA for Feature Analysis

no code implementations • 23 Nov 2014 • Xiaojun Chang, Feiping Nie, Yi Yang, Heng Huang

In addition, based on the sparse model used in CSPCA, an optimal weight is assigned to each of the original features, which in turn provides the output with good interpretability.

Dimensionality Reduction feature selection +1

Improved Spectral Clustering via Embedded Label Propagation

no code implementations • 23 Nov 2014 • Xiaojun Chang, Feiping Nie, Yi Yang, Heng Huang

Our algorithm is built upon two advancements of the state of the art: 1) label propagation, which propagates a node's labels to neighboring nodes according to their proximity; and 2) manifold learning, which has been widely used in its capacity to leverage the manifold structure of data points.

Clustering

Compound Rank-k Projections for Bilinear Analysis

no code implementations • 23 Nov 2014 • Xiaojun Chang, Feiping Nie, Sen Wang, Yi Yang, Xiaofang Zhou, Chengqi Zhang

In many real-world applications, data are represented by matrices or high-order tensors.

Unsupervised Domain Adaptation with Feature Embeddings

1 code implementation • 14 Dec 2014 • Yi Yang, Jacob Eisenstein

Representation learning is the dominant technique for unsupervised domain adaptation, but existing approaches often require the specification of "pivot features" that generalize across domains, which are selected by task-specific heuristics.

Representation Learning Unsupervised Domain Adaptation

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

2 code implementations • 20 Dec 2014 • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions.

8k Image Captioning +1

109

Group $K$-Means

no code implementations • 5 Jan 2015 • Jianfeng Wang, Shuicheng Yan, Yi Yang, Mohan S. Kankanhalli, Shipeng Li, Jingdong Wang

We study how to learn multiple dictionaries from a dataset, and approximate any data point by the sum of the codewords each chosen from the corresponding dictionary.

Depth-based hand pose estimation: methods, data, and challenges

no code implementations • 24 Apr 2015 • James Steven Supancic III, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan

To spur further progress we introduce a challenging new dataset with diverse, cluttered scenes.

Hand Pose Estimation

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images

1 code implementation • ICCV 2015 • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

In particular, we propose a transposed weight sharing scheme, which not only improves performance on image captioning, but also makes the model more suitable for the novel concept learning task.

Image Captioning Novel Concepts +1

109

Unsupervised Multi-Domain Adaptation with Feature Embeddings

1 code implementation • HLT 2015 • Jacob Eisenstein, Yi Yang

Representation Learning Unsupervised Domain Adaptation

DevNet: A Deep Event Network for Multimedia Event Detection and Evidence Recounting

no code implementations • CVPR 2015 • Chuang Gan, Naiyan Wang, Yi Yang, Dit-Yan Yeung, Alex G. Hauptmann

Taking key frames of videos as input, we first detect the event of interest at the video level by aggregating the CNN features of the key frames.

Action Recognition Event Detection +2

Learning From Massive Noisy Labeled Data for Image Classification

no code implementations • CVPR 2015 • Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang

To demonstrate the effectiveness of our approach, we collect a large-scale real-world clothing classification dataset with both noisy and clean labels.

Classification General Classification +1

Efficient Methods for Inferring Large Sparse Topic Hierarchies

no code implementations • IJCNLP 2015 • Doug Downey, Chandra Bhagavatula, Yi Yang

Topic Models

Indexing of CNN Features for Large Scale Image Search

no code implementations • 2 Aug 2015 • Ruoyu Liu, Yao Zhao, Shikui Wei, Yi Yang

The convolutional neural network (CNN) features can give a good description of image content, which usually represent images with unique global vectors.

Clustering Image Retrieval +2

Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models

no code implementations • 26 Aug 2015 • Yi Yang, Wei Qian, Hui Zou

The Tweedie GLM is a widely used method for predicting insurance premiums.

Methodology

Efficient Methods for Incorporating Knowledge into Topic Models

no code implementations • EMNLP 2015 • Yi Yang, Doug Downey, Jordan Boyd-Graber

Topic Models

WikiQA: A Challenge Dataset for Open-Domain Question Answering

no code implementations • EMNLP 2015 • Yi Yang, Wen-tau Yih, Christopher Meek

Ranked #21 on Question Answering on WikiQA

Answer Selection Open-Domain Question Answering

DenseBox: Unifying Landmark Localization with End to End Object Detection

2 code implementations • 16 Sep 2015 • Lichao Huang, Yi Yang, Yafeng Deng, Yinan Yu

How can a single fully convolutional neural network (FCN) perform on object detection?

Face Detection Multi-Task Learning +3

BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing

1 code implementation • 16 Oct 2015 • Linnan Wang, Wei Wu, Jianxiong Xiao, Yi Yang

Basic Linear Algebra Subprograms (BLAS) are a set of low level linear algebra kernels widely adopted by applications involved with the deep learning and scientific computing.

Distributed, Parallel, and Cluster Computing

Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks

no code implementations • CVPR 2016 • Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu

The sentence generator produces one simple short sentence that describes a specific short video interval.

Sentence Video Captioning

Attention to Scale: Scale-aware Semantic Image Segmentation

no code implementations • CVPR 2016 • Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille

We adapt a state-of-the-art semantic image segmentation model, which we jointly train with multi-scale input images and the attention model.

Image Segmentation Segmentation +1

Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning

no code implementations • CVPR 2016 • Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang

In this paper, we propose a new approach, namely Hierarchical Recurrent Neural Encoder (HRNE), to exploit temporal information of videos.

Image Classification Video Captioning

Uncovering Temporal Context for Video Question and Answering

no code implementations • 15 Nov 2015 • Linchao Zhu, Zhongwen Xu, Yi Yang, Alexander G. Hauptmann

In this work, we introduce Video Question Answering in temporal domain to infer the past, describe the present and predict the future.

Multiple-choice Question Answering +1

Overcoming Language Variation in Sentiment Analysis with Social Attention

1 code implementation • TACL 2017 • Yi Yang, Jacob Eisenstein

Variation in language is ubiquitous, particularly in newer forms of writing such as social media.

Sentiment Analysis

Depth-Based Hand Pose Estimation: Data, Methods, and Challenges

no code implementations • ICCV 2015 • James S. Supancic III, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan

To spur further progress we introduce a challenging new dataset with diverse, cluttered scenes.

Hand Pose Estimation

Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks

no code implementations • ICCV 2015 • Chunshui Cao, Xian-Ming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang

While feedforward deep convolutional neural networks (CNNs) have been a great success in computer vision, it is important to remember that the human visual cortex generally contains more feedback connections than forward connections.

Dynamic Concept Composition for Zero-Example Event Detection

no code implementations • 14 Jan 2016 • Xiaojun Chang, Yi Yang, Guodong Long, Chengqi Zhang, Alexander G. Hauptmann

In this paper, we focus on automatically detecting events in unconstrained videos without the use of any visual training exemplars.

Event Detection Zero-Shot Learning

Part-of-Speech Tagging for Historical English

no code implementations • NAACL 2016 • Yi Yang, Jacob Eisenstein

We evaluate several domain adaptation methods on the task of tagging Early Modern English and Modern British English texts in the Penn Corpora of Historical English.

Part-Of-Speech Tagging Unsupervised Domain Adaptation +1

Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent

no code implementations • 17 Mar 2016 • Linnan Wang, Yi Yang, Martin Renqiang Min, Srimat Chakradhar

Then we present the study of ISGD batch size to the learning rate, parallelism, synchronization cost, system saturation and scalability.

Fully Convolutional Attention Networks for Fine-Grained Recognition

no code implementations • 22 Mar 2016 • Xiao Liu, Tian Xia, Jiang Wang, Yi Yang, Feng Zhou, Yuanqing Lin

Fine-grained recognition is challenging due to its subtle local inter-class differences versus large intra-class variations such as poses.

Reinforcement Learning (RL)

Person Re-identification in the Wild

no code implementations • CVPR 2017 • Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian

Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification accuracy and assessing the effectiveness of different detectors for re-identification.

Benchmarking Pedestrian Detection +2

CNN-RNN: A Unified Framework for Multi-label Image Classification

1 code implementation • CVPR 2016 • Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu

While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image.

Classification General Classification +2

Long-Term Identity-Aware Multi-Person Tracking for Surveillance Video Summarization

no code implementations • 25 Apr 2016 • Shoou-I Yu, Yi Yang, Xuanchong Li, Alexander G. Hauptmann

Therefore, our tracker propagates identity information to frames without recognized faces by uncovering the appearance and spatial manifold formed by person detections.

Face Recognition Video Summarization

They Are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers

no code implementations • CVPR 2016 • Xiaojun Chang, Yao-Liang Yu, Yi Yang, Eric P. Xing

Complex event detection on unconstrained Internet videos has seen much progress in recent years.

Event Detection

You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images

no code implementations • CVPR 2016 • Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei

The Web images are then filtered by the learnt network and the selected images are additionally fed into the network to enhance the architecture and further trim the videos.

Action Recognition Event Detection +1

Strategies for Searching Video Content with Text Queries or Video Examples

no code implementations • 17 Jun 2016 • Shoou-I Yu, Yi Yang, Zhongwen Xu, Shicheng Xu, Deyu Meng, Zexi Mao, Zhigang Ma, Ming Lin, Xuanchong Li, Huan Li, Zhenzhong Lan, Lu Jiang, Alexander G. Hauptmann, Chuang Gan, Xingzhong Du, Xiaojun Chang

The large number of user-generated videos uploaded to the Internet every day has led to many commercial video search engines, which mainly rely on text metadata for search.

Event Detection Retrieval +1

SIFT Meets CNN: A Decade Survey of Instance Retrieval

1 code implementation • 5 Aug 2016 • Liang Zheng, Yi Yang, Qi Tian

This survey presents milestones in modern instance retrieval, reviews a broad selection of previous works in different categories, and provides insights on the connection between SIFT and CNN-based methods.

Content-Based Image Retrieval Retrieval

S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking

no code implementations • IJCNLP 2015 • Yi Yang, Ming-Wei Chang

Non-linear models have recently received a lot of attention as people are starting to discover the power of statistical and embedding features.

Entity Linking

Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities

no code implementations • EMNLP 2016 • Yi Yang, Ming-Wei Chang, Jacob Eisenstein

Entity linking is the task of identifying mentions of entities in text, and linking them to entries in a knowledge base.

Entity Linking Structured Prediction

Person Re-identification: Past, Present and Future

no code implementations • 10 Oct 2016 • Liang Zheng, Yi Yang, Alexander G. Hauptmann

Person re-identification (re-ID) has become increasingly popular in the community due to its application and research significance.

Ranked #83 on Person Re-Identification on DukeMTMC-reID

Image Classification Person Re-Identification +1

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs

no code implementations • 12 Oct 2016 • Chao Li, Yi Yang, Min Feng, Srimat Chakradhar, Huiyang Zhou

Leveraging large data sets, deep Convolutional Neural Networks (CNNs) achieve state-of-the-art recognition accuracy.

Computational Efficiency

A Discriminatively Learned CNN Embedding for Person Re-identification

4 code implementations • 17 Nov 2016 • Zhedong Zheng, Liang Zheng, Yi Yang

We revisit two popular convolutional neural networks (CNN) in person re-identification (re-ID), i.e., verification and classification models.

Ranked #1 on Person Re-Identification on Market-1501+500k

General Classification Image Retrieval +2

265

Bidirectional Multirate Reconstruction for Temporal Modeling in Videos

no code implementations • CVPR 2017 • Linchao Zhu, Zhongwen Xu, Yi Yang

This learning process makes the learned model more capable of dealing with motion speed variance.

Event Detection Video Captioning

Few-Shot Object Recognition from Machine-Labeled Web Images

no code implementations • CVPR 2017 • Zhongwen Xu, Linchao Zhu, Yi Yang

Then, we demonstrate that with our model, machine-labeled image annotations are very effective and abundant resources to perform object recognition on novel categories.

Few-Shot Learning Object +1

Personalized Video Recommendation Using Rich Contents from Videos

1 code implementation • 21 Dec 2016 • Xingzhong Du, Hongzhi Yin, Ling Chen, Yang Wang, Yi Yang, Xiaofang Zhou

In the existing video recommender systems, the models make the recommendations based on the user-video interactions and single specific content features.

Recommendation Systems

Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro

8 code implementations • ICCV 2017 • Zhedong Zheng, Liang Zheng, Yi Yang

We verify the proposed method on a practical problem: person re-identification (re-ID).

Ranked #4 on Person Re-Identification on CUHK03

Fine-Grained Image Classification Generative Adversarial Network +2

321

Pose Invariant Embedding for Deep Person Re-identification

no code implementations • 26 Jan 2017 • Liang Zheng, Yujia Huang, Huchuan Lu, Yi Yang

Second, to reduce the impact of pose estimation errors and information loss during PoseBox construction, we design a PoseBox fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input.

Person Re-Identification Pose Estimation +1

A New Evaluation Protocol and Benchmarking Results for Extendable Cross-media Retrieval

no code implementations • 10 Mar 2017 • Ruoyu Liu, Yao Zhao, Liang Zheng, Shikui Wei, Yi Yang

Additionally, a trivial solution, i.e., directly using the predicted class label for cross-media retrieval, is tested.

Benchmarking Image Retrieval +1

Twitter100k: A Real-world Dataset for Weakly Supervised Cross-Media Retrieval

no code implementations • 20 Mar 2017 • Yuting Hu, Liang Zheng, Yi Yang, Yongfeng Huang

Second, texts in these datasets are written in well-organized language, leading to inconsistency with realistic applications.

Optical Character Recognition (OCR) Retrieval +1

Improving Person Re-identification by Attribute and Identity Learning

2 code implementations • 21 Mar 2017 • Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu, Zhilan Hu, Chenggang Yan, Yi Yang

Person re-identification (re-ID) and attribute recognition share a common target at learning pedestrian descriptions.

Ranked #75 on Person Re-Identification on DukeMTMC-reID

Attribute Person Recognition +2

An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning

no code implementations • 22 Mar 2017 • Fan Wu, Zhongwen Xu, Yi Yang

We propose an end-to-end approach to the natural language object retrieval task, which localizes an object within an image according to a natural language description, i.e., referring expression.

Object Referring Expression +1

More is Less: A More Complicated Network with Less Inference Complexity

no code implementations • CVPR 2017 • Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity.

Dynamic Computational Time for Visual Attention

1 code implementation • 30 Mar 2017 • Zhichao Li, Yi Yang, Xiao Liu, Feng Zhou, Shilei Wen, Wei Xu

We propose a dynamic computational time model to accelerate the average processing time for recurrent visual attention (RAM).

Reinforcement Learning (RL)

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

2 code implementations • 14 May 2017 • Shuchang Zhou, Taihong Xiao, Yi Yang, Dieqiao Feng, Qinyao He, Weiran He

In this work, we propose a model that can learn object transfiguration from two unpaired sets of images: one set containing images that "have" that kind of object, and the other set being the opposite, with the mild constraint that the objects be located approximately at the same place.

Attribute Conditional Image Generation +1

144

Unsupervised Learning Layers for Video Analysis

no code implementations • 24 May 2017 • Liang Zhao, Yang Wang, Yi Yang, Wei Xu

This paper presents two unsupervised learning layers (UL layers) for label-free video analysis: one for fully connected layers, and the other for convolutional ones.

Object Localization

Unsupervised Person Re-identification: Clustering and Fine-tuning

1 code implementation • 30 May 2017 • Hehe Fan, Liang Zheng, Yi Yang

Progressively, pedestrian clustering and the CNN model are improved simultaneously until algorithm convergence.

Ranked #12 on Unsupervised Person Re-Identification on DukeMTMC-reID

Clustering Unsupervised Person Re-Identification

218

Few-Example Object Detection with Model Communication

1 code implementation • 26 Jun 2017 • Xuanyi Dong, Liang Zheng, Fan Ma, Yi Yang, Deyu Meng

Experiments on PASCAL VOC'07, MS COCO'14, and ILSVRC'13 indicate that by using as few as three or four samples selected for each category, our method produces very competitive results when compared to the state-of-the-art weakly-supervised approaches using a large number of image-level labels.

Ranked #1 on Weakly Supervised Object Detection on MS COCO

Object object-detection

Pedestrian Alignment Network for Large-scale Person Re-identification

1 code implementation • 3 Jul 2017 • Zhedong Zheng, Liang Zheng, Yi Yang

This task aims to search a query person in a large image pool.

Ranked #1 on Person Re-Identification on CUHK03 (detected)

Image Retrieval Large-Scale Person Re-Identification +1

234

UTS submission to Google YouTube-8M Challenge 2017

1 code implementation • 13 Jul 2017 • Linchao Zhu, Yanbin Liu, Yi Yang

In this paper, we present our solution to Google YouTube-8M Video Classification Challenge 2017.

Classification General Classification +1

PatchShuffle Regularization

no code implementations • 22 Jul 2017 • Guoliang Kang, Xuanyi Dong, Liang Zheng, Yi Yang

This paper focuses on regularizing the training of the convolutional neural network (CNN).

General Classification

Robust PCA by Manifold Optimization

no code implementations • 1 Aug 2017 • Teng Zhang, Yi Yang

Robust PCA is a widely used statistical procedure to recover an underlying low-rank matrix with grossly corrupted observations.

Random Erasing Data Augmentation

18 code implementations • 16 Aug 2017 • Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, Yi Yang

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN).

Ranked #4 on Image Classification on Fashion-MNIST

General Classification Image Augmentation +4

29,826

An Improved Residual LSTM Architecture for Acoustic Modeling

no code implementations • 17 Aug 2017 • Lu Huang, Jiasong Sun, Ji Xu, Yi Yang

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems.

Automatic Speech Recognition (ASR) +1

EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural Networks

no code implementations • 22 Sep 2017 • Xuanyi Dong, Guoliang Kang, Kun Zhan, Yi Yang

For most state-of-the-art architectures, Rectified Linear Unit (ReLU) becomes a standard component accompanied with each layer.

Ranked #12 on Image Classification on SVHN

Blocking Image Classification

Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition

no code implementations • ICCV 2017 • Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen

The designed ReST has an intrinsic recursive structure and is capable of progressively aligning faces to a canonical one, even those with large variations.

Face Alignment Face Recognition

Learning Discriminative Latent Attributes for Zero-Shot Classification

no code implementations • ICCV 2017 • Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen

Zero-shot learning (ZSL) aims to transfer knowledge from observed classes to the unseen classes, based on the assumption that both the seen and unseen classes share a common semantic space, among which attributes enjoy a great popularity.

Attribute Classification +3

Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos

no code implementations • ICCV 2017 • Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann

relevant) to the given event class, we formulate this task as a multi-instance learning (MIL) problem by taking each video as a bag and the video shots in each video as instances.

Event Detection

Nanophotonic Particle Simulation and Inverse Design Using Artificial Neural Networks

1 code implementation • 18 Oct 2017 • John Peurifoy, Yichen Shen, Li Jing, Yi Yang, Fidel Cano-Renteria, Brendan Delacy, Max Tegmark, John D. Joannopoulos, Marin Soljacic

We propose a method to use artificial neural networks to approximate light scattering by multilayer nanoparticles.

Computational Physics Applied Physics Optics

Dual-Path Convolutional Image-Text Embeddings with Instance Loss

2 code implementations • 15 Nov 2017 • Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, Mingliang Xu, Yi-Dong Shen

In this paper, we propose a new system to discriminatively embed the image and text to a shared visual-textual space.

Ranked #1 on Cross-Modal Retrieval on CUHK-PEDES

Content-Based Image Retrieval Cross-Modal Retrieval +4

280

Occlusion Aware Unsupervised Learning of Optical Flow

no code implementations • CVPR 2018 • Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu

Especially on KITTI dataset where abundant unlabeled samples exist, our unsupervised method outperforms its counterpart trained with supervised learning.

Optical Flow Estimation

Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification

2 code implementations • CVPR 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao

To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image.

Ranked #3 on Unsupervised Person Re-Identification on MSMT17->DukeMTMC-reID

Generative Adversarial Network Person Re-Identification +2

3,149

Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)

29 code implementations • ECCV 2018 • Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, Shengjin Wang

RPP re-assigns these outliers to the parts they are closest to, resulting in refined parts with enhanced within-part consistency.

Ranked #3 on Person Re-Identification on UAV-Human

Person Re-Identification Person Retrieval +1

483

Camera Style Adaptation for Person Re-identification

10 code implementations • CVPR 2018 • Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, Yi Yang

In this paper, we explicitly consider this challenge by introducing camera style (CamStyle) adaptation.

Ranked #71 on Person Re-Identification on DukeMTMC-reID

Data Augmentation Person Re-Identification +1

284

Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task

no code implementations • IJCNLP 2017 • Yi Yang, Pengjun Xie, Jun Tao, Guangwei Xu, Linlin Li, Luo Si

This paper introduces Alibaba NLP team system on IJCNLP 2017 shared task No.

Ranked #1 on 2D Human Pose Estimation on Alibaba Cluster Trace (using extra training data)

2D Human Pose Estimation Position

Prognostication of chronic disorders of consciousness using brain functional networks and clinical characteristics

no code implementations • 10 Jan 2018 • Ming Song, Yi Yang, Jianghong He, Zhengyi Yang, Shan Yu, Qiuyou Xie, Xiaoyu Xia, Yuanyuan Dang, Qiang Zhang, Xinhuai Wu, Yue Cui, Bing Hou, Ronghao Yu, Ruxiang Xu, Tianzi Jiang

Disorders of consciousness are a heterogeneous mixture of different diseases or injuries.

Neurons and Cognition

Paper
Add Code

Diagnose like a Radiologist: Attention Guided Convolutional Neural Network for Thorax Disease Classification

1 code implementation • 30 Jan 2018 • Qingji Guan, Yaping Huang, Zhun Zhong, Zhedong Zheng, Liang Zheng, Yi Yang

This paper considers the task of thorax disease classification on chest X-ray images.

General Classification

Paper
Code

Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization

no code implementations • ECCV 2018 • Guoliang Kang, Liang Zheng, Yan Yan, Yi Yang

Second, we estimate the posterior label distribution of the unlabeled data for target network training.

Unsupervised Domain Adaptation

Paper
Add Code

Collective Entity Disambiguation with Structured Gradient Tree Boosting

1 code implementation • NAACL 2018 • Yi Yang, Ozan İrsoy, Kazi Shefaet Rahman

To the best of our knowledge, our work is the first one that employs the structured gradient tree boosting (SGTB) algorithm for collective entity disambiguation.

Entity Disambiguation

Paper
Code

Style Aggregated Network for Facial Landmark Detection

1 code implementation • CVPR 2018 • Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang

In this work, we propose a style-aggregated approach to deal with the large intrinsic variance of image styles for facial landmark detection.

Ranked #2 on Facial Landmark Detection on AFLW-Front (Mean NME metric)

Face Alignment Facial Landmark Detection

917

Paper
Code

Decoupled Novel Object Captioner

1 code implementation • 11 Apr 2018 • Yu Wu, Linchao Zhu, Lu Jiang, Yi Yang

Thus, the sequence model can be decoupled from the novel object descriptions.

Image Captioning Novel Concepts +2

Paper
Code

Adversarial Complementary Learning for Weakly Supervised Object Localization

2 code implementations • CVPR 2018 • Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas Huang

With such an adversarial learning, the two parallel-classifiers are forced to leverage complementary object regions for classification and can finally generate integral object localization together.

Ranked #2 on Weakly-Supervised Object Localization on ILSVRC 2016

General Classification Object +1

Paper
Code

Deploy Large-Scale Deep Neural Networks in Resource Constrained IoT Devices with Local Quantization Region

no code implementations • 24 May 2018 • Yi Yang, Andy Chen, Xiaoming Chen, Jiang Ji, Zhenyang Chen, Yan Dai

Implementing large-scale deep neural networks with high computational complexity on low-cost IoT devices may inevitably be constrained by limited computation resource, making the devices hard to respond in real-time.

Quantization

Paper
Add Code

Learning to Propagate Labels: Transductive Propagation Network for Few-shot Learning

2 code implementations • ICLR 2019 • Yanbin Liu, Juho Lee, Minseop Park, Saehoon Kim, Eunho Yang, Sung Ju Hwang, Yi Yang

The goal of few-shot learning is to learn a classifier that generalizes well even when trained with a limited number of training instances per class.

Ranked #5 on Few-Shot Image Classification on Mini-Imagenet 10-way (1-shot)

Few-Shot Image Classification Few-Shot Learning +2

240

Paper
Code

Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning

no code implementations • CVPR 2018 • Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang

We focus on one-shot learning for video-based person re-identification (re-ID).

One-Shot Learning Pedestrian Detection +1

Paper
Add Code

Improve Neural Entity Recognition via Multi-Task Data Selection and Constrained Decoding

no code implementations • NAACL 2018 • Huasha Zhao, Yi Yang, Qiong Zhang, Luo Si

Entity recognition is a widely benchmarked task in natural language processing due to its massive applications.

Domain Adaptation Machine Reading Comprehension +3

Paper
Add Code

3D Pose Estimation for Fine-Grained Object Categories

2 code implementations • 12 Jun 2018 • Yaming Wang, Xiao Tan, Yi Yang, Xiao Liu, Errui Ding, Feng Zhou, Larry S. Davis

The new dataset is available at www.umiacs.umd.edu/~wym/3dpose.html

3D Pose Estimation Object

Paper
Code

Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

1 code implementation • CVPR 2018 • Xuanyi Dong, Shoou-I Yu, Xinshuo Weng, Shih-En Wei, Yi Yang, Yaser Sheikh

In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video.

Ranked #1 on Facial Landmark Detection on 300-VW (C)

Facial Landmark Detection Optical Flow Estimation

756

Paper
Code

Macro-Micro Adversarial Network for Human Parsing

1 code implementation • ECCV 2018 • Yawei Luo, Zhedong Zheng, Liang Zheng, Tao Guan, Junqing Yu, Yi Yang

To address the two kinds of inconsistencies, this paper proposes the Macro-Micro Adversarial Net (MMAN).

Ranked #12 on Semantic Segmentation on LIP val

Human Parsing Human Part Segmentation +1

208

Paper
Code

Self-produced Guidance for Weakly-supervised Object Localization

1 code implementation • ECCV 2018 • Xiaolin Zhang, Yunchao Wei, Guoliang Kang, Yi Yang, Thomas Huang

A stagewise approach is proposed to incorporate highly confident object regions to learn the SPG masks.

Ranked #1 on Weakly-Supervised Object Localization on ILSVRC 2015

Classification General Classification +2

147

Paper
Code

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

6 code implementations • 21 Aug 2018 • Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, Yi Yang

Therefore, the network trained by our method has a larger model capacity to learn from the training data.

375

Paper
Code

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks

2 code implementations • 22 Aug 2018 • Yang He, Xuanyi Dong, Guoliang Kang, Yanwei Fu, Chenggang Yan, Yi Yang

With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable.

Image Classification

Paper
Code

Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions

no code implementations • 27 Aug 2018 • Ke Ning, Linchao Zhu, Ming Cai, Yi Yang, Di Xie, Fei Wu

We validate the effectiveness of our ASST on two large-scale datasets.

Translation Video Description

Paper
Add Code

Convolutional Neural Networks with Recurrent Neural Filters

2 code implementations • EMNLP 2018 • Yi Yang

We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters.

Ranked #11 on Sentiment Analysis on SST-5 Fine-grained classification

Sentence Sentiment Analysis

Paper
Code

A Unified Analysis of Stochastic Momentum Methods for Deep Learning

no code implementations • 30 Aug 2018 • Yan Yan, Tianbao Yang, Zhe Li, Qihang Lin, Yi Yang

However, their theoretical analysis of convergence of the training objective and the generalization error for prediction is still under-explored.

Paper
Add Code

RCAA: Relational Context-Aware Agents for Person Search

no code implementations • ECCV 2018 • Xiaojun Chang, Po-Yao Huang, Yi-Dong Shen, Xiaodan Liang, Yi Yang, Alexander G. Hauptmann

In this paper, we address this problem by training relational context-aware agents which learn the actions to localize the target person from the gallery of whole scene images.

Person Search

Paper
Add Code

Compound Memory Networks for Few-shot Video Classification

no code implementations • ECCV 2018 • Linchao Zhu, Yi Yang

In this paper, we propose a new memory network structure for few-shot video classification by making the following contributions.

Classification General Classification +1

Paper
Add Code

Generalizing A Person Retrieval Model Hetero- and Homogeneously

1 code implementation • ECCV 2018 • Zhun Zhong, Liang Zheng, Shaozi Li, Yi Yang

Person re-identification (re-ID) poses unique challenges for unsupervised domain adaptation (UDA) in that classes in the source and target sets (domains) are entirely different and that image variations are largely caused by cameras.

Person Re-Identification Person Retrieval +2

130

Paper
Code

Query Attack via Opposite-Direction Feature: Towards Robust Image Retrieval

2 code implementations • 7 Sep 2018 • Zhedong Zheng, Liang Zheng, Yi Yang, Fei Wu

Opposite-Direction Feature Attack (ODFA) effectively exploits feature-level adversarial gradients and takes advantage of feature distance in the representation space.

Adversarial Attack General Classification +3

Paper
Code

DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments

2 code implementations • 22 Sep 2018 • Chao Yu, Zuxin Liu, Xinjun Liu, Fugui Xie, Yi Yang, Qi Wei, Qiao Fei

It is one of the state-of-the-art SLAM systems in high-dynamic environments.

Robotics

655

Paper
Code

Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation

1 code implementation • CVPR 2019 • Yawei Luo, Liang Zheng, Tao Guan, Junqing Yu, Yi Yang

We consider the problem of unsupervised domain adaptation in semantic segmentation.

Ranked #8 on Semantic Segmentation on DADA-seg

Semantic Segmentation Synthetic-to-Real Translation +1

290

Paper
Code

Every Node Counts: Self-Ensembling Graph Convolutional Networks for Semi-Supervised Learning

1 code implementation • 26 Sep 2018 • Yawei Luo, Tao Guan, Junqing Yu, Ping Liu, Yi Yang

To capitalize on the information from unlabeled nodes to boost the training for GCN, we propose a novel framework named Self-Ensembling GCN (SEGCN), which marries GCN with Mean Teacher - another powerful model in semi-supervised learning.

Ranked #4 on Node Classification on Cora: fixed 20 node per class

General Classification Node Classification

Paper
Code

Learning Discriminators as Energy Networks in Adversarial Learning

no code implementations • ICLR 2019 • Pingbo Pan, Yan Yan, Tianbao Yang, Yi Yang

In this work, we propose to refine the predictions of structured prediction models by effectively integrating discriminative models into the prediction.

Image Segmentation Multi-Label Classification +2

Paper
Add Code

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

1 code implementation • 8 Oct 2018 • Yang Wang, Zhenheng Yang, Peng Wang, Yi Yang, Chenxu Luo, Wei Xu

Then the whole scene is decomposed into moving foreground and static background by comparing the estimated optical flow and rigid flow derived from the depth and ego-motion.

Motion Estimation Optical Flow Estimation

128

Paper
Code

Improving Annotation for 3D Pose Dataset of Fine-Grained Object Categories

2 code implementations • 19 Oct 2018 • Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S. Davis

Existing 3D pose datasets of object categories are limited to generic object types and lack fine-grained information.

3D Pose Estimation Object +1

Paper
Code

SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation

1 code implementation • 22 Oct 2018 • Xiaolin Zhang, Yunchao Wei, Yi Yang, Thomas Huang

In this way, the possibilities embedded in the produced similarity maps can be adapted to guide the process of segmenting objects.

Ranked #89 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot)

Few-Shot Semantic Segmentation Segmentation +1

111

Paper
Code

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

3 code implementations • CVPR 2019 • Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang

In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.

Image Classification

38,618

Paper
Code

Zero-Shot Transfer VQA Dataset

no code implementations • 2 Nov 2018 • Yuanpeng Li, Yi Yang, Jian-Yu Wang, Wei Xu

Therefore, to accelerate this research, we propose a new zero-shot transfer VQA (ZST-VQA) dataset by reorganizing the existing VQA v1.0 dataset in the way that during training, some words appear only in one module (i.e., questions) but not in the other (i.e., answers).

Question Answering Transfer Learning +1

Paper
Add Code

Deep Unfolded Robust PCA with Application to Clutter Suppression in Ultrasound

no code implementations • 20 Nov 2018 • Oren Solomon, Regev Cohen, Yi Zhang, Yi Yang, He Qiong, Jianwen Luo, Ruud J. G. van Sloun, Yonina C. Eldar

We compare the performance of the suggested deep network on both simulations and in-vivo rat brain scans, with a commonly practiced deep-network architecture and the fast iterative shrinkage algorithm, and show that our architecture exhibits better image quality and contrast.

Super-Resolution

Paper
Add Code

Similarity-preserving Image-image Domain Adaptation for Person Re-identification

no code implementations • 26 Nov 2018 • Weijian Deng, Liang Zheng, Qixiang Ye, Yi Yang, Jianbin Jiao

It first preserves two types of unsupervised similarity, namely, self-similarity of an image before and after translation, and domain-dissimilarity of a translated source image and a target image.

Domain Adaptation Generative Adversarial Network +2

Paper
Add Code

Contrastive Adaptation Network for Unsupervised Domain Adaptation

2 code implementations • CVPR 2019 • Guoliang Kang, Lu Jiang, Yi Yang, Alexander G. Hauptmann

Unsupervised Domain Adaptation (UDA) makes predictions for the target domain data while manual annotations are only available in the source domain.

Ranked #7 on Domain Adaptation on Office-31

Unsupervised Domain Adaptation

316

Paper
Code

Auto-ReID: Searching for a Part-aware ConvNet for Person Re-Identification

3 code implementations • ICCV 2019 • Ruijie Quan, Xuanyi Dong, Yu Wu, Linchao Zhu, Yi Yang

We propose to automatically search for a CNN architecture that is specifically suitable for the reID task.

Ranked #9 on Person Re-Identification on CUHK03 detected

Classification General Classification +3

1,548

Paper
Code

Significance-aware Information Bottleneck for Domain Adaptive Semantic Segmentation

no code implementations • ICCV 2019 • Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang

For unsupervised domain adaptation problems, the strategy of aligning the two domains in latent feature space through adversarial learning has achieved much progress in image classification, but usually fails in semantic segmentation tasks in which the latent representations are overcomplex.

Image Classification Segmentation +2

Paper
Add Code

Operation-aware Neural Networks for User Response Prediction

4 code implementations • 2 Apr 2019 • Yi Yang, Baile Xu, Furao Shen, Jian Zhao

Many deep models are proposed to automatically learn high-order feature interactions.

7,353

Paper
Code

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis

4 code implementations • CVPR 2019 • Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang

If the initial image is not well initialized, the following processes can hardly refine the image to a satisfactory quality.

Ranked #6 on Text-to-Image Generation on CUB (Inception score metric)

Generative Adversarial Network Text-to-Image Generation

182

Paper
Code

Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification

2 code implementations • CVPR 2019 • Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, Yi Yang

To achieve this goal, an exemplar memory is introduced to store features of the target domain and accommodate the three invariance properties.

Ranked #3 on Unsupervised Person Re-Identification on DukeMTMC-reID->Market-1501

Domain Adaptive Person Re-Identification Person Re-Identification +1

305

Paper
Code

Filter Pruning by Switching to Neighboring CNNs with Good Attributes

no code implementations • 8 Apr 2019 • Yang He, Ping Liu, Linchao Zhu, Yi Yang

In addition, when evaluating the filter importance, only the magnitude information of the filters is considered.

Attribute Image Classification

Paper
Add Code

Revisiting EmbodiedQA: A Simple Baseline and Beyond

no code implementations • 8 Apr 2019 • Yu Wu, Lu Jiang, Yi Yang

In this paper, we empirically study this problem and introduce 1) a simple yet effective baseline that achieves promising performance; 2) an easier and practical setting for EmbodiedQA where an agent has a chance to adapt the trained model to a new environment before it actually answers users' questions.

Embodied Question Answering Question Answering

Paper
Add Code

Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation

no code implementations • CVPR 2019 • Fengda Zhu, Linchao Zhu, Yi Yang

Specifically, our method employs an adversarial feature adaptation model for visual representation transfer and a policy mimic strategy for policy behavior imitation.

Paper
Add Code

Joint Discriminative and Generative Learning for Person Re-identification

12 code implementations • CVPR 2019 • Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz

To this end, we propose a joint learning framework that couples re-id learning and data generation end-to-end.

Ranked #1 on Person Re-Identification on UAV-Human

Image-to-Image Translation Unsupervised Domain Adaptation +1

3,955

Paper
Code

Cubic LSTMs for Video Prediction

no code implementations • 20 Apr 2019 • Hehe Fan, Linchao Zhu, Yi Yang

Predicting future frames in videos has become a promising direction of research for both computer vision and robot learning communities.

motion prediction Video Prediction

Paper
Add Code

Network Pruning via Transformable Architecture Search

4 code implementations • NeurIPS 2019 • Xuanyi Dong, Yi Yang

The maximum probability for the size in each distribution serves as the width and depth of the pruned network, whose parameters are learned by knowledge transfer, e.g., knowledge distillation, from the original networks.

Ranked #1 on Network Pruning on CIFAR-10

Knowledge Distillation Network Pruning +2

1,548

Paper
Code

Syntax-Infused Variational Autoencoder for Text Generation

no code implementations • ACL 2019 • Xinyuan Zhang, Yi Yang, Siyang Yuan, Dinghan Shen, Lawrence Carin

We present a syntax-infused variational autoencoder (SIVAE), that integrates sentences with their syntactic trees to improve the grammar of generated sentences.

Sentence Text Generation

Paper
Add Code

Query-efficient Meta Attack to Deep Neural Networks

1 code implementation • ICLR 2020 • Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng

Black-box attack methods aim to infer suitable attack patterns to targeted DNN models by only using output feedback of the models and the corresponding input queries.

Adversarial Attack Meta-Learning

Paper
Code

FASTER Recurrent Networks for Efficient Video Classification

no code implementations • 10 Jun 2019 • Linchao Zhu, Laura Sevilla-Lara, Du Tran, Matt Feiszli, Yi Yang, Heng Wang

FASTER aims to leverage the redundancy between neighboring clips and reduce the computational cost by learning to aggregate the predictions from models of different complexities.

Ranked #26 on Action Recognition on UCF101

Action Classification Action Recognition +3

Paper
Add Code

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019

no code implementations • 22 Jun 2019 • Xiaohan Wang, Yu Wu, Linchao Zhu, Yi Yang

In this report, we present the Baidu-UTS submission to the EPIC-Kitchens Action Recognition Challenge in CVPR 2019.

Action Recognition Object +2

Paper
Add Code

A Semi-Markov Structured Support Vector Machine Model for High-Precision Named Entity Recognition

no code implementations • ACL 2019 • Ravneet Arora, Chen-Tse Tsai, Ketevan Tsereteli, Prabhanjan Kambadur, Yi Yang

Named entity recognition (NER) is the backbone of many NLP solutions.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues

1 code implementation • ACL 2019 • Yu Qin, Yi Yang

Prior research has shown that textual information in a firm's financial statement can be used to predict its stock's risk level.

118

Paper
Code

Adaptive Exploration for Unsupervised Person Re-Identification

1 code implementation • 9 Jul 2019 • Yuhang Ding, Hehe Fan, Mingliang Xu, Yi Yang

However, a problem of the adaptive selection is that, when an image has too many neighborhoods, it is more likely to attract other images as its neighborhoods.

Unsupervised Person Re-Identification

Paper
Code

Learning to Adapt Invariance in Memory for Person Re-identification

no code implementations • 1 Aug 2019 • Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, Yi Yang

This work considers the problem of unsupervised domain adaptation in person re-identification (re-ID), which aims to transfer knowledge from the source domain to the target domain.

Ranked #7 on Unsupervised Domain Adaptation on Market to MSMT

Person Re-Identification Unsupervised Domain Adaptation

Paper
Add Code

Cascaded Revision Network for Novel Object Captioning

1 code implementation • 6 Aug 2019 • Qianyu Feng, Yu Wu, Hehe Fan, Chenggang Yan, Yi Yang

By this novel cascaded captioning-revising mechanism, CRN can accurately describe images with unseen objects.

Image Captioning Object +3

Paper
Code

Attract or Distract: Exploit the Margin of Open Set

1 code implementation • ICCV 2019 • Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang

In this paper, we exploit the semantic structure of open set data from two aspects: 1) Semantic Categorical Alignment, which aims to achieve good separability of target known classes by categorically aligning the centroid of target with the source.

Domain Adaptation

Paper
Code

An Algorithm for Graph-Fused Lasso Based on Graph Decomposition

1 code implementation • 6 Aug 2019 • Feng Yu, Yi Yang, Teng Zhang

In comparison, this work proposes to decompose the objective function into two components, where one component is the loss function plus part of the total variation penalty, and the other component is the remaining total variation penalty.

Optimization and Control Computation

Paper
Code

Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection

2 code implementations • ICCV 2019 • Xuanyi Dong, Yi Yang

A typical approach is to (1) train a detector on the labeled images; (2) generate new training samples using this detector's prediction as pseudo labels of unlabeled images; (3) retrain the detector on the labeled samples and partial pseudo labeled samples.

Ranked #1 on Facial Landmark Detection on 300W (Full) (using extra training data)

Facial Landmark Detection

917

Paper
Code

Recognizing Part Attributes with Insufficient Data

1 code implementation • ICCV 2019 • Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu

Although great progress has been made to apply object-level recognition, recognizing the attributes of parts remains less applicable since the training data for part attributes recognition is usually scarce especially for internet-scale applications.

Attribute

Paper
Code

Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning

no code implementations • ECCV 2020 • Linchao Zhu, Sercan O. Arik, Yi Yang, Tomas Pfister

We propose a novel adaptive transfer learning framework, learning to transfer learn (L2TL), to improve performance on a target dataset by careful extraction of the related information from a source dataset.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Dialog Intent Induction with Deep Multi-View Clustering

1 code implementation • IJCNLP 2019 • Hugh Perkins, Yi Yang

We introduce the dialog intent induction task and present a novel deep multi-view clustering approach to tackle the problem.

Clustering Representation Learning

Paper
Code

LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport

1 code implementation • 5 Sep 2019 • Yanbin Liu, Makoto Yamada, Yao-Hung Hubert Tsai, Tam Le, Ruslan Salakhutdinov, Yi Yang

To estimate the mutual information from data, a common practice is preparing a set of paired samples $\{(\mathbf{x}_i,\mathbf{y}_i)\}_{i=1}^n \stackrel{\mathrm{i. i. d.

BIG-bench Machine Learning Mutual Information Estimation

Paper
Code

Multi-scale discriminative Region Discovery for Weakly-Supervised Object Localization

no code implementations • 24 Sep 2019 • Pei Lv, Haiyu Yu, Junxiao Xue, Junjin Cheng, Lisha Cui, Bing Zhou, Mingliang Xu, Yi Yang

On ILSVRC 2016, the proposed method yields a Top-1 localization error of 48.65%, which outperforms previous results by 2.75%.

Weakly-Supervised Object Localization

Paper
Add Code

Gated Channel Transformation for Visual Recognition

3 code implementations • CVPR 2020 • Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang

This lightweight layer incorporates a simple l2 normalization, making our transformation unit applicable at the operator level without much increase in additional parameters.

General Classification Image Classification +5

125

Paper
Code

Searching for A Robust Neural Architecture in Four GPU Hours

6 code implementations • CVPR 2019 • Xuanyi Dong, Yi Yang

To avoid traversing all the possibilities of the sub-graphs, we develop a differentiable sampler over the DAG.

Ranked #18 on Neural Architecture Search on CIFAR-10

Neural Architecture Search

1,548

Paper
Code

One-Shot Neural Architecture Search via Self-Evaluated Template Network

4 code implementations • ICCV 2019 • Xuanyi Dong, Yi Yang

In this paper, we propose a Self-Evaluated Template Network (SETN) to improve the quality of the architecture candidates for evaluation so that it is more likely to cover competitive candidates.

Ranked #18 on Neural Architecture Search on NAS-Bench-201, ImageNet-16-120 (Accuracy (Val) metric)

Neural Architecture Search

1,548

Paper
Code

PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing

2 code implementations • 18 Oct 2019 • Hehe Fan, Yi Yang

We apply PointRNN, PointGRU and PointLSTM to moving point cloud prediction, which aims to predict the future trajectories of points in a set given their history movements.

Moving Point Cloud Processing

141

Paper
Code

Self-Correction for Human Parsing

2 code implementations • 22 Oct 2019 • Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang

To tackle the problem of learning with label noises, this work introduces a purification strategy, called Self-Correction for Human Parsing (SCHP), to progressively promote the reliability of the supervised labels as well as the learned models.

Ranked #2 on Human Part Segmentation on PASCAL-Part

Human Parsing Human Part Segmentation +1

943

Paper
Code

Instance-Invariant Domain Adaptive Object Detection via Progressive Disentanglement

no code implementations • 20 Nov 2019 • Aming Wu, Yahong Han, Linchao Zhu, Yi Yang

Most state-of-the-art methods of object detection suffer from poor generalization ability when the training and test data are from different domains, e.g., with different styles.

Disentanglement Object +2

Paper
Add Code

Connective Cognition Network for Directional Visual Commonsense Reasoning

1 code implementation • NeurIPS 2019 • Aming Wu, Linchao Zhu, Yahong Han, Yi Yang

Inspired by this idea, towards VCR, we propose a connective cognition network (CCN) to dynamically reorganize the visual neuron connectivity that is contextualized by the meaning of questions and answers.

Sentence Visual Commonsense Reasoning

Paper
Code

DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking

1 code implementation • 15 Dec 2019 • Yanyan Wei, Zhao Zhang, Yang Wang, Mingliang Xu, Yi Yang, Shuicheng Yan, Meng Wang

However, in practice it is rather common to have no un-paired images in real deraining tasks; in such cases, removing the rain streaks in an unsupervised way is very challenging due to the lack of constraints between images, which leads to low-quality recovery results.

Single Image Deraining

Paper
Code

Unsupervised Scene Adaptation with Memory Regularization in vivo

2 code implementations • 24 Dec 2019 • Zhedong Zheng, Yi Yang

We consider the unsupervised scene adaptation problem of learning from both labeled source data and unlabeled target data.

Ranked #1 on Domain Adaptation on SYNTHIA-to-Cityscapes Labels

Semantic Segmentation Synthetic-to-Real Translation +1

379

Paper
Code

Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation

no code implementations • 25 Dec 2019 • Lu Huang, Gaofeng Cheng, Pengyuan Zhang, Yi Yang, Shumin Xu, Jiasong Sun

The experimental results show that uPIT outperforms cPIT when LC-BLSTM is used during inference.

Speech Separation

Paper
Add Code

Very Long Natural Scenery Image Prediction by Outpainting

1 code implementation • ICCV 2019 • Zongxin Yang, Jian Dong, Ping Liu, Yi Yang, Shuicheng Yan

The second challenge is how to maintain high quality in generated results, especially for multi-step generations in which generated regions are spatially far away from the initial input.

Image Inpainting Image Outpainting

Paper
Code

NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search

4 code implementations • ICLR 2020 • Xuanyi Dong, Yi Yang

A variety of algorithms search architectures under different search spaces.

Data Augmentation Neural Architecture Search

1,548

Paper
Code

Progressive Local Filter Pruning for Image Retrieval Acceleration

no code implementations • 24 Jan 2020 • Xiaodong Wang, Zhedong Zheng, Yang He, Fei Yan, Zhiqiang Zeng, Yi Yang

To verify this, we evaluate our method on two widely-used image retrieval datasets, i.e., Oxford5k and Paris6K, and one person re-identification dataset, i.e., Market-1501.

Image Retrieval Network Pruning +2

Paper
Add Code

Lane Detection in Low-light Conditions Using an Efficient Data Enhancement: Light Conditions Style Transfer

1 code implementation • 4 Feb 2020 • Tong Liu, Zhaowei Chen, Yi Yang, Zehao Wu, Haowei Li

Nowadays, deep learning techniques are widely used for lane detection, but application in low-light conditions remains a challenge to this day.

Lane Detection Multi-Task Learning +1

134

Paper
Code

Symbiotic Attention with Privileged Information for Egocentric Action Recognition

no code implementations • 8 Feb 2020 • Xiaohan Wang, Yu Wu, Linchao Zhu, Yi Yang

Due to the large action vocabulary in egocentric video datasets, recent studies usually utilize a two-branch structure for action recognition, i.e., one branch for verb classification and the other branch for noun classification.

Ranked #4 on Egocentric Activity Recognition on EGTEA

Action Recognition Egocentric Activity Recognition +5

Paper
Add Code

Dynamic Inference: A New Approach Toward Efficient Video Action Recognition

no code implementations • 9 Feb 2020 • Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Yi Yang, Shilei Wen

In a nutshell, we treat input frames and network depth of the computational graph as a 2-dimensional grid, and several checkpoints are placed on this grid in advance with a prediction module.

Action Recognition In Videos Temporal Action Localization

Paper
Add Code

University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization

3 code implementations • 27 Feb 2020 • Zhedong Zheng, Yunchao Wei, Yi Yang

To our knowledge, University-1652 is the first drone-based geo-localization dataset and enables two new tasks, i.e., drone-view target localization and drone navigation.

Ranked #6 on Drone navigation on University-1652

Drone navigation Drone-view target localization +2

446

Paper
Code

Grounded and Controllable Image Completion by Incorporating Lexical Semantics

no code implementations • 29 Feb 2020 • Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu

Existing image completion procedures are highly subjective because they consider only visual context, which may produce unpredictable results that are plausible but not faithful to grounded knowledge.

Paper
Add Code

Angle-Based Cost-Sensitive Multicategory Classification

no code implementations • 8 Mar 2020 • Yi Yang, Yuxuan Guo, Xiangyu Chang

To show the usefulness of the framework, two cost-sensitive multicategory boosting algorithms are derived as concrete instances.

Classification General Classification

Paper
Add Code

Rectifying Pseudo Label Learning via Uncertainty Estimation for Domain Adaptive Semantic Segmentation

3 code implementations • 8 Mar 2020 • Zhedong Zheng, Yi Yang

This paper focuses on the unsupervised domain adaptation of transferring the knowledge from the source domain to the target domain in the context of semantic segmentation.

Ranked #2 on Unsupervised Domain Adaptation on Cityscapes-to-OxfordCar

Pseudo Label Segmentation +3

379

Paper
Code

SF-Net: Single-Frame Supervision for Temporal Action Localization

1 code implementation • ECCV 2020 • Fan Ma, Linchao Zhu, Yi Yang, Shengxin Zha, Gourab Kundu, Matt Feiszli, Zheng Shou

To obtain the single-frame supervision, the annotators are asked to identify only a single frame within the temporal window of an action.

Ranked #5 on Weakly Supervised Action Localization on BEOID

Weakly Supervised Action Localization

Paper
Code

Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior

1 code implementation • ECCV 2020 • Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang

Most previous work on adversarial attacks focuses on image models, while the vulnerability of video models is less explored.

Adversarial Attack Video Classification

Paper
Code

Collaborative Video Object Segmentation by Foreground-Background Integration

2 code implementations • ECCV 2020 • Zongxin Yang, Yunchao Wei, Yi Yang

This paper investigates the principles of embedding learning to tackle the challenging semi-supervised video object segmentation.

Ranked #8 on Video Object Segmentation on YouTube-VOS 2019

Object One-shot visual object segmentation +3

1,419

Paper
Code

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

no code implementations • CVPR 2020 • Jiaxu Miao, Yunchao Wei, Yi Yang

Interactive video object segmentation (iVOS) aims at efficiently harvesting high-quality segmentation masks of the target object in a video with user interactions.

Ranked #5 on Interactive Video Object Segmentation on DAVIS 2017 (AUC-J metric)

Interactive Video Object Segmentation Object +2

Paper
Add Code

Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries

no code implementations • 3 Apr 2020 • Hao Wang, Cheng Deng, Fan Ma, Yi Yang

Actor and action video segmentation with language queries aims to segment the objects in the video that the expression refers to.

Ranked #10 on Referring Expression Segmentation on J-HMDB

Referring Expression Segmentation Video Segmentation +2

Paper
Add Code

One Model to Recognize Them All: Marginal Distillation from NER Models with Different Tag Sets

no code implementations • 10 Apr 2020 • Keunwoo Peter Yu, Yi Yang

Named entity recognition (NER) is a fundamental component in the modern language understanding pipeline.

named-entity-recognition Named Entity Recognition +2

Paper
Add Code

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in An Open World

no code implementations • CVPR 2021 • Zhun Zhong, Linchao Zhu, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe

In this paper, we tackle the problem of discovering new classes in unlabeled visual data given labeled data from disjoint classes.

Clustering Novel Class Discovery

Paper
Add Code

Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation

1 code implementation • NeurIPS 2020 • Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang

We address the problem of One-Shot Unsupervised Domain Adaptation.

Ranked #2 on One-shot Unsupervised Domain Adaptation on GTA5 to Cityscapes

domain classification One-shot Unsupervised Domain Adaptation +2

Paper
Code

VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification

3 code implementations • 14 Apr 2020 • Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, Tao Mei

This stage relaxes the full alignment between the training and testing domains, as it is agnostic to the target vehicle domain.

Ranked #1 on Vehicle Re-Identification on VehicleID

Representation Learning Vehicle Re-Identification

3,955

Paper
Code

Omni-supervised Facial Expression Recognition via Distilled Data

no code implementations • 18 May 2020 • Ping Liu, Yunchao Wei, Zibo Meng, Weihong Deng, Joey Tianyi Zhou, Yi Yang

However, the performance of the current state-of-the-art facial expression recognition (FER) approaches is directly related to the labeled data for training.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Add Code

Feature Robust Optimal Transport for High-dimensional Data

1 code implementation • 25 May 2020 • Mathis Petrovich, Chao Liang, Ryoma Sato, Yanbin Liu, Yao-Hung Hubert Tsai, Linchao Zhu, Yi Yang, Ruslan Salakhutdinov, Makoto Yamada

To show the effectiveness of FROT, we propose using the FROT algorithm for the layer selection problem in deep neural networks for semantic correspondence.

feature selection Semantic correspondence +1

Paper
Code

Parameter-Efficient Person Re-identification in the 3D Space

1 code implementation • 8 Jun 2020 • Zhedong Zheng, Nenggan Zheng, Yi Yang

To our knowledge, this is among the first attempts to conduct person re-identification in 3D space.

Ranked #1 on Person Re-Identification on DukeMTMC-reID->Market-1501

3D Point Cloud Classification Point Cloud Classification +3

260

Paper
Code

Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement Maps

1 code implementation • 9 Jun 2020 • Xiaolin Zhang, Yunchao Wei, Yi Yang, Fei Wu

To fulfill the direct evaluation, we annotate pixel-level object masks on the ILSVRC validation set.

Object Weakly-Supervised Object Localization

Paper
Code

FinBERT: A Pretrained Language Model for Financial Communications

1 code implementation • 15 Jun 2020 • Yi Yang, Mark Christopher Siy UY, Allen Huang

Contextual pretrained language models, such as BERT (Devlin et al., 2019), have made significant breakthroughs in various NLP tasks by training on large-scale unlabeled text resources. The financial sector also accumulates a large amount of financial communication text. However, no pretrained finance-specific language models are available.

Language Modelling Sentiment Analysis +1

522

Paper
Code

Sketch-Guided Scenery Image Outpainting

no code implementations • 17 Jun 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

In this work, we take image outpainting one step further by allowing users to obtain personalized outpainting results using sketches as guidance.

Image Outpainting

Paper
Add Code

Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder

no code implementations • ACL 2020 • Fan Zhou, Shengming Zhang, Yi Yang

To tackle these challenges, we present a semi-supervised text classification framework that integrates a multi-head attention mechanism with Semi-supervised variational inference for Operational Risk Classification (SemiORC).

General Classification Management +3

Paper
Add Code

Interpreting Twitter User Geolocation

no code implementations • ACL 2020 • Ting Zhong, Tianliang Wang, Fan Zhou, Goce Trajcevski, Kunpeng Zhang, Yi Yang

Identifying user geolocation in online social networks is an essential task in many location-based applications.

Paper
Add Code

Single Image Brightening via Multi-Scale Exposure Fusion with Hybrid Learning

no code implementations • 4 Jul 2020 • Chaobing Zheng, Zhengguo Li, Yi Yang, Shiqian Wu

In this paper, a single image brightening algorithm is introduced to brighten such an image.

SSIM

Paper
Add Code

A Survey on Concept Factorization: From Shallow to Deep Representation Learning

no code implementations • 31 Jul 2020 • Zhao Zhang, Yan Zhang, Mingliang Xu, Li Zhang, Yi Yang, Shuicheng Yan

In this paper, we therefore survey the recent advances on CF methodologies and the potential benchmarks by categorizing and summarizing the current methods.

Clustering Representation Learning

Paper
Add Code

Inter-Image Communication for Weakly Supervised Localization

1 code implementation • ECCV 2020 • Xiaolin Zhang, Yunchao Wei, Yi Yang

We learn a feature center for each category and realize the global feature consistency by forcing the object features to approach class-specific centers.

Object

Paper
Code

Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents

1 code implementation • ECCV 2020 • Ye Zhu, Yu Wu, Yi Yang, Yan Yan

With rising concerns about AI systems that have direct access to abundant sensitive information, researchers seek to develop more reliable AI with implicit information sources.

Video Description

Paper
Code

DONet: Dual Objective Networks for Skin Lesion Segmentation

no code implementations • 19 Aug 2020 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

Skin lesion segmentation is a crucial step in the computer-aided diagnosis of dermoscopic images.

Lesion Segmentation Segmentation +2

Paper
Add Code

Memory-based Jitter: Improving Visual Recognition on Long-tailed Data with Diversity In Memory

no code implementations • 22 Aug 2020 • Jialun Liu, Jingwei Zhang, Yi Yang, Wenhui Li, Chi Zhang, Yifan Sun

With slight modifications, MBJ is applicable to two fundamental visual recognition tasks, i.e., deep image classification and deep metric learning (on long-tailed data).

Ranked #44 on Long-tail Learning on CIFAR-100-LT (ρ=100)

Data Augmentation General Classification +4

Paper
Add Code

Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition

no code implementations • 26 Aug 2020 • Ping Liu, Yuewei Lin, Zibo Meng, Lu Lu, Weihong Deng, Joey Tianyi Zhou, Yi Yang

In this paper, we propose a simple yet effective approach, named Point Adversarial Self Mining (PASM), to improve the recognition accuracy in facial expression recognition.

Adversarial Attack Data Augmentation +4

Paper
Add Code

Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization

1 code implementation • 26 Aug 2020 • Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, Yi Yang

Existing methods usually concentrate on mining the fine-grained features of the geographic target in the image center, but underestimate the contextual information in neighboring areas.

Ranked #3 on Drone navigation on University-1652

Drone navigation Drone-view target localization +2

Paper
Code

Hierarchical memory decoder for visual narrating

no code implementations • IEEE Transactions on Circuits and Systems for Video Technology 2020 • Aming Wu, Yahong Han, Zhou Zhao, Yi Yang

In this article, we devise a novel memory decoder for visual narrating.

Ranked #13 on Visual Storytelling on VIST

Image Captioning Video Captioning +1

Paper
Add Code

Tasks Integrated Networks: Joint Detection and Retrieval for Image Search

no code implementations • 3 Sep 2020 • Lei Zhang, Zhenwei He, Yi Yang, Liang Wang, Xinbo Gao

The traditional object retrieval task aims to learn a discriminative feature representation with intra-similarity and inter-dissimilarity, which supposes that the objects in an image are manually or automatically pre-cropped exactly.

Image Retrieval Philosophy +1

Paper
Add Code

DOTS: Decoupling Operation and Topology in Differentiable Architecture Search

no code implementations • CVPR 2021 • Yu-Chao Gu, Li-Juan Wang, Yun Liu, Yi Yang, Yu-Huan Wu, Shao-Ping Lu, Ming-Ming Cheng

DARTS mainly focuses on the operation search and derives the cell topology from the operation weights.

Paper
Add Code
