Search Results for author: Yi Yang

Found 511 papers, 249 papers with code

Complex Event Detection via Multi-source Video Attributes

no code implementations CVPR 2013 Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, Nicu Sebe, Alexander G. Hauptmann

Compared to complex event videos, these external videos contain simple contents such as objects, scenes and actions which are the basic elements of complex events.

Event Detection

Harry Potter's Marauder's Map: Localizing and Tracking Multiple Persons-of-Interest by Nonnegative Discretization

no code implementations CVPR 2013 Shoou-I Yu, Yi Yang, Alexander Hauptmann

A device just like Harry Potter's Marauder's Map, which pinpoints the location of each person-of-interest at all times, provides invaluable information for analysis of surveillance videos.

Face Recognition Human Detection

Decomposable Nonlocal Tensor Dictionary Learning for Multispectral Image Denoising

no code implementations CVPR 2014 Yi Peng, Deyu Meng, Zongben Xu, Chenqiang Gao, Yi Yang, Biao Zhang

As compared to the conventional RGB or gray-scale images, multispectral images (MSI) can deliver more faithful representation for real scenes, and enhance the performance of many computer vision tasks.

Dictionary Learning Image Denoising

Parsing Occluded People

no code implementations CVPR 2014 Golnaz Ghiasi, Yi Yang, Deva Ramanan, Charless C. Fowlkes

Occlusion poses a significant difficulty for object recognition due to the combinatorial diversity of possible occlusion patterns.

Object Recognition Pose Estimation

Event Detection using Multi-Level Relevance Labels and Multiple Features

no code implementations CVPR 2014 Zhongwen Xu, Ivor W. Tsang, Yi Yang, Zhigang Ma, Alexander G. Hauptmann

We address the challenging problem of utilizing related exemplars for complex event detection while multiple features are available.

Event Detection

Explain Images with Multimodal Recurrent Neural Networks

no code implementations4 Oct 2014 Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan L. Yuille

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel sentence descriptions to explain the content of images.

8k Retrieval +1

A Discriminative CNN Video Representation for Event Detection

no code implementations CVPR 2015 Zhongwen Xu, Yi Yang, Alexander G. Hauptmann

In this paper, we propose a discriminative video representation for event detection over a large scale video dataset when only limited hardware resources are available.

Event Detection

A Convex Formulation for Spectral Shrunk Clustering

no code implementations23 Nov 2014 Xiaojun Chang, Feiping Nie, Zhigang Ma, Yi Yang, Xiaofang Zhou

Thus, applying manifold information obtained from the original space to the clustering process in a low-dimensional subspace is prone to inferior performance.

Clustering Dimensionality Reduction

Semi-supervised Feature Analysis by Mining Correlations among Multiple Tasks

no code implementations23 Nov 2014 Xiaojun Chang, Yi Yang

In this paper, we propose a novel semi-supervised feature selection framework by mining correlations among multiple tasks and apply it to different multimedia applications.

feature selection

Balanced k-Means and Min-Cut Clustering

no code implementations23 Nov 2014 Xiaojun Chang, Feiping Nie, Zhigang Ma, Yi Yang

Clustering is an effective technique in data mining to generate groups that are the matter of interest.

Clustering

A Convex Sparse PCA for Feature Analysis

no code implementations23 Nov 2014 Xiaojun Chang, Feiping Nie, Yi Yang, Heng Huang

In addition, based on the sparse model used in CSPCA, an optimal weight is assigned to each of the original feature, which in turn provides the output with good interpretability.

Dimensionality Reduction feature selection +1

Improved Spectral Clustering via Embedded Label Propagation

no code implementations23 Nov 2014 Xiaojun Chang, Feiping Nie, Yi Yang, Heng Huang

Our algorithm is built upon two advancements of the state of the art:1) label propagation, which propagates a node\'s labels to neighboring nodes according to their proximity; and 2) manifold learning, which has been widely used in its capacity to leverage the manifold structure of data points.

Clustering

Compound Rank-k Projections for Bilinear Analysis

no code implementations23 Nov 2014 Xiaojun Chang, Feiping Nie, Sen Wang, Yi Yang, Xiaofang Zhou, Chengqi Zhang

In many real-world applications, data are represented by matrices or high-order tensors.

Unsupervised Domain Adaptation with Feature Embeddings

1 code implementation14 Dec 2014 Yi Yang, Jacob Eisenstein

Representation learning is the dominant technique for unsupervised domain adaptation, but existing approaches often require the specification of "pivot features" that generalize across domains, which are selected by task-specific heuristics.

Representation Learning Unsupervised Domain Adaptation

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

2 code implementations20 Dec 2014 Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions.

8k Image Captioning +1

Group $K$-Means

no code implementations5 Jan 2015 Jianfeng Wang, Shuicheng Yan, Yi Yang, Mohan S. Kankanhalli, Shipeng Li, Jingdong Wang

We study how to learn multiple dictionaries from a dataset, and approximate any data point by the sum of the codewords each chosen from the corresponding dictionary.

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images

1 code implementation ICCV 2015 Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille

In particular, we propose a transposed weight sharing scheme, which not only improves performance on image captioning, but also makes the model more suitable for the novel concept learning task.

Image Captioning Novel Concepts +1

DevNet: A Deep Event Network for Multimedia Event Detection and Evidence Recounting

no code implementations CVPR 2015 Chuang Gan, Naiyan Wang, Yi Yang, Dit-yan Yeung, Alex G. Hauptmann

Taking key frames of videos as input, we first detect the event of interest at the video level by aggregating the CNN features of the key frames.

Action Recognition Event Detection +2

Learning From Massive Noisy Labeled Data for Image Classification

no code implementations CVPR 2015 Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang

To demonstrate the effectiveness of our approach, we collect a large-scale real-world clothing classification dataset with both noisy and clean labels.

Classification General Classification +1

Indexing of CNN Features for Large Scale Image Search

no code implementations2 Aug 2015 Ruoyu Liu, Yao Zhao, Shikui Wei, Yi Yang

The convolutional neural network (CNN) features can give a good description of image content, which usually represent images with unique global vectors.

Clustering Image Retrieval +2

Insurance Premium Prediction via Gradient Tree-Boosted Tweedie Compound Poisson Models

no code implementations26 Aug 2015 Yi Yang, Wei Qian, Hui Zou

The Tweedie GLM is a widely used method for predicting insurance premiums.

Methodology

BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing

1 code implementation16 Oct 2015 Linnan Wang, Wei Wu, Jianxiong Xiao, Yi Yang

Basic Linear Algebra Subprograms (BLAS) are a set of low level linear algebra kernels widely adopted by applications involved with the deep learning and scientific computing.

Distributed, Parallel, and Cluster Computing

Attention to Scale: Scale-aware Semantic Image Segmentation

no code implementations CVPR 2016 Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille

We adapt a state-of-the-art semantic image segmentation model, which we jointly train with multi-scale input images and the attention model.

Image Segmentation Segmentation +1

Uncovering Temporal Context for Video Question and Answering

no code implementations15 Nov 2015 Linchao Zhu, Zhongwen Xu, Yi Yang, Alexander G. Hauptmann

In this work, we introduce Video Question Answering in temporal domain to infer the past, describe the present and predict the future.

Multiple-choice Question Answering +1

Overcoming Language Variation in Sentiment Analysis with Social Attention

1 code implementation TACL 2017 Yi Yang, Jacob Eisenstein

Variation in language is ubiquitous, particularly in newer forms of writing such as social media.

Sentiment Analysis

Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks

no code implementations ICCV 2015 Chunshui Cao, Xian-Ming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang

While feedforward deep convolutional neural networks (CNNs) have been a great success in computer vision, it is important to remember that the human visual contex contains generally more feedback connections than foward connections.

Dynamic Concept Composition for Zero-Example Event Detection

no code implementations14 Jan 2016 Xiaojun Chang, Yi Yang, Guodong Long, Chengqi Zhang, Alexander G. Hauptmann

In this paper, we focus on automatically detecting events in unconstrained videos without the use of any visual training exemplars.

Event Detection Zero-Shot Learning

Part-of-Speech Tagging for Historical English

no code implementations NAACL 2016 Yi Yang, Jacob Eisenstein

We evaluate several domain adaptation methods on the task of tagging Early Modern English and Modern British English texts in the Penn Corpora of Historical English.

Part-Of-Speech Tagging Unsupervised Domain Adaptation +1

Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent

no code implementations17 Mar 2016 Linnan Wang, Yi Yang, Martin Renqiang Min, Srimat Chakradhar

Then we present the study of ISGD batch size to the learning rate, parallelism, synchronization cost, system saturation and scalability.

Fully Convolutional Attention Networks for Fine-Grained Recognition

no code implementations22 Mar 2016 Xiao Liu, Tian Xia, Jiang Wang, Yi Yang, Feng Zhou, Yuanqing Lin

Fine-grained recognition is challenging due to its subtle local inter-class differences versus large intra-class variations such as poses.

reinforcement-learning Reinforcement Learning (RL)

Person Re-identification in the Wild

no code implementations CVPR 2017 Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian

Our baselines address three issues: the performance of various combinations of detectors and recognizers, mechanisms for pedestrian detection to help improve overall re-identification accuracy and assessing the effectiveness of different detectors for re-identification.

Benchmarking Pedestrian Detection +2

CNN-RNN: A Unified Framework for Multi-label Image Classification

1 code implementation CVPR 2016 Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu

While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image.

Classification General Classification +2

Long-Term Identity-Aware Multi-Person Tracking for Surveillance Video Summarization

no code implementations25 Apr 2016 Shoou-I Yu, Yi Yang, Xuanchong Li, Alexander G. Hauptmann

Therefore, our tracker propagates identity information to frames without recognized faces by uncovering the appearance and spatial manifold formed by person detections.

Face Recognition Video Summarization

You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images

no code implementations CVPR 2016 Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei

The Web images are then filtered by the learnt network and the selected images are additionally fed into the network to enhance the architecture and further trim the videos.

Action Recognition Event Detection +1

Strategies for Searching Video Content with Text Queries or Video Examples

no code implementations17 Jun 2016 Shoou-I Yu, Yi Yang, Zhongwen Xu, Shicheng Xu, Deyu Meng, Zexi Mao, Zhigang Ma, Ming Lin, Xuanchong Li, Huan Li, Zhenzhong Lan, Lu Jiang, Alexander G. Hauptmann, Chuang Gan, Xingzhong Du, Xiaojun Chang

The large number of user-generated videos uploaded on to the Internet everyday has led to many commercial video search engines, which mainly rely on text metadata for search.

Event Detection Retrieval +1

SIFT Meets CNN: A Decade Survey of Instance Retrieval

1 code implementation5 Aug 2016 Liang Zheng, Yi Yang, Qi Tian

This survey presents milestones in modern instance retrieval, reviews a broad selection of previous works in different categories, and provides insights on the connection between SIFT and CNN-based methods.

Content-Based Image Retrieval Retrieval

S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking

no code implementations IJCNLP 2015 Yi Yang, Ming-Wei Chang

Non-linear models recently receive a lot of attention as people are starting to discover the power of statistical and embedding features.

Entity Linking

Person Re-identification: Past, Present and Future

no code implementations10 Oct 2016 Liang Zheng, Yi Yang, Alexander G. Hauptmann

Person re-identification (re-ID) has become increasingly popular in the community due to its application and research significance.

Image Classification Person Re-Identification +1

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs

no code implementations12 Oct 2016 Chao Li, Yi Yang, Min Feng, Srimat Chakradhar, Huiyang Zhou

Leveraging large data sets, deep Convolutional Neural Networks (CNNs) achieve state-of-the-art recognition accuracy.

Computational Efficiency

A Discriminatively Learned CNN Embedding for Person Re-identification

4 code implementations17 Nov 2016 Zhedong Zheng, Liang Zheng, Yi Yang

We revisit two popular convolutional neural networks (CNN) in person re-identification (re-ID), i. e, verification and classification models.

General Classification Image Retrieval +2

Few-Shot Object Recognition from Machine-Labeled Web Images

no code implementations CVPR 2017 Zhongwen Xu, Linchao Zhu, Yi Yang

Then, we demonstrate that with our model, machine-labeled image annotations are very effective and abundant resources to perform object recognition on novel categories.

Few-Shot Learning Object +1

Personalized Video Recommendation Using Rich Contents from Videos

1 code implementation21 Dec 2016 Xingzhong Du, Hongzhi Yin, Ling Chen, Yang Wang, Yi Yang, Xiaofang Zhou

In the existing video recommender systems, the models make the recommendations based on the user-video interactions and single specific content features.

Recommendation Systems

Pose Invariant Embedding for Deep Person Re-identification

no code implementations26 Jan 2017 Liang Zheng, Yujia Huang, Huchuan Lu, Yi Yang

Second, to reduce the impact of pose estimation errors and information loss during PoseBox construction, we design a PoseBox fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input.

Person Re-Identification Pose Estimation +1

A New Evaluation Protocol and Benchmarking Results for Extendable Cross-media Retrieval

no code implementations10 Mar 2017 Ruoyu Liu, Yao Zhao, Liang Zheng, Shikui Wei, Yi Yang

Additionally, a trivial solution, \ie, directly using the predicted class label for cross-media retrieval, is tested.

Benchmarking Image Retrieval +1

Twitter100k: A Real-world Dataset for Weakly Supervised Cross-Media Retrieval

no code implementations20 Mar 2017 Yuting Hu, Liang Zheng, Yi Yang, Yongfeng Huang

Second, texts in these datasets are written in well-organized language, leading to inconsistency with realistic applications.

Optical Character Recognition (OCR) Retrieval +1

An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning

no code implementations22 Mar 2017 Fan Wu, Zhongwen Xu, Yi Yang

We propose an end-to-end approach to the natural language object retrieval task, which localizes an object within an image according to a natural language description, i. e., referring expression.

Object Referring Expression +1

More is Less: A More Complicated Network with Less Inference Complexity

no code implementations CVPR 2017 Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity.

Dynamic Computational Time for Visual Attention

1 code implementation30 Mar 2017 Zhichao Li, Yi Yang, Xiao Liu, Feng Zhou, Shilei Wen, Wei Xu

We propose a dynamic computational time model to accelerate the average processing time for recurrent visual attention (RAM).

reinforcement-learning Reinforcement Learning (RL)

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

2 code implementations14 May 2017 Shuchang Zhou, Taihong Xiao, Yi Yang, Dieqiao Feng, Qinyao He, Weiran He

In this work, we propose a model that can learn object transfiguration from two unpaired sets of images: one set containing images that "have" that kind of object, and the other set being the opposite, with the mild constraint that the objects be located approximately at the same place.

Attribute Conditional Image Generation +1

Unsupervised Learning Layers for Video Analysis

no code implementations24 May 2017 Liang Zhao, Yang Wang, Yi Yang, Wei Xu

This paper presents two unsupervised learning layers (UL layers) for label-free video analysis: one for fully connected layers, and the other for convolutional ones.

Object Localization

Few-Example Object Detection with Model Communication

1 code implementation26 Jun 2017 Xuanyi Dong, Liang Zheng, Fan Ma, Yi Yang, Deyu Meng

Experiments on PASCAL VOC'07, MS COCO'14, and ILSVRC'13 indicate that by using as few as three or four samples selected for each category, our method produces very competitive results when compared to the state-of-the-art weakly-supervised approaches using a large number of image-level labels.

Object object-detection

UTS submission to Google YouTube-8M Challenge 2017

1 code implementation13 Jul 2017 Linchao Zhu, Yanbin Liu, Yi Yang

In this paper, we present our solution to Google YouTube-8M Video Classification Challenge 2017.

Classification General Classification +1

PatchShuffle Regularization

no code implementations22 Jul 2017 Guoliang Kang, Xuanyi Dong, Liang Zheng, Yi Yang

This paper focuses on regularizing the training of the convolutional neural network (CNN).

General Classification

Robust PCA by Manifold Optimization

no code implementations1 Aug 2017 Teng Zhang, Yi Yang

Robust PCA is a widely used statistical procedure to recover a underlying low-rank matrix with grossly corrupted observations.

Random Erasing Data Augmentation

18 code implementations16 Aug 2017 Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, Yi Yang

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN).

General Classification Image Augmentation +4

An Improved Residual LSTM Architecture for Acoustic Modeling

no code implementations17 Aug 2017 Lu Huang, Jiasong Sun, Ji Xu, Yi Yang

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural Networks

no code implementations22 Sep 2017 Xuanyi Dong, Guoliang Kang, Kun Zhan, Yi Yang

For most state-of-the-art architectures, Rectified Linear Unit (ReLU) becomes a standard component accompanied with each layer.

Blocking Image Classification

Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition

no code implementations ICCV 2017 Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen

The designed ReST has an intrinsic recursive structure and is capable of progressively aligning faces to a canonical one, even those with large variations.

Face Alignment Face Recognition

Learning Discriminative Latent Attributes for Zero-Shot Classification

no code implementations ICCV 2017 Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen

Zero-shot learning (ZSL) aims to transfer knowledge from observed classes to the unseen classes, based on the assumption that both the seen and unseen classes share a common semantic space, among which attributes enjoy a great popularity.

Attribute Classification +3

Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos

no code implementations ICCV 2017 Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann

relevant) to the given event class, we formulate this task as a multi-instance learning (MIL) problem by taking each video as a bag and the video shots in each video as instances.

Event Detection

Nanophotonic Particle Simulation and Inverse Design Using Artificial Neural Networks

1 code implementation18 Oct 2017 John Peurifoy, Yichen Shen, Li Jing, Yi Yang, Fidel Cano-Renteria, Brendan Delacy, Max Tegmark, John D. Joannopoulos, Marin Soljacic

We propose a method to use artificial neural networks to approximate light scattering by multilayer nanoparticles.

Computational Physics Applied Physics Optics

Occlusion Aware Unsupervised Learning of Optical Flow

no code implementations CVPR 2018 Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu

Especially on KITTI dataset where abundant unlabeled samples exist, our unsupervised method outperforms its counterpart trained with supervised learning.

Optical Flow Estimation

Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification

2 code implementations CVPR 2018 Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao

To this end, we propose to preserve two types of unsupervised similarities, 1) self-similarity of an image before and after translation, and 2) domain-dissimilarity of a translated source image and a target image.

Generative Adversarial Network Person Re-Identification +2

Collective Entity Disambiguation with Structured Gradient Tree Boosting

1 code implementation NAACL 2018 Yi Yang, Ozan .Irsoy, Kazi Shefaet Rahman

To the best of our knowledge, our work is the first one that employs the structured gradient tree boosting (SGTB) algorithm for collective entity disambiguation.

Entity Disambiguation

Style Aggregated Network for Facial Landmark Detection

1 code implementation CVPR 2018 Xuanyi Dong, Yan Yan, Wanli Ouyang, Yi Yang

In this work, we propose a style-aggregated approach to deal with the large intrinsic variance of image styles for facial landmark detection.

Ranked #2 on Facial Landmark Detection on AFLW-Front (Mean NME metric)

Face Alignment Facial Landmark Detection

Decoupled Novel Object Captioner

1 code implementation11 Apr 2018 Yu Wu, Linchao Zhu, Lu Jiang, Yi Yang

Thus, the sequence model can be decoupled from the novel object descriptions.

Image Captioning Novel Concepts +2

Adversarial Complementary Learning for Weakly Supervised Object Localization

2 code implementations CVPR 2018 Xiaolin Zhang, Yunchao Wei, Jiashi Feng, Yi Yang, Thomas Huang

With such an adversarial learning, the two parallel-classifiers are forced to leverage complementary object regions for classification and can finally generate integral object localization together.

General Classification Object +1

Deploy Large-Scale Deep Neural Networks in Resource Constrained IoT Devices with Local Quantization Region

no code implementations24 May 2018 Yi Yang, Andy Chen, Xiaoming Chen, Jiang Ji, Zhenyang Chen, Yan Dai

Implementing large-scale deep neural networks with high computational complexity on low-cost IoT devices may inevitably be constrained by limited computation resource, making the devices hard to respond in real-time.

Quantization

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

6 code implementations21 Aug 2018 Yang He, Guoliang Kang, Xuanyi Dong, Yanwei Fu, Yi Yang

Therefore, the network trained by our method has a larger model capacity to learn from the training data.

Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks

2 code implementations22 Aug 2018 Yang He, Xuanyi Dong, Guoliang Kang, Yanwei Fu, Chenggang Yan, Yi Yang

With asymptotic pruning, the information of the training set would be gradually concentrated in the remaining filters, so the subsequent training and pruning process would be stable.

Image Classification

Convolutional Neural Networks with Recurrent Neural Filters

2 code implementations EMNLP 2018 Yi Yang

We introduce a class of convolutional neural networks (CNNs) that utilize recurrent neural networks (RNNs) as convolution filters.

Sentence Sentiment Analysis

A Unified Analysis of Stochastic Momentum Methods for Deep Learning

no code implementations30 Aug 2018 Yan Yan, Tianbao Yang, Zhe Li, Qihang Lin, Yi Yang

However, their theoretical analysis of convergence of the training objective and the generalization error for prediction is still under-explored.

RCAA: Relational Context-Aware Agents for Person Search

no code implementations ECCV 2018 Xiaojun Chang, Po-Yao Huang, Yi-Dong Shen, Xiaodan Liang, Yi Yang, Alexander G. Hauptmann

In this paper, we address this problem by training relational context-aware agents which learn the actions to localize the target person from the gallery of whole scene images.

Person Search

Compound Memory Networks for Few-shot Video Classification

no code implementations ECCV 2018 Linchao Zhu, Yi Yang

In this paper, we propose a new memory network structure for few-shot video classification by making the following contributions.

Classification General Classification +1

Generalizing A Person Retrieval Model Hetero- and Homogeneously

1 code implementation ECCV 2018 Zhun Zhong, Liang Zheng, Shaozi Li, Yi Yang

Person re-identification (re-ID) poses unique challenges for unsupervised domain adaptation (UDA) in that classes in the source and target sets (domains) are entirely different and that image variations are largely caused by cameras.

Person Re-Identification Person Retrieval +2

Query Attack via Opposite-Direction Feature:Towards Robust Image Retrieval

2 code implementations7 Sep 2018 Zhedong Zheng, Liang Zheng, Yi Yang, Fei Wu

Opposite-Direction Feature Attack (ODFA) effectively exploits feature-level adversarial gradients and takes advantage of feature distance in the representation space.

Adversarial Attack General Classification +3

DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments

2 code implementations22 Sep 2018 Chao Yu, Zuxin Liu, Xinjun Liu, Fugui Xie, Yi Yang, Qi Wei, Qiao Fei

It is one of the state-of-the-art SLAM systems in high-dynamic environments.

Robotics

Every Node Counts: Self-Ensembling Graph Convolutional Networks for Semi-Supervised Learning

1 code implementation26 Sep 2018 Yawei Luo, Tao Guan, Junqing Yu, Ping Liu, Yi Yang

To capitalize on the information from unlabeled nodes to boost the training for GCN, we propose a novel framework named Self-Ensembling GCN (SEGCN), which marries GCN with Mean Teacher - another powerful model in semi-supervised learning.

General Classification Node Classification

Learning Discriminators as Energy Networks in Adversarial Learning

no code implementations ICLR 2019 Pingbo Pan, Yan Yan, Tianbao Yang, Yi Yang

In this work, we propose to refine the predictions of structured prediction models by effectively integrating discriminative models into the prediction.

Image Segmentation Multi-Label Classification +2

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

1 code implementation8 Oct 2018 Yang Wang, Zhenheng Yang, Peng Wang, Yi Yang, Chenxu Luo, Wei Xu

Then the whole scene is decomposed into moving foreground and static background by compar- ing the estimated optical flow and rigid flow derived from the depth and ego-motion.

Motion Estimation Optical Flow Estimation

Improving Annotation for 3D Pose Dataset of Fine-Grained Object Categories

2 code implementations19 Oct 2018 Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S. Davis

Existing 3D pose datasets of object categories are limited to generic object types and lack of fine-grained information.

3D Pose Estimation Object +1

SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation

1 code implementation22 Oct 2018 Xiaolin Zhang, Yunchao Wei, Yi Yang, Thomas Huang

In this way, the possibilities embedded in the produced similarity maps can be adapted to guide the process of segmenting objects.

Few-Shot Semantic Segmentation Segmentation +1

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

3 code implementations CVPR 2019 Yang He, Ping Liu, Ziwei Wang, Zhilan Hu, Yi Yang

In this paper, we analyze this norm-based criterion and point out that its effectiveness depends on two requirements that are not always met: (1) the norm deviation of the filters should be large; (2) the minimum norm of the filters should be small.

Image Classification

Zero-Shot Transfer VQA Dataset

no code implementations2 Nov 2018 Yuanpeng Li, Yi Yang, Jian-Yu Wang, Wei Xu

Therefore, toaccelerate this research, we propose a newzero-shot transfer VQA(ZST-VQA)dataset by reorganizing the existing VQA v1. 0 dataset in the way that duringtraining, some words appear only in one module (i. e. questions) but not in theother (i. e. answers).

Question Answering Transfer Learning +1

Deep Unfolded Robust PCA with Application to Clutter Suppression in Ultrasound

no code implementations20 Nov 2018 Oren Solomon, Regev Cohen, Yi Zhang, Yi Yang, He Qiong, Jianwen Luo, Ruud J. G. van Sloun, Yonina C. Eldar

We compare the performance of the suggested deep network on both simulations and in-vivo rat brain scans, with a commonly practiced deep-network architecture and the fast iterative shrinkage algorithm, and show that our architecture exhibits better image quality and contrast.

Super-Resolution

Similarity-preserving Image-image Domain Adaptation for Person Re-identification

no code implementations26 Nov 2018 Weijian Deng, Liang Zheng, Qixiang Ye, Yi Yang, Jianbin Jiao

It first preserves two types of unsupervised similarity, namely, self-similarity of an image before and after translation, and domain-dissimilarity of a translated source image and a target image.

Domain Adaptation Generative Adversarial Network +2

Contrastive Adaptation Network for Unsupervised Domain Adaptation

2 code implementations CVPR 2019 Guoliang Kang, Lu Jiang, Yi Yang, Alexander G. Hauptmann

Unsupervised Domain Adaptation (UDA) makes predictions for the target domain data while manual annotations are only available in the source domain.

Unsupervised Domain Adaptation

Significance-aware Information Bottleneck for Domain Adaptive Semantic Segmentation

no code implementations ICCV 2019 Yawei Luo, Ping Liu, Tao Guan, Junqing Yu, Yi Yang

For unsupervised domain adaptation problems, the strategy of aligning the two domains in latent feature space through adversarial learning has achieved much progress in image classification, but usually fails in semantic segmentation tasks in which the latent representations are overcomplex.

Image Classification Segmentation +2

Operation-aware Neural Networks for User Response Prediction

4 code implementations2 Apr 2019 Yi Yang, Baile Xu, Furao Shen, Jian Zhao

Many deep models are proposed to automatically learn high-order feature interactions.

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis

4 code implementations CVPR 2019 Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang

If the initial image is not well initialized, the following processes can hardly refine the image to a satisfactory quality.

Ranked #6 on Text-to-Image Generation on CUB (Inception score metric)

Generative Adversarial Network Text-to-Image Generation

Filter Pruning by Switching to Neighboring CNNs with Good Attributes

no code implementations8 Apr 2019 Yang He, Ping Liu, Linchao Zhu, Yi Yang

In addition, when evaluating the filter importance, only the magnitude information of the filters is considered.

Attribute Image Classification

Revisiting EmbodiedQA: A Simple Baseline and Beyond

no code implementations8 Apr 2019 Yu Wu, Lu Jiang, Yi Yang

In this paper, we empirically study this problem and introduce 1) a simple yet effective baseline that achieves promising performance; 2) an easier and practical setting for EmbodiedQA where an agent has a chance to adapt the trained model to a new environment before it actually answers users questions.

Embodied Question Answering Question Answering

Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation

no code implementations CVPR 2019 Fengda Zhu, Linchao Zhu, Yi Yang

Specifically, our method employs an adversarial feature adaptation model for visual representation transfer and a policy mimic strategy for policy behavior imitation.

Cubic LSTMs for Video Prediction

no code implementations20 Apr 2019 Hehe Fan, Linchao Zhu, Yi Yang

Predicting future frames in videos has become a promising direction of research for both computer vision and robot learning communities.

motion prediction Video Prediction

Network Pruning via Transformable Architecture Search

4 code implementations NeurIPS 2019 Xuanyi Dong, Yi Yang

The maximum probability for the size in each distribution serves as the width and depth of the pruned network, whose parameters are learned by knowledge transfer, e. g., knowledge distillation, from the original networks.

Knowledge Distillation Network Pruning +2

Syntax-Infused Variational Autoencoder for Text Generation

no code implementations ACL 2019 Xinyuan Zhang, Yi Yang, Siyang Yuan, Dinghan Shen, Lawrence Carin

We present a syntax-infused variational autoencoder (SIVAE), that integrates sentences with their syntactic trees to improve the grammar of generated sentences.

Sentence Text Generation

Query-efficient Meta Attack to Deep Neural Networks

1 code implementation ICLR 2020 Jiawei Du, Hu Zhang, Joey Tianyi Zhou, Yi Yang, Jiashi Feng

Black-box attack methods aim to infer suitable attack patterns to targeted DNN models by only using output feedback of the models and the corresponding input queries.

Adversarial Attack Meta-Learning

FASTER Recurrent Networks for Efficient Video Classification

no code implementations10 Jun 2019 Linchao Zhu, Laura Sevilla-Lara, Du Tran, Matt Feiszli, Yi Yang, Heng Wang

FASTER aims to leverage the redundancy between neighboring clips and reduce the computational cost by learning to aggregate the predictions from models of different complexities.

Action Classification Action Recognition +3

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019

no code implementations22 Jun 2019 Xiaohan Wang, Yu Wu, Linchao Zhu, Yi Yang

In this report, we present the Baidu-UTS submission to the EPIC-Kitchens Action Recognition Challenge in CVPR 2019.

Action Recognition Object +2

What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues

1 code implementation ACL 2019 Yu Qin, Yi Yang

Prior research has shown that textual information in a firm{'}s financial statement can be used to predict its stock{'}s risk level.

Adaptive Exploration for Unsupervised Person Re-Identification

1 code implementation9 Jul 2019 Yuhang Ding, Hehe Fan, Mingliang Xu, Yi Yang

However, a problem of the adaptive selection is that, when an image has too many neighborhoods, it is more likely to attract other images as its neighborhoods.

Unsupervised Person Re-Identification

Learning to Adapt Invariance in Memory for Person Re-identification

no code implementations1 Aug 2019 Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, Yi Yang

This work considers the problem of unsupervised domain adaptation in person re-identification (re-ID), which aims to transfer knowledge from the source domain to the target domain.

Person Re-Identification Unsupervised Domain Adaptation

Cascaded Revision Network for Novel Object Captioning

1 code implementation6 Aug 2019 Qianyu Feng, Yu Wu, Hehe Fan, Chenggang Yan, Yi Yang

By this novel cascaded captioning-revising mechanism, CRN can accurately describe images with unseen objects.

Image Captioning Object +3

Attract or Distract: Exploit the Margin of Open Set

1 code implementation ICCV 2019 Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang

In this paper, we exploit the semantic structure of open set data from two aspects: 1) Semantic Categorical Alignment, which aims to achieve good separability of target known classes by categorically aligning the centroid of target with the source.

Domain Adaptation

An Algorithm for Graph-Fused Lasso Based on Graph Decomposition

1 code implementation6 Aug 2019 Feng Yu, Yi Yang, Teng Zhang

In comparison, this work proposes to decompose the objective function into two components, where one component is the loss function plus part of the total variation penalty, and the other component is the remaining total variation penalty.

Optimization and Control Computation

Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection

2 code implementations ICCV 2019 Xuanyi Dong, Yi Yang

A typical approach is to (1) train a detector on the labeled images; (2) generate new training samples using this detector's prediction as pseudo labels of unlabeled images; (3) retrain the detector on the labeled samples and partial pseudo labeled samples.

 Ranked #1 on Facial Landmark Detection on 300W (Full) (using extra training data)

Facial Landmark Detection

Recognizing Part Attributes with Insufficient Data

1 code implementation ICCV 2019 Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu

Although great progress has been made to apply object-level recognition, recognizing the attributes of parts remains less applicable since the training data for part attributes recognition is usually scarce especially for internet-scale applications.

Attribute

Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning

no code implementations ECCV 2020 Linchao Zhu, Sercan O. Arik, Yi Yang, Tomas Pfister

We propose a novel adaptive transfer learning framework, learning to transfer learn (L2TL), to improve performance on a target dataset by careful extraction of the related information from a source dataset.

reinforcement-learning Reinforcement Learning (RL) +1

Dialog Intent Induction with Deep Multi-View Clustering

1 code implementation IJCNLP 2019 Hugh Perkins, Yi Yang

We introduce the dialog intent induction task and present a novel deep multi-view clustering approach to tackle the problem.

Clustering Representation Learning

LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport

1 code implementation5 Sep 2019 Yanbin Liu, Makoto Yamada, Yao-Hung Hubert Tsai, Tam Le, Ruslan Salakhutdinov, Yi Yang

To estimate the mutual information from data, a common practice is preparing a set of paired samples $\{(\mathbf{x}_i,\mathbf{y}_i)\}_{i=1}^n \stackrel{\mathrm{i. i. d.

BIG-bench Machine Learning Mutual Information Estimation

Multi-scale discriminative Region Discovery for Weakly-Supervised Object Localization

no code implementations24 Sep 2019 Pei Lv, Haiyu Yu, Junxiao Xue, Junjin Cheng, Lisha Cui, Bing Zhou, Mingliang Xu, Yi Yang

On ILSVRC 2016, the proposed method yields the Top-1 localization error of 48. 65\%, which outperforms previous results by 2. 75\%.

Weakly-Supervised Object Localization

Gated Channel Transformation for Visual Recognition

3 code implementations CVPR 2020 Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang

This lightweight layer incorporates a simple l2 normalization, enabling our transformation unit applicable to operator-level without much increase of additional parameters.

General Classification Image Classification +5

Searching for A Robust Neural Architecture in Four GPU Hours

6 code implementations CVPR 2019 Xuanyi Dong, Yi Yang

To avoid traversing all the possibilities of the sub-graphs, we develop a differentiable sampler over the DAG.

Neural Architecture Search

One-Shot Neural Architecture Search via Self-Evaluated Template Network

4 code implementations ICCV 2019 Xuanyi Dong, Yi Yang

In this paper, we propose a Self-Evaluated Template Network (SETN) to improve the quality of the architecture candidates for evaluation so that it is more likely to cover competitive candidates.

Neural Architecture Search

PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing

2 code implementations18 Oct 2019 Hehe Fan, Yi Yang

We apply PointRNN, PointGRU and PointLSTM to moving point cloud prediction, which aims to predict the future trajectories of points in a set given their history movements.

Moving Point Cloud Processing

Self-Correction for Human Parsing

2 code implementations22 Oct 2019 Peike Li, Yunqiu Xu, Yunchao Wei, Yi Yang

To tackle the problem of learning with label noises, this work introduces a purification strategy, called Self-Correction for Human Parsing (SCHP), to progressively promote the reliability of the supervised labels as well as the learned models.

Human Parsing Human Part Segmentation +1

Instance-Invariant Domain Adaptive Object Detection via Progressive Disentanglement

no code implementations20 Nov 2019 Aming Wu, Yahong Han, Linchao Zhu, Yi Yang

Most state-of-the-art methods of object detection suffer from poor generalization ability when the training and test data are from different domains, e. g., with different styles.

Disentanglement Object +2

Connective Cognition Network for Directional Visual Commonsense Reasoning

1 code implementation NeurIPS 2019 Aming Wu, Linchao Zhu, Yahong Han, Yi Yang

Inspired by this idea, towards VCR, we propose a connective cognition network (CCN) to dynamically reorganize the visual neuron connectivity that is contextualized by the meaning of questions and answers.

Sentence Visual Commonsense Reasoning

DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking

1 code implementation15 Dec 2019 Yanyan Wei, Zhao Zhang, Yang Wang, Mingliang Xu, Yi Yang, Shuicheng Yan, Meng Wang

However, in practice it is rather common to have no un-paired images in real deraining task, in such cases how to remove the rain streaks in an unsupervised way will be a very challenging task due to lack of constraints between images and hence suffering from low-quality recovery results.

Single Image Deraining

Unsupervised Scene Adaptation with Memory Regularization in vivo

2 code implementations24 Dec 2019 Zhedong Zheng, Yi Yang

We consider the unsupervised scene adaptation problem of learning from both labeled source data and unlabeled target data.

Semantic Segmentation Synthetic-to-Real Translation +1

Very Long Natural Scenery Image Prediction by Outpainting

1 code implementation ICCV 2019 Zongxin Yang, Jian Dong, Ping Liu, Yi Yang, Shuicheng Yan

The second challenge is how to maintain high quality in generated results, especially for multi-step generations in which generated regions are spatially far away from the initial input.

Image Inpainting Image Outpainting

Progressive Local Filter Pruning for Image Retrieval Acceleration

no code implementations24 Jan 2020 Xiaodong Wang, Zhedong Zheng, Yang He, Fei Yan, Zhiqiang Zeng, Yi Yang

To verify this, we evaluate our method on two widely-used image retrieval datasets, i. e., Oxford5k and Paris6K, and one person re-identification dataset, i. e., Market-1501.

Image Retrieval Network Pruning +2

Lane Detection in Low-light Conditions Using an Efficient Data Enhancement : Light Conditions Style Transfer

1 code implementation4 Feb 2020 Tong Liu, Zhaowei Chen, Yi Yang, Zehao Wu, Haowei Li

Nowadays, deep learning techniques are widely used for lane detection, but application in low-light conditions remains a challenge until this day.

Lane Detection Multi-Task Learning +1

Symbiotic Attention with Privileged Information for Egocentric Action Recognition

no code implementations8 Feb 2020 Xiaohan Wang, Yu Wu, Linchao Zhu, Yi Yang

Due to the large action vocabulary in egocentric video datasets, recent studies usually utilize a two-branch structure for action recognition, ie, one branch for verb classification and the other branch for noun classification.

Action Recognition Egocentric Activity Recognition +5

Dynamic Inference: A New Approach Toward Efficient Video Action Recognition

no code implementations9 Feb 2020 Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Yi Yang, Shilei Wen

In a nutshell, we treat input frames and network depth of the computational graph as a 2-dimensional grid, and several checkpoints are placed on this grid in advance with a prediction module.

Action Recognition In Videos Temporal Action Localization

University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization

3 code implementations27 Feb 2020 Zhedong Zheng, Yunchao Wei, Yi Yang

To our knowledge, University-1652 is the first drone-based geo-localization dataset and enables two new tasks, i. e., drone-view target localization and drone navigation.

Drone navigation Drone-view target localization +2

Grounded and Controllable Image Completion by Incorporating Lexical Semantics

no code implementations29 Feb 2020 Shengyu Zhang, Tan Jiang, Qinghao Huang, Ziqi Tan, Zhou Zhao, Siliang Tang, Jin Yu, Hongxia Yang, Yi Yang, Fei Wu

Existing image completion procedure is highly subjective by considering only visual context, which may trigger unpredictable results which are plausible but not faithful to a grounded knowledge.

Angle-Based Cost-Sensitive Multicategory Classification

no code implementations8 Mar 2020 Yi Yang, Yuxuan Guo, Xiangyu Chang

To show the usefulness of the framework, two cost-sensitive multicategory boosting algorithms are derived as concrete instances.

Classification General Classification

Rectifying Pseudo Label Learning via Uncertainty Estimation for Domain Adaptive Semantic Segmentation

3 code implementations8 Mar 2020 Zhedong Zheng, Yi Yang

This paper focuses on the unsupervised domain adaptation of transferring the knowledge from the source domain to the target domain in the context of semantic segmentation.

Pseudo Label Segmentation +3

Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior

1 code implementation ECCV 2020 Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang

Most of previous work on adversarial attack mainly focus on image models, while the vulnerability of video models is less explored.

Adversarial Attack Video Classification

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

no code implementations CVPR 2020 Jiaxu Miao, Yunchao Wei, Yi Yang

Interactive video object segmentation (iVOS) aims at efficiently harvesting high-quality segmentation masks of the target object in a video with user interactions.

Interactive Video Object Segmentation Object +2

VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification

3 code implementations14 Apr 2020 Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, Tao Mei

This stage relaxes the full alignment between the training and testing domains, as it is agnostic to the target vehicle domain.

Representation Learning Vehicle Re-Identification

Omni-supervised Facial Expression Recognition via Distilled Data

no code implementations18 May 2020 Ping Liu, Yunchao Wei, Zibo Meng, Weihong Deng, Joey Tianyi Zhou, Yi Yang

However, the performance of the current state-of-the-art facial expression recognition (FER) approaches is directly related to the labeled data for training.

Facial Expression Recognition Facial Expression Recognition (FER)

Feature Robust Optimal Transport for High-dimensional Data

1 code implementation25 May 2020 Mathis Petrovich, Chao Liang, Ryoma Sato, Yanbin Liu, Yao-Hung Hubert Tsai, Linchao Zhu, Yi Yang, Ruslan Salakhutdinov, Makoto Yamada

To show the effectiveness of FROT, we propose using the FROT algorithm for the layer selection problem in deep neural networks for semantic correspondence.

feature selection Semantic correspondence +1

FinBERT: A Pretrained Language Model for Financial Communications

1 code implementation15 Jun 2020 Yi Yang, Mark Christopher Siy UY, Allen Huang

Contextual pretrained language models, such as BERT (Devlin et al., 2019), have made significant breakthrough in various NLP tasks by training on large scale of unlabeled text re-sources. Financial sector also accumulates large amount of financial communication text. However, there is no pretrained finance specific language models available.

Language Modelling Sentiment Analysis +1

Sketch-Guided Scenery Image Outpainting

no code implementations17 Jun 2020 Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

In this work, we take the image outpainting one step forward by allowing users to harvest personal custom outpainting results using sketches as the guidance.

Image Outpainting

Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder

no code implementations ACL 2020 Fan Zhou, Shengming Zhang, Yi Yang

To tackle these challenges, we present a semi-supervised text classification framework that integrates multi-head attention mechanism with Semi-supervised variational inference for Operational Risk Classification (SemiORC).

General Classification Management +3

Interpreting Twitter User Geolocation

no code implementations ACL 2020 Ting Zhong, Tianliang Wang, Fan Zhou, Goce Trajcevski, Kunpeng Zhang, Yi Yang

Identifying user geolocation in online social networks is an essential task in many location-based applications.

Single Image Brightening via Multi-Scale Exposure Fusion with Hybrid Learning

no code implementations4 Jul 2020 Chaobing Zheng, Zhengguo Li, Yi Yang, Shiqian Wu

In this paper, a single image brightening algorithm is introduced to brighten such an image.

SSIM

A Survey on Concept Factorization: From Shallow to Deep Representation Learning

no code implementations31 Jul 2020 Zhao Zhang, Yan Zhang, Mingliang Xu, Li Zhang, Yi Yang, Shuicheng Yan

In this paper, we therefore survey the recent advances on CF methodologies and the potential benchmarks by categorizing and summarizing the current methods.

Clustering Representation Learning

Inter-Image Communication for Weakly Supervised Localization

1 code implementation ECCV 2020 Xiaolin Zhang, Yunchao Wei, Yi Yang

We learn a feature center for each category and realize the global feature consistency by forcing the object features to approach class-specific centers.

Object

Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents

1 code implementation ECCV 2020 Ye Zhu, Yu Wu, Yi Yang, Yan Yan

With the arising concerns for the AI systems provided with direct access to abundant sensitive information, researchers seek to develop more reliable AI with implicit information sources.

Video Description

DONet: Dual Objective Networks for Skin Lesion Segmentation

no code implementations19 Aug 2020 Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang

Skin lesion segmentation is a crucial step in the computer-aided diagnosis of dermoscopic images.

Lesion Segmentation Segmentation +2

Memory-based Jitter: Improving Visual Recognition on Long-tailed Data with Diversity In Memory

no code implementations22 Aug 2020 Jialun Liu, Jingwei Zhang, Yi Yang, Wenhui Li, Chi Zhang, Yifan Sun

With slight modifications, MBJ is applicable for two fundamental visual recognition tasks, \emph{i. e.}, deep image classification and deep metric learning (on long-tailed data).

Data Augmentation General Classification +4

Point Adversarial Self Mining: A Simple Method for Facial Expression Recognition

no code implementations26 Aug 2020 Ping Liu, Yuewei Lin, Zibo Meng, Lu Lu, Weihong Deng, Joey Tianyi Zhou, Yi Yang

In this paper, we propose a simple yet effective approach, named Point Adversarial Self Mining (PASM), to improve the recognition accuracy in facial expression recognition.

Adversarial Attack Data Augmentation +4

Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization

1 code implementation26 Aug 2020 Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, Yi Yang

Existing methods usually concentrate on mining the fine-grained feature of the geographic target in the image center, but underestimate the contextual information in neighbor areas.

Drone navigation Drone-view target localization +2

Tasks Integrated Networks: Joint Detection and Retrieval for Image Search

no code implementations3 Sep 2020 Lei Zhang, Zhenwei He, Yi Yang, Liang Wang, Xinbo Gao

The traditional object retrieval task aims to learn a discriminative feature representation with intra-similarity and inter-dissimilarity, which supposes that the objects in an image are manually or automatically pre-cropped exactly.

Image Retrieval Philosophy +1

Cannot find the paper you are looking for? You can Submit a new open access paper.