Search Results for author: Bai-Ling Zhang

Found 5 papers, 0 papers with code

Attentive Prototype Few-shot Learning with Capsule Network-based Embedding

no code implementations • ECCV 2020 • Fang-Yu Wu, Jeremy S. Smith, Wenjin Lu, Chaoyi Pang, Bai-Ling Zhang

Few-shot learning, namely recognizing novel categories from a very small number of training examples, is a challenging area of machine learning research.

Classification Few-Shot Learning +1

Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization

no code implementations • 13 Nov 2018 • Shi-Yang Yan, Yuan Xie, Fang-Yu Wu, Jeremy S. Smith, Wenjin Lu, Bai-Ling Zhang

Automatically generating descriptions of an image, i.e., image captioning, is an important and fundamental topic in artificial intelligence that bridges the gap between computer vision and natural language processing.

Generative Adversarial Network Image Captioning +1

Hierarchical Multi-scale Attention Networks for Action Recognition

no code implementations • 25 Aug 2017 • Shi-Yang Yan, Jeremy S. Smith, Wenjin Lu, Bai-Ling Zhang

Visualizing what the networks have learnt shows that HM-AN captures both the attention regions of images and the hierarchical temporal structure.

Action Recognition Hard Attention +1

Traffic scene recognition based on deep CNN and VLAD spatial pyramids

no code implementations • 24 Jul 2017 • Fang-Yu Wu, Shi-Yang Yan, Jeremy S. Smith, Bai-Ling Zhang

In this paper, we attempted to solve the traffic scene recognition problem by combining the feature representation capabilities of a CNN with the VLAD encoding scheme.

Region Proposal Scene Classification +1

CHAM: action recognition using convolutional hierarchical attention model

no code implementations • 9 May 2017 • Shi-Yang Yan, Jeremy S. Smith, Wenjin Lu, Bai-Ling Zhang

This paper presents improvements to the soft attention model by combining a convolutional LSTM with a hierarchical system architecture to recognize action categories in videos.

Action Recognition Image Captioning +1
