Search Results for author: Hongzhi Li

Found 6 papers, 1 papers with code

Multi-modal Deep Analysis for Multimedia

no code implementations11 Oct 2019 Wenwu Zhu, Xin Wang, Hongzhi Li

To address the two scientific problems, we investigate them from the following aspects: 1) multi-modal correlational representation: multi-modal fusion of data across different modalities, and 2) multi-modal data and knowledge fusion: multi-modal fusion of data with domain knowledge.

Question Answering Transfer Learning +2

Rethinking Classification and Localization for Object Detection

2 code implementations CVPR 2020 Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li, Yun Fu

Two head structures (i. e. fully connected head and convolution head) have been widely used in R-CNN based detectors for classification and localization tasks.

Classification General Classification +3

PatternNet: Visual Pattern Mining with Deep Neural Network

no code implementations18 Mar 2017 Hongzhi Li, Joseph G. Ellis, Lei Zhang, Shih-Fu Chang

In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering these patterns that are both discriminative and representative.

Image Classification

Event Specific Multimodal Pattern Mining with Image-Caption Pairs

no code implementations31 Dec 2015 Hongzhi Li, Joseph G. Ellis, Shih-Fu Chang

In this paper we describe a novel framework and algorithms for discovering image patch patterns from a large corpus of weakly supervised image-caption pairs generated from news events.

Descriptive Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.