Search Results for author: Wangli Hao

Found 2 papers, 0 papers with code

Integrating both Visual and Audio Cues for Enhanced Video Caption

no code implementations22 Nov 2017 Wangli Hao, Zhao-Xiang Zhang, He Guan, Guibo Zhu

Furthermore, we first propose a dynamic multimodal feature fusion framework to deal with the part modalities missing case.

Descriptive Sentence +1

CMCGAN: A Uniform Framework for Cross-Modal Visual-Audio Mutual Generation

no code implementations22 Nov 2017 Wangli Hao, Zhao-Xiang Zhang, He Guan

By recovering the missing modality from the existing one based on the common information shared between them and the prior information of the specific modality, great bonus will be gained for various vision tasks.

Audio Generation Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.