Search Results for author: Haoyuan Peng

Found 3 papers, 0 papers with code

VKIE: The Application of Key Information Extraction on Video Text

no code implementations • 18 Oct 2023 • Siyu An, Ye Liu, Haoyuan Peng, Di Yin

Extracting structured information from videos is critical for numerous downstream applications in the industry.


OSAN: A One-Stage Alignment Network To Unify Multimodal Alignment and Unsupervised Domain Adaptation

no code implementations • CVPR 2023 • Ye Liu, Lingfeng Qiao, Changchong Lu, Di Yin, Chen Lin, Haoyuan Peng, Bo Ren

An intuitive way to handle these two problems is to fulfill these tasks in two separate stages: aligning modalities followed by domain adaptation, or vice versa.

Unsupervised Domain Adaptation


Grafting Pre-trained Models for Multimodal Headline Generation

no code implementations • 14 Nov 2022 • Lingfeng Qiao, Chen Wu, Ye Liu, Haoyuan Peng, Di Yin, Bo Ren

In this paper, we propose a novel approach to graft the video encoder from the pre-trained video-language model on the generative pre-trained language model.

Headline Generation, Language Modelling +1

