Search Results for author: Haoyuan Peng

Found 3 papers, 0 papers with code

VKIE: The Application of Key Information Extraction on Video Text

no code implementations18 Oct 2023 Siyu An, Ye Liu, Haoyuan Peng, Di Yin

Extracting structured information from videos is critical for numerous downstream applications in the industry.

Key Information Extraction

OSAN: A One-Stage Alignment Network To Unify Multimodal Alignment and Unsupervised Domain Adaptation

no code implementations CVPR 2023 Ye Liu, Lingfeng Qiao, Changchong Lu, Di Yin, Chen Lin, Haoyuan Peng, Bo Ren

An intuitive way to handle these two problems is to fulfill these tasks in two separate stages: aligning modalities followed by domain adaptation, or vice versa.

Unsupervised Domain Adaptation

Grafting Pre-trained Models for Multimodal Headline Generation

no code implementations14 Nov 2022 Lingfeng Qiao, Chen Wu, Ye Liu, Haoyuan Peng, Di Yin, Bo Ren

In this paper, we propose a novel approach to graft the video encoder from the pre-trained video-language model on the generative pre-trained language model.

Headline Generation Language Modelling +1

Cannot find the paper you are looking for? You can Submit a new open access paper.