Search Results for author: Yuechen Wang

Found 10 papers, 3 papers with code

Progressive Multi-modal Conditional Prompt Tuning

no code implementations18 Apr 2024 Xiaoyu Qiu, Hao Feng, Yuechen Wang, Wengang Zhou, Houqiang Li

Initialization is responsible for encoding image and text using a VLM, followed by a feature filter that selects text features similar to image.

Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator

no code implementations19 Mar 2024 Xiaoyu Qiu, Yuechen Wang, Jiaxin Shi, Wengang Zhou, Houqiang Li

To efficiently transfer soft prompt, we propose a novel framework, Multilingual Prompt Translator (MPT), where a multilingual prompt translator is introduced to properly process crucial knowledge embedded in prompt by changing language knowledge while retaining task knowledge.

Cross-Lingual Transfer Natural Language Inference

Text-Only Training for Visual Storytelling

no code implementations17 Aug 2023 Yuechen Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li

Visual storytelling aims to generate a narrative based on a sequence of images, necessitating both vision-language alignment and coherent story generation.

Informativeness Visual Storytelling

Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding

no code implementations Findings (EMNLP) 2021 Yuechen Wang, Wengang Zhou, Houqiang Li

In this work, we propose a novel candidate-free framework: Fine-grained Semantic Alignment Network (FSAN), for weakly supervised TLG.

Sentence

Geometric Representation Learning for Document Image Rectification

2 code implementations15 Oct 2022 Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li

In document image rectification, there exist rich geometric constraints between the distorted image and the ground truth one.

Representation Learning

DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction

2 code implementations25 Oct 2021 Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li

Specifically, DocTr consists of a geometric unwarping transformer and an illumination correction transformer.

Optical Character Recognition (OCR)

SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition

no code implementations ICCV 2021 Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, Houqiang Li

To validate the effectiveness of our method on SLR, we perform extensive experiments on four public benchmark datasets, i. e., NMFs-CSL, SLR500, MSASL and WLASL.

 Ranked #1 on Sign Language Recognition on WLASL100 (using extra training data)

Self-Supervised Learning Sign Language Recognition

Weakly Supervised Temporal Adjacent Network for Language Grounding

1 code implementation30 Jun 2021 Yuechen Wang, Jiajun Deng, Wengang Zhou, Houqiang Li

To this end, we introduce a novel weakly supervised temporal adjacent network (WSTAN) for temporal language grounding.

Multiple Instance Learning Sentence

Pre-training Text Representations as Meta Learning

no code implementations12 Apr 2020 Shangwen Lv, Yuechen Wang, Daya Guo, Duyu Tang, Nan Duan, Fuqing Zhu, Ming Gong, Linjun Shou, Ryan Ma, Daxin Jiang, Guihong Cao, Ming Zhou, Songlin Hu

In this work, we introduce a learning algorithm which directly optimizes model's ability to learn text representations for effective learning of downstream tasks.

Language Modelling Meta-Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.