Search Results for author: Danyang Hou

Found 5 papers, 1 paper with code

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training

2 code implementations • 11 Mar 2021 • Yuqi Huo, Manli Zhang, Guangzhen Liu, Haoyu Lu, Yizhao Gao, Guoxing Yang, Jingyuan Wen, Heng Zhang, Baogui Xu, Weihao Zheng, Zongzheng Xi, Yueqian Yang, Anwen Hu, Jinming Zhao, Ruichen Li, Yida Zhao, Liang Zhang, Yuqing Song, Xin Hong, Wanqing Cui, Danyang Hou, Yingyan Li, Junyi Li, Peiyu Liu, Zheng Gong, Chuhao Jin, Yuchong Sun, ShiZhe Chen, Zhiwu Lu, Zhicheng Dou, Qin Jin, Yanyan Lan, Wayne Xin Zhao, Ruihua Song, Ji-Rong Wen

We further construct a large Chinese multi-source image-text dataset called RUC-CAS-WenLan for pre-training our BriVL model.

Ranked #1 on Image Retrieval on RUC-CAS-WenLan

Contrastive Learning • Image Captioning • +2

Multi-video Moment Ranking with Multimodal Clue

no code implementations • 29 Jan 2023 • Danyang Hou, Liang Pang, Yanyan Lan, HuaWei Shen, Xueqi Cheng

In this paper, we focus on two problems of the two-stage method: (1) Moment prediction bias: the predicted moments for most queries come from the top retrieved videos, ignoring the possibility that the target moment is in the bottom retrieved videos; this is caused by the inconsistency of Shared Normalization between training and inference.

Moment Retrieval • Retrieval • +1

AI-Generated Images Introduce Invisible Relevance Bias to Text-Image Retrieval

no code implementations • 23 Nov 2023 • Shicheng Xu, Danyang Hou, Liang Pang, Jingcheng Deng, Jun Xu, HuaWei Shen, Xueqi Cheng

Furthermore, our subsequent exploration reveals that the inclusion of AI-generated images in the training data of the retrieval models exacerbates the invisible relevance bias.

Cross-Modal Retrieval • Image Retrieval • +2

Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement

no code implementations • 21 Feb 2024 • Danyang Hou, Liang Pang, HuaWei Shen, Xueqi Cheng

The relevance between the video and query is partial, mainly evident in two aspects: (1) Scope: The untrimmed video contains information-rich frames, and not all are relevant to the query.

Moment Retrieval • Retrieval • +2

Event-aware Video Corpus Moment Retrieval

no code implementations • 21 Feb 2024 • Danyang Hou, Liang Pang, HuaWei Shen, Xueqi Cheng

Video Corpus Moment Retrieval (VCMR) is a practical video retrieval task focused on identifying a specific moment within a vast corpus of untrimmed videos using a natural language query.

Contrastive Learning • Moment Retrieval • +4
