Search Results for author: Yaowei Li

Found 7 papers, 2 papers with code

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

1 code implementation • 30 Jan 2024 • Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou

To alleviate CF raised by covariate shift and lexical overlap, we further propose a novel approach that ensures the identical distribution of all token embeddings during initialization and regularizes token embedding learning during training.

Text Retrieval

Paper
Code

CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval

no code implementations • 18 Sep 2023 • Yating Liu, Yaowei Li, Zimo Liu, Wenming Yang, YaoWei Wang, Qingmin Liao

Text-based Person Retrieval (TPR) aims to retrieve the target person images given a textual query.

Person Retrieval Retrieval +3

Paper
Add Code

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

1 code implementation • ICCV 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou

Due to two annoying issues in video grounding: (1) the co-existence of some visual entities in both ground truth and other moments, \ie semantic overlapping; (2) only a few moments in the video are annotated, \ie sparse annotation dilemma, vanilla contrastive learning is unable to model the correlations between temporally distant moments and learned inconsistent video representations.

Contrastive Learning Video Grounding

Paper
Code

Efficient Multimodal Fusion via Interactive Prompting

no code implementations • CVPR 2023 • Yaowei Li, Ruijie Quan, Linchao Zhu, Yi Yang

Large-scale pre-training has brought unimodal fields such as computer vision and natural language processing to a new era.

Paper
Add Code

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

no code implementations • ICCV 2023 • Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Automatic radiology report generation has attracted enormous research interest due to its practical value in reducing the workload of radiologists.

Sentence

Paper
Add Code

Exploiting Auxiliary Caption for Video Grounding

no code implementations • 15 Jan 2023 • Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou

Video grounding aims to locate a moment of interest matching the given query sentence from an untrimmed video.

Contrastive Learning Dense Video Captioning +2

Paper
Add Code

SIAD: Self-supervised Image Anomaly Detection System

no code implementations • 8 Aug 2022 • Jiawei Li, Chenxi Lan, Xinyi Zhang, Bolin Jiang, Yuqiu Xie, Naiqi Li, Yan Liu, Yaowei Li, Enze Huo, Bin Chen

To make a step forward, this paper outlines an automatic annotation system called SsaA, working in a self-supervised learning manner, for continuously making the online visual inspection in the manufacturing automation scenarios.

Anomaly Detection Cloud Computing +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.