Search Results for author: Daiqing Wu

Found 5 papers, 1 papers with code

Char-SAM: Turning Segment Anything Model into Scene Text Segmentation Annotator with Character-level Visual Prompts

no code implementations27 Dec 2024 Enze Xie, Jiaho Lyu, Daiqing Wu, Huawen Shen, Yu Zhou

Specifically, leveraging some existing text detection datasets with word-level bounding box annotations, we first generate finer-grained character-level bounding box prompts using the Character Bounding-box Refinement CBR module.

Segmentation Text Detection +1

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

1 code implementation17 Dec 2024 Yan Zhang, Gangyan Zeng, Huawen Shen, Daiqing Wu, Yu Zhou, Can Ma

Video text-based visual question answering (Video TextVQA) is a practical task that aims to answer questions by jointly reasoning textual and visual information in a given video.

Language Modeling Language Modelling +4

Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition

no code implementations9 Jul 2024 Daiqing Wu, Dongbao Yang, Huawen Shen, Can Ma, Yu Zhou

In the semantics completion module, we complement image and text representations with the semantics of the OCR text embedded in the image, helping bridge the sentiment gap.

Contrastive Learning Optical Character Recognition (OCR)

Cannot find the paper you are looking for? You can Submit a new open access paper.