Search Results for author: Yonghui Wang

Found 7 papers, 4 papers with code

TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding

1 code implementation15 Apr 2024 Bozhi Luan, Hao Feng, Hong Chen, Yonghui Wang, Wengang Zhou, Houqiang Li

The image overview stage provides a comprehensive understanding of the global scene information, and the coarse localization stage approximates the image area containing the answer based on the question asked.

Question Answering Visual Question Answering (VQA)

Perceptual learning in contour detection transfer across changes in contour path and orientation

no code implementations18 Mar 2024 Yue Ding, Hongqiao Shi, Shuang Song, Yonghui Wang, Ya Li

The integration of local elements into shape contours is critical for target detection and identification in cluttered scenes.

Contour Detection Specificity

Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs

1 code implementation22 Nov 2023 Yonghui Wang, Wengang Zhou, Hao Feng, Keyi Zhou, Houqiang Li

Moreover, we curate a collection of text-rich images and prompt the text-only GPT-4 to generate 12K high-quality conversations, featuring textual locations within text-rich scenarios.

document understanding Instruction Following +3

Progressive Recurrent Network for Shadow Removal

no code implementations1 Nov 2023 Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, Houqiang Li

To handle this issue, we consider removing the shadow in a coarse-to-fine fashion and propose a simple but effective Progressive Recurrent Network (PRNet).

Image Shadow Removal Shadow Removal

Detect Any Shadow: Segment Anything for Video Shadow Detection

1 code implementation26 May 2023 Yonghui Wang, Wengang Zhou, Yunyao Mao, Houqiang Li

Segment anything model (SAM) has achieved great success in the field of natural image segmentation.

Image Segmentation Semantic Segmentation +1

UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior

1 code implementation15 Oct 2022 Yonghui Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li

To this end, we propose UDoc-GAN, the first framework to address the problem of document illumination correction under the unpaired setting.

Fish Detection Using Deep Learning

no code implementations Applied Computational Intelligence and Soft Computing 2020 Suxia Cui, Yu Zhou, Yonghui Wang, Lujun Zhai

An advanced system with more computing power can facilitate deep learning feature, which exploit many neural network algorithms to simulate human brains.

Data Augmentation Fish Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.