no code implementations • 29 Apr 2024 • Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
To this end, we introduce a novel dataset, ViOCRVQA (Vietnamese Optical Character Recognition - Visual Question Answering dataset), consisting of 28,000+ images and 120,000+ question-answer pairs.
Optical Character Recognition • Optical Character Recognition (OCR) • +2
1 code implementation • 16 Apr 2024 • Quan Van Nguyen, Dan Quang Tran, Huy Quang Pham, Thang Kien-Bao Nguyen, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
Visual Question Answering (VQA) is a complicated task that requires the capability of simultaneously processing natural language and images.
Multimodal Deep Learning • Optical Character Recognition (OCR) • +5
1 code implementation • BMVC 2023 • Quan Van Nguyen, Mai Nguyen, Thanh Tung Nguyen, Huy Trịnh Quang, Toan Pham Van
The proposed model combines the strengths of Transformers and CNNs along with Laplacian images to overcome the limitations of previous models.
no code implementations • 10 Oct 2022 • Mai Nguyen, Tung Thanh Bui, Quan Van Nguyen, Thanh Tung Nguyen, Toan Van Pham
Polyp segmentation remains a difficult problem because of the wide variety of polyp shapes and of scanning and labeling modalities.
1 code implementation • 29 Sep 2022 • Toan Pham Van, Linh Bao Doan, Thanh Tung Nguyen, Duc Trung Tran, Quan Van Nguyen, Dinh Viet Sang
In this work, we present a new pseudo labeling strategy that enhances the quality of pseudo labels used for training student networks.