Search Results for author: Nghia Hieu Nguyen

Found 8 papers, 4 papers with code

ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images

1 code implementation29 Apr 2024 Huy Quang Pham, Thang Kien-Bao Nguyen, Quan Van Nguyen, Dan Quang Tran, Nghia Hieu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

To this end, we introduce a novel dataset, ViOCRVQA (Vietnamese Optical Character Recognition - Visual Question Answering dataset), consisting of 28, 000+ images and 120, 000+ question-answer pairs.

Optical Character Recognition Optical Character Recognition (OCR) +2

PAT: Parallel Attention Transformer for Visual Question Answering in Vietnamese

no code implementations17 Jul 2023 Nghia Hieu Nguyen, Kiet Van Nguyen

Based on these two novel modules, we introduce the Parallel Attention Transformer (PAT), achieving the best accuracy compared to all baselines on the benchmark ViVQA dataset and other SOTA methods including SAAA and MCAN.

Question Answering Vietnamese Visual Question Answering

UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese

no code implementations7 May 2023 Doanh C. Bui, Nghia Hieu Nguyen, Khang Nguyen

To contribute to the low-resources research community as in Vietnam, we introduce a novel image captioning dataset in Vietnamese, the Open-domain Vietnamese Image Captioning dataset (UIT-OpenViIC).

Vietnamese Image Captioning Vietnamese Multimodal Learning

EVJVQA Challenge: Multilingual Visual Question Answering

no code implementations23 Feb 2023 Ngan Luu-Thuy Nguyen, Nghia Hieu Nguyen, Duong T. D Vo, Khanh Quoc Tran, Kiet Van Nguyen

Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers.

Language Modelling Question Answering +2

UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for Evaluating Unconstrained Handwriting Image Recognition in Vietnamese

1 code implementation10 Nov 2022 Nghia Hieu Nguyen, Duong T. D. Vo, Kiet Van Nguyen

Recognizing handwriting images is challenging due to the vast variation in writing style across many people and distinct linguistic aspects of writing languages.

Handwriting Recognition

VieCap4H-VLSP 2021: ObjectAoA-Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning

no code implementations10 Nov 2022 Nghia Hieu Nguyen, Duong T. D. Vo, Minh-Quan Ha

Image captioning is currently a challenging task that requires the ability to both understand visual information and use human language to describe this visual information in the image.

Vietnamese Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.