1 code implementation • 30 Apr 2024 • Yoonsik Kim, Moonbin Yim, Ka Yeon Song
QA pairs are generated by exploiting the large language model (LLM) where the input is a text-formatted table.
1 code implementation • 7 Nov 2022 • Donghyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim, Geewook Kim
In recent years, research on visual document understanding (VDU) has grown significantly, with a particular emphasis on the development of self-supervised learning methods.
document understanding
Optical Character Recognition (OCR)
+1
5 code implementations • 30 Nov 2021 • Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park
Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs.
Ranked #7 on
Key-value Pair Extraction
on SIBR
no code implementations • 23 Jul 2021 • Junyeop Lee, Yoonsik Kim, Seonghyeon Kim, Moonbin Yim, Seung Shin, Gayoung Lee, Sungrae Park
Scene text editing (STE), which converts a text in a scene image into the desired text while preserving an original style, is a challenging task due to a complex intervention between text and style.
1 code implementation • 20 Jul 2021 • Moonbin Yim, Yoonsik Kim, Han-Cheol Cho, Sungrae Park
For successful scene text recognition (STR) models, synthetic text image generators have alleviated the lack of annotated text images from the real world.