1 code implementation • 29 Jan 2024 • Jieru Lin, Danqing Huang, Tiejun Zhao, Dechen Zhan, Chin-Yew Lin
Furthermore, based on our observation that pixel space is more sensitive in capturing spatial patterns of graphic layouts (e. g., overlap, alignment), we propose a learning-based locator to detect erroneous tokens which takes the wireframe image rendered from the generated layout sequence as input.
no code implementations • 9 Dec 2022 • Yuxin Wang, Jieru Lin, Zhiwei Yu, Wei Hu, Börje F. Karlsson
Storytelling and narrative are fundamental to human experience, intertwined with our social and cultural engagement.