Search Results for author: Dongxing Mao

Found 6 papers, 4 papers with code

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

1 code implementation • 11 Feb 2025 • Alex Jinpeng Wang, Dongxing Mao, Jiawei Zhang, Weiming Han, Zhuobai Dong, Linjie Li, Yiqi Lin, Zhengyuan Yang, Libo Qin, Fuwei Zhang, Lijuan Wang, Min Li

Text-conditioned image generation has gained significant attention in recent years and is processing increasingly long and comprehensive text prompts.

Image Generation

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation

1 code implementation • 20 Dec 2023 • Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou

Graphical User Interface (GUI) automation holds significant promise for assisting users with complex tasks, thereby boosting human productivity.

Language Modelling, Large Language Model

AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant

4 code implementations • 8 Mar 2022 • Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou

In this paper, we define a new task called Affordance-centric Question-driven Task Completion, where the AI assistant should learn from instructional videos to provide step-by-step help in the user's view.

Visual Question Answering (VQA)
