no code implementations • 25 Nov 2022 • Zhao Zhou, Xiangcheng Du, Yingbin Zheng, Cheng Jin
We present the Aggregated Text TRansformer (ATTR), which is designed to represent texts in scene images with a multi-scale self-attention mechanism.
no code implementations • 23 Jul 2022 • Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Xingjiao Wu, Tianlong Ma, Cheng Jin
Scene text erasing seeks to erase text contents from scene images, and current state-of-the-art text erasing models are trained on large-scale synthetic data.
no code implementations • 27 Nov 2021 • Tianlong Ma, Xingjiao Wu, Xin Li, Xiangcheng Du, Zhao Zhou, Liang Xue, Cheng Jin
To measure the proposed image layer modeling method, we propose a manually-labeled non-Manhattan layout fine-grained segmentation dataset named FPD.
no code implementations • 23 Mar 2019 • Zhao Zhou, Hao Ye, Luhui Chen, Yingbin Zheng
Curve text or arbitrary shape text is very common in real-world scenarios.
no code implementations • 26 Jun 2018 • Zhao Wei, Chai Haixia, Wang Benyou, Ye Jianbo, Yang Min, Zhao Zhou, Chen Xiaojun
In the adversarial process, we train a generator as an agent of reinforcement learning which recommends the next movie to a user sequentially.