1 code implementation • 25 Dec 2023 • Shi Guo, jianqi ma, Xi Yang, Zhengqiang Zhang, Lei Zhang
Extensive experiments demonstrate the leading VJDD performance of our method in term of restoration accuracy, perceptual quality and temporal consistency.
1 code implementation • 1 Dec 2023 • Xi Yang, Chenhang He, jianqi ma, Lei Zhang
To ensure the content consistency among adjacent frames, we exploit the temporal dynamics in LR videos to guide the diffusion process by optimizing the latent sampling path with a motion-guided loss, ensuring that the generated HR video maintains a coherent and continuous visual flow.
1 code implementation • ICCV 2023 • jianqi ma, Zhetong Liang, Wangmeng Xiang, Xi Yang, Lei Zhang
Scene Text Image Super-resolution (STISR) aims to recover high-resolution (HR) scene text images with visually pleasant and readable text content from the given low-resolution (LR) input.
no code implementations • 27 Feb 2023 • Shi Guo, Hongwei Yong, Xindong Zhang, jianqi ma, Lei Zhang
In this paper, we propose the spatial-frequency attention network (SFANet) to enhance the network's ability in exploiting long-range dependency.
1 code implementation • CVPR 2022 • Shi Guo, Xi Yang, jianqi ma, Gaofeng Ren, Lei Zhang
Denoising and demosaicking are two essential steps to reconstruct a clean full-color image from the raw data.
1 code implementation • CVPR 2022 • jianqi ma, Zhetong Liang, Lei Zhang
The semantics of the text are firstly extracted by a text recognition module as text prior information.
no code implementations • CVPR 2022 • Xixi Xu, Zhongang Qi, jianqi ma, Honglun Zhang, Ying Shan, XiaoHu Qie
Current researches mainly focus on only English characters and digits, while few work studies Chinese characters due to the lack of public large-scale and high-quality Chinese datasets, which limits the practical application scenarios of text segmentation.
1 code implementation • 30 Dec 2021 • Haiyang Yu, Jingye Chen, Bin Li, jianqi ma, Mengnan Guan, Xixi Xu, Xiaocong Wang, Shaobo Qu, xiangyang xue
The experimental results indicate that the performance of baselines on CTR datasets is not as good as that on English datasets due to the characteristics of Chinese texts that are quite different from the Latin alphabet.
1 code implementation • 13 Dec 2021 • Jingye Chen, Haiyang Yu, jianqi ma, Bin Li, xiangyang xue
However, the recognition of low-resolution scene text images remains a challenge.
1 code implementation • 29 Jun 2021 • jianqi ma, Shi Guo, Lei Zhang
Our experiments on the benchmark TextZoom dataset show that TPGSR can not only effectively improve the visual quality of scene text images, but also significantly improve the text recognition accuracy over existing STISR methods.
1 code implementation • 28 Sep 2020 • Jianqi Ma
In inference stage, the detection branch outputs the proposal refinement and the recognition branch predicts the transcript of the refined text region.
no code implementations • 17 Apr 2019 • Yanze Wu, Qiang Sun, Jianqi Ma, Bin Li, Yanwei Fu, Yao Peng, xiangyang xue
Particularly, The QGMRN is composed of visual, textual and routing network.
no code implementations • ICLR 2018 • jianqi ma, Hangyu Lin, yinda zhang, Yanwei Fu, xiangyang xue
Besides directly augmenting image features, we transform the image features to semantic space using the encoder and perform the data augmentation.
4 code implementations • 3 Mar 2017 • Jianqi Ma, Weiyuan Shao, Hao Ye, Li Wang, Hong Wang, Yingbin Zheng, xiangyang xue
This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images.