1 code implementation • 20 Sep 2023 • Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He
To alleviate these issues, we draw inspiration from the recent Retentive Network (RetNet) in the field of NLP, and propose RMT, a strong vision backbone with explicit spatial prior for general purposes.
no code implementations • 5 Jun 2023 • Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, MingYu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai
It is hoped that this competition will attract many researchers in the field of CV and NLP, and bring some new thoughts to the field of Document AI.
no code implementations • 24 Apr 2023 • Wenwen Yu, MingYu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai
To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (ReST), which included two tasks: seal title text detection (Task 1) and end-to-end seal title recognition (Task 2).
1 code implementation • 27 Feb 2023 • Ruihang Miao, Weizhou Liu, Mingrui Chen, Zheng Gong, Weixin Xu, Chen Hu, Shuchang Zhou
3D Semantic Scene Completion (SSC) can provide dense geometric and semantic scene representations, which can be applied in the field of autonomous driving and robotic systems.
Ranked #13 on 3D Semantic Scene Completion on SemanticKITTI
no code implementations • ICCV 2023 • Miao Fan, Mingrui Chen, Chen Hu, Shuchang Zhou
Image matching is a fundamental and critical task in various visual applications, such as Simultaneous Localization and Mapping (SLAM) and image retrieval, which require accurate pose estimation.
no code implementations • 31 Mar 2022 • Weizhi Lu, Mingrui Chen, Kai Guo, Weiyu Li
Furthermore, this quantization property could be maintained in the random projections of sparse features, if both the features and random projection matrices are sufficiently sparse.
no code implementations • 20 Oct 2021 • Weizhi Lu, Mingrui Chen, Kai Guo, Weiyu Li
In the letter, we show that target propagation could be achieved by modeling the network s each layer with compressed sensing, without the need of auxiliary networks.
no code implementations • 16 Jul 2021 • Mingrui Chen, Weiyu Li, Weizhi Lu
Recently, it has been observed that {0, 1,-1}-ternary codes which are simply generated from deep features by hard thresholding, tend to outperform {-1, 1}-binary codes in image retrieval.