1 code implementation • 8 Oct 2023 • Zixiao Wang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Boqiang Zhang, Yongdong Zhang
In this paper, we explore the potential of the Contrastive Language-Image Pretraining (CLIP) model in scene text recognition (STR), and establish a novel Symmetrical Linguistic Feature Distillation framework (named CLIP-OCR) to leverage both visual and linguistic knowledge in CLIP.
no code implementations • 6 Jul 2023 • Zijun Yao, Yuanyong Chen, Xin Lv, Shulin Cao, Amy Xin, Jifan Yu, Hailong Jin, Jianjun Xu, Peng Zhang, Lei Hou, Juanzi Li
We present the Visual Knowledge oriented Programming platform (VisKoP), a knowledge base question answering (KBQA) system that integrates humans into the loop to edit and debug knowledge base (KB) queries.
no code implementations • 18 May 2023 • Xinyu Li, Jianjun Xu, Wenquan Cui, Haoyang Cheng
Considering the case where the response variable is a categorical variable and the predictor is a random function, two novel functional sufficient dimensional reduction (FSDR) methods are proposed based on mutual information and square loss mutual information.
1 code implementation • 9 May 2023 • Boqiang Zhang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Yongdong Zhang
Vision models have gained increasing attention due to their simplicity and efficiency in the Scene Text Recognition (STR) task.
no code implementations • 28 Feb 2023 • Yaqian Xu, Wenquan Cui, Jianjun Xu, Haoyang Cheng
The most recent multi-source covariate shift algorithm is an efficient hyperparameter optimization algorithm for missing target output.
no code implementations • 23 Jan 2023 • Wenquan Cui, Yue Zhao, Jianjun Xu, Haoyang Cheng
Online dimension reduction is a common method for high-dimensional streaming data processing.
no code implementations • 23 Jan 2023 • Wenquan Cui, Yue Zhao, Jianjun Xu, Haoyang Cheng
Federated learning has become a popular tool in the big data era.
1 code implementation • 5 Dec 2022 • Yadong Qu, Qingfeng Tan, Hongtao Xie, Jianjun Xu, Yuxin Wang, Yongdong Zhang
Moreover, two new datasets (Tamper-Syn2k and Tamper-Scene) are proposed to fill the gap in public evaluation datasets.
1 code implementation • ACL 2019 • Haoyu Zhang, Jingjing Cai, Jianjun Xu, Ji Wang
We conduct experiments on COMPLEXWEBQUESTIONS, a large-scale complex question semantic parsing dataset; the results show that our model achieves significant improvement over state-of-the-art methods.
Ranked #1 on Semantic Parsing on complexWebQuestions-V1.0
4 code implementations • CONLL 2019 • Haoyu Zhang, Jianjun Xu, Ji Wang
The decoder in our model has two stages. In the first stage, we use a Transformer-based decoder to generate a draft output sequence.
Ranked #30 on Abstractive Text Summarization on CNN / Daily Mail