Search Results for author: Kai Ding

Found 13 papers, 6 papers with code

Datasets for Large Language Models: A Comprehensive Survey

1 code implementation • 28 Feb 2024 • Yang Liu, Jiahuan Cao, Chongyu Liu, Kai Ding, Lianwen Jin

Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains.

Language Modelling Large Language Model

509

Paper
Code

EXACT-Net:EHR-guided lung tumor auto-segmentation for non-small cell lung cancer radiotherapy

no code implementations • 21 Feb 2024 • Hamed Hooshangnejad, Xue Feng, Gaofeng Huang, Rui Zhang, Quan Chen, Kai Ding

Lung cancer is a devastating disease with the highest mortality rate among cancer types.

Computed Tomography (CT) Language Modelling +5

Paper
Add Code

UPOCR: Towards Unified Pixel-Level OCR Interface

no code implementations • 5 Dec 2023 • Dezhi Peng, Zhenhua Yang, Jiaxin Zhang, Chongyu Liu, Yongxin Shi, Kai Ding, Fengjun Guo, Lianwen Jin

Without bells and whistles, the experimental results showcase that the proposed method can simultaneously achieve state-of-the-art performance on three tasks with a unified single model, which provides valuable strategies and insights for future research on generalist OCR models.

Optical Character Recognition Optical Character Recognition (OCR) +2

Paper
Add Code

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

no code implementations • 9 Jun 2023 • Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Fengjun Guo, Kai Ding, Lianwen Jin

Furthermore, considering the importance of fine-grained elements in document images, we present a details recurrent refinement module to enhance the output in a high-resolution space.

Self-Supervised Learning

Paper
Add Code

M$^{6}$Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

no code implementations • 15 May 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin

Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.

Document Layout Analysis document understanding +3

Paper
Add Code

M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

1 code implementation • CVPR 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin

Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.

Document Layout Analysis document understanding +3

Paper
Code

Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

1 code implementation • 23 Jul 2022 • Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding

To address this issue, we propose a novel approach called Marior (Margin Removal and \Iterative Content Rectification).

Optical Character Recognition (OCR)

Paper
Code

Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

1 code implementation • 21 Jul 2022 • Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding

To address this issue, we propose a Contextual-guided Text Removal Network, termed as CTRNet.

Paper
Code

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

2 code implementations • CVPR 2022 • Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin

End-to-end scene text spotting has attracted great attention in recent years due to the success of excavating the intrinsic synergy of the scene text detection and recognition.

Ranked #3 on Text Spotting on Inverse-Text

Scene Text Detection Text Detection +1

252

Paper
Code

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

2 code implementations • ACL 2022 • Jiapeng Wang, Lianwen Jin, Kai Ding

LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models.

Ranked #5 on Key Information Extraction on CORD

Document Image Classification document understanding +2

124,457

Paper
Code

Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

no code implementations • 20 Jun 2021 • Jiapeng Wang, Tianwei Wang, Guozhi Tang, Lianwen Jin, Weihong Ma, Kai Ding, Yichao Huang

Visual information extraction (VIE) has attracted increasing attention in recent years.

Optical Character Recognition Optical Character Recognition (OCR) +2

Paper
Add Code

Fault Injectors for TensorFlow: Evaluation of the Impact of Random Hardware Faults on Deep CNNs

no code implementations • 13 Dec 2020 • Michael Beyer, Andrey Morozov, Emil Valiev, Christoph Schorn, Lydia Gauerhof, Kai Ding, Klaus Janschek

The results demonstrate how random bit flips in the output of particular mathematical operations and layers of NNs affect the classification accuracy.

Paper
Add Code

Agent Prioritization for Autonomous Navigation

no code implementations • 19 Sep 2019 • Khaled S. Refaat, Kai Ding, Natalia Ponomareva, Stéphane Ross

We propose a system to rank agents around an autonomous vehicle (AV) in real time.

Autonomous Navigation Decision Making +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.