Search Results for author: Lei Cui

Found 46 papers, 18 papers with code

Bilingual Data Cleaning for SMT using Graph-based Random Walk

no code implementations • ACL 2013 • Lei Cui, Dong-dong Zhang, Shujie Liu, Mu Li, Ming Zhou

Machine Translation Word Alignment

Paper
Add Code

Multi-Domain Adaptation for SMT Using Multi-Task Learning

no code implementations • EMNLP 2013 • Lei Cui, Xilun Chen, Dong-dong Zhang, Shujie Liu, Mu Li, Ming Zhou

Domain Adaptation Machine Translation +2

Paper
Add Code

Learning Topic Representation for SMT with Neural Networks

no code implementations • ACL 2014 • Lei Cui, Dong-dong Zhang, Shujie Liu, Qiming Chen, Mu Li, Ming Zhou, Muyun Yang

Domain Adaptation Machine Translation

Paper
Add Code

Aligning Coordinated Text Streams through Burst Information Network Construction and Decipherment

no code implementations • 27 Sep 2016 • Tao Ge, Qing Dou, Xiaoman Pan, Heng Ji, Lei Cui, Baobao Chang, Zhifang Sui, Ming Zhou

We introduce a novel Burst Information Network (BINet) representation that can display the most important information and illustrate the connections among bursty entities, events and keywords in the corpus.

Decipherment Translation

Paper
Add Code

News Stream Summarization using Burst Information Networks

no code implementations • EMNLP 2016 • Tao Ge, Lei Cui, Baobao Chang, Sujian Li, Ming Zhou, Zhifang Sui

Document Summarization Multi-Document Summarization

Paper
Add Code

Event Detection with Burst Information Networks

no code implementations • COLING 2016 • Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui, Ming Zhou

Retrospective event detection is an important task for discovering previously unidentified events in a text stream.

Clustering Event Detection

Paper
Add Code

SuperAgent: A Customer Service Chatbot for E-commerce Websites

no code implementations • ACL 2017 • Lei Cui, Shaohan Huang, Furu Wei, Chuanqi Tan, Chaoqun Duan, Ming Zhou

Chatbot Opinion Mining +1

Paper
Add Code

EventWiki: A Knowledge Base of Major Events

no code implementations • LREC 2018 • Tao Ge, Lei Cui, Baobao Chang, Zhifang Sui, Furu Wei, Ming Zhou

Question Answering Semantic Parsing

Paper
Add Code

Neural Open Information Extraction

no code implementations • ACL 2018 • Lei Cui, Furu Wei, Ming Zhou

Conventional Open Information Extraction (Open IE) systems are usually built on hand-crafted patterns from other NLP tools such as syntactic parsing, yet they face problems of error propagation.

Computational Efficiency Open Information Extraction

Paper
Add Code

Retrieval-Enhanced Adversarial Training for Neural Response Generation

no code implementations • ACL 2019 • Qingfu Zhu, Lei Cui, Wei-Nan Zhang, Furu Wei, Ting Liu

Dialogue systems are usually built on either generation-based or retrieval-based approaches, yet they do not benefit from the advantages of different models.

Response Generation Retrieval

Paper
Add Code

Neural Melody Composition from Lyrics

no code implementations • 12 Sep 2018 • Hangbo Bao, Shaohan Huang, Furu Wei, Lei Cui, Yu Wu, Chuanqi Tan, Songhao Piao, Ming Zhou

In this paper, we study a novel task that learns to compose music from natural language.

Paper
Add Code

Unsupervised Machine Commenting with Neural Variational Topic Model

no code implementations • 13 Sep 2018 • Shuming Ma, Lei Cui, Furu Wei, Xu sun

To fully exploit the unpaired data, we completely remove the need for parallel data and propose a novel unsupervised approach to train an automatic article commenting model, relying on nothing but unpaired articles and comments.

Retrieval

Paper
Add Code

LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts

3 code implementations • 13 Sep 2018 • Shuming Ma, Lei Cui, Damai Dai, Furu Wei, Xu sun

We introduce the task of automatic live commenting.

Retrieval

124

Paper
Code

Fine-grained Coordinated Cross-lingual Text Stream Alignment for Endless Language Knowledge Acquisition

no code implementations • EMNLP 2018 • Tao Ge, Qing Dou, Heng Ji, Lei Cui, Baobao Chang, Zhifang Sui, Furu Wei, Ming Zhou

This paper proposes to study fine-grained coordinated cross-lingual text stream alignment through a novel information network decipherment paradigm.

Decipherment Information Retrieval

Paper
Add Code

TableBank: A Benchmark Dataset for Table Detection and Recognition

2 code implementations • LREC 2020 • Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou, Zhoujun Li

We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet.

Table Detection

966

Paper
Code

Inspecting Unification of Encoding and Matching with Transformer: A Case Study of Machine Reading Comprehension

no code implementations • WS 2019 • Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Lei Cui, Songhao Piao, Ming Zhou

Most machine reading comprehension (MRC) models separately handle encoding and matching with different network architectures.

Machine Reading Comprehension

Paper
Add Code

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

no code implementations • 17 Dec 2019 • Renchun You, Zhiyao Guo, Lei Cui, Xiang Long, Yingze Bao, Shilei Wen

In order to overcome these challenges, we propose to use cross-modality attention with semantic graph embedding for multi label classification.

Ranked #8 on Multi-Label Classification on NUS-WIDE

Classification General Classification +4

Paper
Add Code

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

15 code implementations • 31 Dec 2019 • Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou

In this paper, we propose the \textbf{LayoutLM} to jointly model interactions between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents.

Ranked #7 on Relation Extraction on FUNSD

Document AI Document Image Classification +3

124,527

Paper
Code

Multimodal Matching Transformer for Live Commenting

no code implementations • 7 Feb 2020 • Chaoqun Duan, Lei Cui, Shuming Ma, Furu Wei, Conghui Zhu, Tiejun Zhao

In this work, we aim to improve the relevance between live comments and videos by modeling the cross-modal interactions among different modalities.

Text Generation

Paper
Add Code

TableBank: Table Benchmark for Image-based Table Detection and Recognition

1 code implementation • LREC 2020 • Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou, Zhoujun Li

We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet.

Table Detection

966

Paper
Code

DocBank: A Benchmark Dataset for Document Layout Analysis

2 code implementations • COLING 2020 • Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou

DocBank is constructed using a simple yet effective way with weak supervision from the \LaTeX{} documents available on the arXiv. com.

Document Layout Analysis

514

Paper
Code

Unsupervised Fine-tuning for Text Clustering

no code implementations • COLING 2020 • Shaohan Huang, Furu Wei, Lei Cui, Xingxing Zhang, Ming Zhou

Fine-tuning with pre-trained language models (e. g. BERT) has achieved great success in many language understanding tasks in supervised settings (e. g. text classification).

Clustering text-classification +2

Paper
Add Code

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

5 code implementations • ACL 2021 • Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou

Pre-training of text and layout has proved effective in a variety of visually-rich document understanding tasks due to its effective model architecture and the advantage of large-scale unlabeled scanned/digital-born documents.

Ranked #1 on Key Information Extraction on SROIE

Document Image Classification Document Layout Analysis +6

124,527

Paper
Code

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

6 code implementations • 18 Apr 2021 • Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Furu Wei

In this paper, we present LayoutXLM, a multimodal pre-trained model for multilingual document understanding, which aims to bridge the language barriers for visually-rich document understanding.

Ranked #13 on Document Image Classification on RVL-CDIP

Document Image Classification document understanding

124,527

Paper
Code

AutoSampling: Search for Effective Data Sampling Schedules

no code implementations • 28 May 2021 • Ming Sun, Haoxuan Dou, Baopu Li, Lei Cui, Junjie Yan, Wanli Ouyang

Data sampling acts as a pivotal role in training deep learning models.

Image Classification

Paper
Add Code

VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization

1 code implementation • 10 Jun 2021 • Tengchao Lv, Lei Cui, Momcilo Vasilijevic, Furu Wei

Video transcript summarization is a fundamental task for video understanding.

Segmentation Text Summarization +1

Paper
Code

Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison

no code implementations • CVPR 2021 • Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen

More concretely, we employ a Local-Net and Global-Net to extract features from any individual patch and its surrounding respectively.

Anomaly Detection

Paper
Add Code

LayoutReader: Pre-training of Text and Layout for Reading Order Detection

1 code implementation • EMNLP 2021 • Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, Furu Wei

Reading order detection is the cornerstone to understanding visually-rich documents (e. g., receipts and forms).

Ranked #2 on Reading Order Detection on ReadingBank

Document Layout Analysis Optical Character Recognition (OCR) +1

18,274

Paper
Code

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

2 code implementations • 21 Sep 2021 • Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei

Text recognition is a long-standing research problem for document digitalization.

Ranked #3 on Handwritten Text Recognition on IAM

Handwritten Text Recognition Language Modelling +4

124,527

Paper
Code

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding

2 code implementations • 16 Oct 2021 • Junlong Li, Yiheng Xu, Lei Cui, Furu Wei

Multimodal pre-training with text, layout, and image has made significant progress for Visually Rich Document Understanding (VRDU), especially the fixed-layout documents such as scanned document images.

document understanding

124,527

Paper
Code

Document AI: Benchmarks, Models and Applications

no code implementations • 16 Nov 2021 • Lei Cui, Yiheng Xu, Tengchao Lv, Furu Wei

Document AI, or Document Intelligence, is a relatively new research topic that refers to the techniques for automatically reading, understanding, and analyzing business documents.

Document AI Document Image Classification +3

Paper
Add Code

Few-shot Object Counting with Similarity-Aware Feature Enhancement

1 code implementation • 22 Jan 2022 • Zhiyuan You, Kai Yang, Wenhan Luo, Xin Lu, Lei Cui, Xinyi Le

This work studies the problem of few-shot object counting, which counts the number of exemplar objects (i. e., described by one or several support images) occurring in the query image.

Ranked #2 on Object Counting on CARPK

Crowd Counting Object Counting

113

Paper
Code

DiT: Self-supervised Pre-training for Document Image Transformer

3 code implementations • 4 Mar 2022 • Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei

We leverage DiT as the backbone network in a variety of vision-based Document AI tasks, including document image classification, document layout analysis, table detection as well as text detection for OCR.

Ranked #1 on Table Detection on ICDAR 2019

Document AI Document Image Classification +4

124,527

Paper
Code

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

2 code implementations • 18 Apr 2022 • Yupan Huang, Tengchao Lv, Lei Cui, Yutong Lu, Furu Wei

In this paper, we propose \textbf{LayoutLMv3} to pre-train multimodal Transformers for Document AI with unified text and image masking.

Ranked #1 on Key Information Extraction on EPHOIE

Document AI Document Image Classification +10

124,527

Paper
Code

Invariant Content Synergistic Learning for Domain Generalization of Medical Image Segmentation

no code implementations • 5 May 2022 • Yuxin Kang, Hansheng Li, Xuan Zhao, Dongqing Hu, Feihong Liu, Lei Cui, Jun Feng, Lin Yang

In this paper, we propose a method, named Invariant Content Synergistic Learning (ICSL), to improve the generalization ability of DCNNs on unseen datasets by controlling the inductive bias.

Domain Generalization Image Segmentation +4

Paper
Add Code

A Unified Model for Multi-class Anomaly Detection

1 code implementation • 8 Jun 2022 • Zhiyuan You, Lei Cui, Yujun Shen, Kai Yang, Xin Lu, Yu Zheng, Xinyi Le

For example, when learning a unified model for 15 categories in MVTec-AD, we surpass the second competitor on the tasks of both anomaly detection (from 88. 1% to 96. 5%) and anomaly localization (from 89. 5% to 96. 8%).

Unsupervised Anomaly Detection

211

Paper
Code

ADTR: Anomaly Detection Transformer with Feature Reconstruction

no code implementations • 5 Sep 2022 • Zhiyuan You, Kai Yang, Wenhan Luo, Lei Cui, Yu Zheng, Xinyi Le

Second, CNN tends to reconstruct both normal samples and anomalies well, making them still hard to distinguish.

Anomaly Detection

Paper
Add Code

XDoc: Unified Pre-training for Cross-Format Document Understanding

1 code implementation • 6 Oct 2022 • Jingye Chen, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei

The surge of pre-training has witnessed the rapid development of document understanding recently.

Ranked #7 on Semantic entity labeling on FUNSD

document understanding Semantic entity labeling

18,279

Paper
Code

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization

no code implementations • 7 Oct 2022 • Lei Cui, Yangguang Li, Xin Lu, Dong An, Fenggang Liu

Bayesian Optimization (BO) is a common solution to search optimal hyperparameters based on sample observations of a machine learning model.

Bayesian Optimization Hyperparameter Optimization

Paper
Add Code

Language Is Not All You Need: Aligning Perception with Language Models

1 code implementation • NeurIPS 2023 • Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei

A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence.

Image Captioning Language Modelling +4

18,274

Paper
Code

TextDiffuser: Diffusion Models as Text Painters

no code implementations • NeurIPS 2023 • Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Diffusion models have gained increasing attention for their impressive generation abilities but currently struggle with rendering accurate and coherent text.

Optical Character Recognition (OCR)

Paper
Add Code

Kosmos-2.5: A Multimodal Literate Model

no code implementations • 20 Sep 2023 • Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

We present Kosmos-2. 5, a multimodal literate model for machine reading of text-intensive images.

Reading Comprehension Text Generation

Paper
Add Code

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

no code implementations • 28 Nov 2023 • Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

The diffusion model has been proven a powerful generative model in recent years, yet remains a challenge in generating visual text.

Language Modelling Large Language Model +1

Paper
Add Code

Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

1 code implementation • 4 Apr 2024 • Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei

Large language models (LLMs) have exhibited impressive performance in language comprehension and various reasoning tasks.

Visual Navigation

Paper
Code

MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding

no code implementations • ACL 2022 • Junlong Li, Yiheng Xu, Lei Cui, Furu Wei

document understanding

Paper
Add Code

XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding

no code implementations • Findings (ACL) 2022 • Yiheng Xu, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Furu Wei

Multimodal pre-training with text, layout, and image has achieved SOTA performance for visually rich document understanding tasks recently, which demonstrates the great potential for joint learning across different modalities.

document understanding

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.