Search Results for author: Guangwei Xu

Found 20 papers, 14 papers with code

Few-NERD: A Few-Shot Named Entity Recognition Dataset

7 code implementations • ACL 2021 • Ning Ding, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie, Hai-Tao Zheng, Zhiyuan Liu

In this paper, we present Few-NERD, a large-scale human-annotated few-shot NER dataset with a hierarchy of 8 coarse-grained and 66 fine-grained entity types.

Ranked #5 on Named Entity Recognition (NER) on Few-NERD (SUP)

Few-shot NER Named Entity Recognition

375

Paper
Code

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

1 code implementation • 7 Jun 2023 • Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang

In addition, to facilitate a comprehensive evaluation of video-language models, we carefully build the largest human-annotated Chinese benchmarks covering three popular video-language tasks of cross-modal retrieval, video captioning, and video category classification.

Cross-Modal Retrieval Language Modelling +3

254

Paper
Code

Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

1 code implementation • 7 Mar 2022 • Dingkun Long, Qiong Gao, Kuan Zou, Guangwei Xu, Pengjun Xie, Ruijie Guo, Jian Xu, Guanjun Jiang, Luxi Xing, Ping Yang

We find that the performance of retrieval models trained on dataset from general domain will inevitably decrease on specific domain.

Passage Retrieval Retrieval

150

Paper
Code

Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval

1 code implementation • 27 Oct 2022 • Dingkun Long, Yanzhao Zhang, Guangwei Xu, Pengjun Xie

Pre-trained language model (PTM) has been shown to yield powerful text representations for dense passage retrieval task.

Language Modelling Masked Language Modeling +2

150

Paper
Code

Parallel Instance Query Network for Named Entity Recognition

1 code implementation • ACL 2022 • Yongliang Shen, Xiaobin Wang, Zeqi Tan, Guangwei Xu, Pengjun Xie, Fei Huang, Weiming Lu, Yueting Zhuang

Each instance query predicts one entity, and by feeding all instance queries simultaneously, we can query all entities in parallel.

Ranked #1 on Nested Named Entity Recognition on GENIA

Chinese Named Entity Recognition named-entity-recognition +5

Paper
Code

Probing BERT in Hyperbolic Spaces

1 code implementation • ICLR 2021 • Boli Chen, Yao Fu, Guangwei Xu, Pengjun Xie, Chuanqi Tan, Mosha Chen, Liping Jing

We introduce a Poincare probe, a structural probe projecting these embeddings into a Poincare subspace with explicitly defined hierarchies.

Word Embeddings

Paper
Code

Prototypical Representation Learning for Relation Extraction

1 code implementation • ICLR 2021 • Ning Ding, Xiaobin Wang, Yao Fu, Guangwei Xu, Rui Wang, Pengjun Xie, Ying Shen, Fei Huang, Hai-Tao Zheng, Rui Zhang

This approach allows us to learn meaningful, interpretable prototypes for the final classification.

Few-Shot Learning Relation +3

Paper
Code

Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting

1 code implementation • NAACL 2022 • Linzhi Wu, Pengjun Xie, Jie zhou, Meishan Zhang, Chunping Ma, Guangwei Xu, Min Zhang

Prior research has mainly resorted to heuristic rule-based constraints to reduce the noise for specific self-augmentation methods individually.

named-entity-recognition Named Entity Recognition +1

Paper
Code

Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation

1 code implementation • ACL 2020 • Ning Ding, Dingkun Long, Guangwei Xu, Muhua Zhu, Pengjun Xie, Xiaobin Wang, Hai-Tao Zheng

In order to simultaneously alleviate these two issues, this paper proposes to couple distant annotation and adversarial training for cross-domain CWS.

Chinese Word Segmentation Sentence

Paper
Code

AISHELL-NER: Named Entity Recognition from Chinese Speech

1 code implementation • 17 Feb 2022 • Boli Chen, Guangwei Xu, Xiaobin Wang, Pengjun Xie, Meishan Zhang, Fei Huang

Named Entity Recognition (NER) from speech is among Spoken Language Understanding (SLU) tasks, aiming to extract semantic information from the speech signal.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Code

HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking

1 code implementation • 21 May 2022 • Yanzhao Zhang, Dingkun Long, Guangwei Xu, Pengjun Xie

Existing text retrieval systems with state-of-the-art performance usually adopt a retrieve-then-reranking architecture due to the high computational cost of pre-trained language models and the large corpus size.

Ranked #1 on Passage Re-Ranking on MS MARCO

Passage Ranking Passage Re-Ranking +2

Paper
Code

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition

1 code implementation • ACL 2021 • Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Pengjun Xie

Crowdsourcing is regarded as one prospective solution for effective supervised learning, aiming to build large-scale annotated training data by crowd workers.

Domain Adaptation named-entity-recognition +3

Paper
Code

A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging

1 code implementation • EMNLP 2021 • Peijie Jiang, Dingkun Long, Yueheng Sun, Meishan Zhang, Guangwei Xu, Pengjun Xie

Self-training is one promising solution for it, which struggles to construct a set of high-quality pseudo training instances for the target domain.

Domain Adaptation POS +3

Paper
Code

Identifying Chinese Opinion Expressions with Extremely-Noisy Crowdsourcing Annotations

1 code implementation • ACL 2022 • Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, Xiaobin Wang, Min Zhang

Recent works of opinion expression identification (OEI) rely heavily on the quality and scale of the manually-constructed training corpus, which could be extremely difficult to satisfy.

Paper
Code

A Hybrid System for Chinese Grammatical Error Diagnosis and Correction

no code implementations • WS 2018 • Chen Li, Junpei Zhou, Zuyi Bao, Hengyou Liu, Guangwei Xu, Linlin Li

In the correction stage, candidates were generated by the three GEC models and then merged to output the final corrections for M and S types.

Grammatical Error Correction TAG

Paper
Add Code

Alibaba at IJCNLP-2017 Task 1: Embedding Grammatical Features into LSTMs for Chinese Grammatical Error Diagnosis Task

no code implementations • IJCNLP 2017 • Yi Yang, Pengjun Xie, Jun Tao, Guangwei Xu, Linlin Li, Luo Si

This paper introduces Alibaba NLP team system on IJCNLP 2017 shared task No.

Ranked #1 on 2D Human Pose Estimation on Alibaba Cluster Trace (using extra training data)

2D Human Pose Estimation Position

Paper
Add Code

Hierarchy-Aware Global Model for Hierarchical Text Classification

no code implementations • ACL 2020 • Jie Zhou, Chunping Ma, Dingkun Long, Guangwei Xu, Ning Ding, Haoyu Zhang, Pengjun Xie, Gongshen Liu

Hierarchical text classification is an essential yet challenging subtask of multi-label text classification with a taxonomic hierarchy.

General Classification Multi Label Text Classification +2

Paper
Add Code

Keyphrase Extraction with Dynamic Graph Convolutional Networks and Diversified Inference

no code implementations • 24 Oct 2020 • Haoyu Zhang, Dingkun Long, Guangwei Xu, Pengjun Xie, Fei Huang, Ji Wang

Keyphrase extraction (KE) aims to summarize a set of phrases that accurately express a concept or a topic covered in a given document.

Keyphrase Extraction Representation Learning

Paper
Add Code

Prompt-Learning for Fine-Grained Entity Typing

no code implementations • 24 Aug 2021 • Ning Ding, Yulin Chen, Xu Han, Guangwei Xu, Pengjun Xie, Hai-Tao Zheng, Zhiyuan Liu, Juanzi Li, Hong-Gee Kim

In this work, we investigate the application of prompt-learning on fine-grained entity typing in fully supervised, few-shot and zero-shot scenarios.

Entity Typing Knowledge Probing +5

Paper
Add Code

Hybrid Retrieval and Multi-stage Text Ranking Solution at TREC 2022 Deep Learning Track

no code implementations • 23 Aug 2023 • Guangwei Xu, Yangzhao Zhang, Longhui Zhang, Dingkun Long, Pengjun Xie, Ruijie Guo

Large-scale text retrieval technology has been widely used in various practical business scenarios.

Document Ranking Language Modelling +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.