Search Results for author: Xiaotao Gu

Found 16 papers, 8 papers with code

Phrase-aware Unsupervised Constituency Parsing

no code implementations • ACL 2022 • Xiaotao Gu, Yikang Shen, Jiaming Shen, Jingbo Shang, Jiawei Han

Recent studies have achieved inspiring success in unsupervised grammar induction using masked language modeling (MLM) as the proxy task.

Constituency Parsing Language Modelling +1

Paper
Add Code

NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts

no code implementations • 7 May 2024 • Shudan Zhang, Hanlin Zhao, Xiao Liu, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Xiaohan Zhang, Yuxiao Dong, Jie Tang

To fill this gap, we propose NaturalCodeBench (NCB), a challenging code benchmark designed to mirror the complexity and variety of scenarios in real coding tasks.

Paper
Add Code

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

no code implementations • 8 Mar 2024 • Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tang

Recent advancements in text-to-image generative systems have been largely driven by diffusion models.

Computational Efficiency Super-Resolution +1

Paper
Add Code

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

1 code implementation • 26 Sep 2023 • Yuhui Xu, Lingxi Xie, Xiaotao Gu, Xin Chen, Heng Chang, Hengheng Zhang, Zhengsu Chen, Xiaopeng Zhang, Qi Tian

Recently years have witnessed a rapid development of large language models (LLMs).

Quantization

6,113

Paper
Code

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

no code implementations • 14 Jun 2023 • Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Kaifeng Bi, Xiaotao Gu, Jianlong Chang, Qi Tian

In this paper, we start with a conceptual definition of AGI and briefly review how NLP solves a wide range of tasks via a chat system.

Paper
Add Code

Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism

no code implementations • 22 Apr 2023 • Xin Chen, Hengheng Zhang, Xiaotao Gu, Kaifeng Bi, Lingxi Xie, Qi Tian

The Mixture of Experts (MoE) model becomes an important choice of large language models nowadays because of its scalability with sublinear computational complexity for training and inference.

Paper
Add Code

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

3 code implementations • 3 Nov 2022 • Kaifeng Bi, Lingxi Xie, Hengheng Zhang, Xin Chen, Xiaotao Gu, Qi Tian

In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast.

961

Paper
Code

UCPhrase: Unsupervised Context-aware Quality Phrase Tagging

2 code implementations • 28 May 2021 • Xiaotao Gu, Zihan Wang, Zhenyu Bi, Yu Meng, Liyuan Liu, Jiawei Han, Jingbo Shang

Training a conventional neural tagger based on silver labels usually faces the risk of overfitting phrase surface names.

Ranked #1 on Phrase Tagging on KPTimes

Keyphrase Extraction Language Modelling +3

165

Paper
Code

On the Transformer Growth for Progressive BERT Training

no code implementations • NAACL 2021 • Xiaotao Gu, Liyuan Liu, Hongkun Yu, Jing Li, Chen Chen, Jiawei Han

Due to the excessive cost of large-scale language model pre-training, considerable efforts have been made to train BERT progressively -- start from an inferior but low-cost model and gradually grow the model to increase the computational complexity.

Language Modelling

Paper
Add Code

Learning Collaborative Agents with Rule Guidance for Knowledge Graph Reasoning

1 code implementation • EMNLP 2020 • Deren Lei, Gangrong Jiang, Xiaotao Gu, Kexuan Sun, Yuning Mao, Xiang Ren

Walk-based models have shown their advantages in knowledge graph (KG) reasoning by achieving decent performance while providing interpretable decisions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Generating Representative Headlines for News Stories

2 code implementations • 26 Jan 2020 • Xiaotao Gu, Yuning Mao, Jiawei Han, Jialu Liu, Hongkun Yu, You Wu, Cong Yu, Daniel Finnie, Jiaqi Zhai, Nicholas Zukoski

In this work, we study the problem of generating representative headlines for news stories.

75,763

Paper
Code

Learning Dynamic Context Augmentation for Global Entity Linking

2 code implementations • IJCNLP 2019 • Xiyuan Yang, Xiaotao Gu, Sheng Lin, Siliang Tang, Yueting Zhuang, Fei Wu, Zhigang Chen, Guoping Hu, Xiang Ren

Despite of the recent success of collective entity linking (EL) methods, these "global" inference methods may yield sub-optimal results when the "all-mention coherence" assumption breaks, and often suffer from high computational cost at the inference stage, due to the complex search space.

Ranked #5 on Entity Disambiguation on AIDA-CoNLL

Entity Disambiguation Entity Linking +1

Paper
Code

Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering

no code implementations • NAACL 2019 • Bo Chen, Xiaotao Gu, Yu-Feng Hu, Siliang Tang, Guoping Hu, Yueting Zhuang, Xiang Ren

Recently, distant supervision has gained great success on Fine-grained Entity Typing (FET).

Clustering Entity Typing

Paper
Add Code

Learning Named Entity Tagger using Domain-Specific Dictionary

1 code implementation • EMNLP 2018 • Jingbo Shang, Liyuan Liu, Xiang Ren, Xiaotao Gu, Teng Ren, Jiawei Han

Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features.

named-entity-recognition Named Entity Recognition +1

482

Paper
Code

End-to-End Reinforcement Learning for Automatic Taxonomy Induction

1 code implementation • ACL 2018 • Yuning Mao, Xiang Ren, Jiaming Shen, Xiaotao Gu, Jiawei Han

We present a novel end-to-end reinforcement learning approach to automatic taxonomy induction from a set of terms.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Large-scale Validation of Counterfactual Learning Methods: A Test-Bed

no code implementations • 1 Dec 2016 • Damien Lefortier, Adith Swaminathan, Xiaotao Gu, Thorsten Joachims, Maarten de Rijke

The ability to perform effective off-policy learning would revolutionize the process of building better interactive systems, such as search engines and recommendation systems for e-commerce, computational advertising and news.

counterfactual Off-policy evaluation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.