1 code implementation • 11 Dec 2023 • Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze, Preslav Nakov, Tim Baldwin, Eric P. Xing
The recent surge in open-source Large Language Models (LLMs), such as LLaMA, Falcon, and Mistral, provides diverse options for AI practitioners and researchers.
no code implementations • 16 Nov 2023 • Yuxin Pei, Pushkar Bhuse, Zhengzhong Liu, Eric Xing
We argue that the direct-adoption methods do not account for structures in NLP tasks.
no code implementations • 19 Sep 2023 • Zhiqiang Shen, Tianhua Tao, Liqun Ma, Willie Neiswanger, Zhengzhong Liu, Hongyi Wang, Bowen Tan, Joel Hestness, Natalia Vassilieva, Daria Soboleva, Eric Xing
This paper aims to understand the impacts of various data combinations (e.g., web text, Wikipedia, GitHub, books) on the training of large language models using SlimPajama.
no code implementations • 30 Aug 2023 • Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Satheesh Katipomu, Haonan Li, Fajri Koto, William Marshall, Gurpreet Gosal, Cynthia Liu, Zhiming Chen, Osama Mohammed Afzal, Samta Kamboj, Onkar Pandit, Rahul Pal, Lalit Pradhan, Zain Muhammad Mujahid, Massa Baali, Xudong Han, Sondos Mahmoud Bsharat, Alham Fikri Aji, Zhiqiang Shen, Zhengzhong Liu, Natalia Vassilieva, Joel Hestness, Andy Hock, Andrew Feldman, Jonathan Lee, Andrew Jackson, Hector Xuguang Ren, Preslav Nakov, Timothy Baldwin, Eric Xing
We release two open versions of the model -- the foundation Jais model, and an instruction-tuned Jais-chat variant -- with the aim of promoting research on Arabic LLMs.
1 code implementation • 9 Oct 2022 • Jiannan Xiang, Zhengzhong Liu, Yucheng Zhou, Eric P. Xing, Zhiting Hu
In the data disambiguation stage, we employ the prompted GPT-3 model to understand possibly ambiguous triples from the input data and convert each into a short sentence with reduced ambiguity.
no code implementations • 29 Sep 2021 • Han Guo, Bowen Tan, Zhengzhong Liu, Eric Xing, Zhiting Hu
We apply the approach to a wide range of text generation tasks, including learning from noisy/negative examples, adversarial attacks, and prompt generation.
1 code implementation • EMNLP 2021 • Mingkai Deng, Bowen Tan, Zhengzhong Liu, Eric P. Xing, Zhiting Hu
Based on the nature of information change from input to output, we classify NLG tasks into compression (e.g., summarization), transduction (e.g., text rewriting), and creation (e.g., dialog).
1 code implementation • CoNLL (EMNLP) 2021 • Adithya Pratapa, Zhengzhong Liu, Kimihiro Hasegawa, Linwei Li, Yukari Yamakawa, Shikun Zhang, Teruko Mitamura
To this end, we design a new annotation workflow with careful quality control and an easy-to-use annotation interface.
1 code implementation • 14 Jun 2021 • Han Guo, Bowen Tan, Zhengzhong Liu, Eric P. Xing, Zhiting Hu
We apply the approach to a wide range of novel text generation tasks, including learning from noisy/negative examples, adversarial attacks, and prompt generation.
1 code implementation • EMNLP 2020 • Zhengzhong Liu, Guanxiong Ding, Avinash Bukkittu, Mansi Gupta, Pengzhi Gao, Atif Ahmed, Shikun Zhang, Xin Gao, Swapnil Singhavi, Linwei Li, Wei Wei, Zecong Hu, Haoran Shi, Haoying Zhang, Xiaodan Liang, Teruko Mitamura, Eric P. Xing, Zhiting Hu
Empirical natural language processing (NLP) systems in application domains (e.g., healthcare, finance, education) involve interoperation among multiple components, ranging from data ingestion and human annotation to text retrieval, analysis, generation, and visualization.
no code implementations • ACL 2020 • Zhisong Zhang, Xiang Kong, Zhengzhong Liu, Xuezhe Ma, Eduard Hovy
Detecting implicit arguments remains a challenge, calling for future work on document-level modeling for this task.
4 code implementations • ACL 2019 • Zhiting Hu, Haoran Shi, Bowen Tan, Wentao Wang, Zichao Yang, Tiancheng Zhao, Junxian He, Lianhui Qin, Di Wang, Xuezhe Ma, Zhengzhong Liu, Xiaodan Liang, Wangrong Zhu, Devendra Singh Sachan, Eric P. Xing
The versatile toolkit also fosters technique sharing across different text generation tasks.
1 code implementation • EMNLP 2018 • Zhengzhong Liu, Chenyan Xiong, Teruko Mitamura, Eduard Hovy
Our analyses demonstrate that our neural model captures interesting connections between salience and discourse unit relations (e.g., scripts and frame structures).
no code implementations • COLING 2018 • Zhengzhong Liu, Teruko Mitamura, Eduard Hovy
In this paper, we study two types of relation: Event Coreference and Event Sequencing.
no code implementations • WS 2018 • Zhiting Hu, Zichao Yang, Tiancheng Zhao, Haoran Shi, Junxian He, Di Wang, Xuezhe Ma, Zhengzhong Liu, Xiaodan Liang, Lianhui Qin, Devendra Singh Chaplot, Bowen Tan, Xingjiang Yu, Eric Xing
The features make Texar particularly suitable for technique sharing and generalization across different text generation applications.
no code implementations • 13 Jun 2018 • Zhengzhong Liu, Teruko Mitamura, Eduard Hovy
In this paper, we study two types of relation: Event Coreference and Event Sequencing.
no code implementations • 3 May 2018 • Chenyan Xiong, Zhengzhong Liu, Jamie Callan, Tie-Yan Liu
The salience model also improves ad hoc search accuracy, providing effective ranking features by modeling the salience of query entities in candidate documents.
2 code implementations • ACL 2016 • Zhiting Hu, Xuezhe Ma, Zhengzhong Liu, Eduard Hovy, Eric Xing
Combining deep neural networks with structured logic rules is desirable to harness the flexibility and reduce the uninterpretability of neural models.
Ranked #65 on Sentiment Analysis on SST-2 (Binary Classification)
no code implementations • NAACL 2016 • Xuezhe Ma, Zhengzhong Liu, Eduard Hovy
Coreference resolution is one of the first stages in deep language understanding and its importance has been well recognized in the natural language processing community.
no code implementations • LREC 2014 • Zhengzhong Liu, Jun Araki, Eduard Hovy, Teruko Mitamura
Event coreference is an important task for full text analysis.
no code implementations • LREC 2014 • Jun Araki, Zhengzhong Liu, Eduard Hovy, Teruko Mitamura
First, we introduce a multiclass logistic regression model that can detect subevent relations in addition to full coreference.