Search Results for author: Yekun Chai

Found 19 papers, 6 papers with code

Counter-Contrastive Learning for Language GANs

no code implementations Findings (EMNLP) 2021 Yekun Chai, Haidong Zhang, Qiyue Yin, Junge Zhang

Generative Adversarial Networks (GANs) have achieved great success in image synthesis, but have proven to be difficult to generate natural language.

Contrastive Learning Image Generation

Dual Modalities of Text: Visual and Textual Generative Pre-training

no code implementations16 Apr 2024 Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu

Harnessing visual texts represents a burgeoning frontier in the evolution of language modeling.

Language Modelling

On Training Data Influence of GPT Models

1 code implementation11 Apr 2024 Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Keze Wang, Hua Wu

This paper presents GPTfluence, a novel approach that leverages a featurized simulation to assess the impact of training examples on the training dynamics of GPT models.

Natural Language Understanding

HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization

1 code implementation26 Feb 2024 Qiwei Peng, Yekun Chai, Xuhong LI

These benchmarks have overlooked the vast landscape of massively multilingual NL to multilingual code, leaving a critical gap in the evaluation of multilingual LLMs.

Code Generation

Tool-Augmented Reward Modeling

1 code implementation2 Oct 2023 Lei LI, Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Ningyu Zhang, Hua Wu

We validate our approach across a wide range of domains, incorporating seven distinct external tools.

Improved Training of Mixture-of-Experts Language GANs

no code implementations23 Feb 2023 Yekun Chai, Qiyue Yin, Junge Zhang

In this work, we (1) first empirically show that the mixture-of-experts approach is able to enhance the representation capacity of the generator for language GANs and (2) harness the Feature Statistics Alignment (FSA) paradigm to render fine-grained learning signals to advance the generator training.

Adversarial Text Image Generation +1

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

no code implementations9 Feb 2023 Pengfei Zhu, Chao Pang, Yekun Chai, Lei LI, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu

In response to this lacuna, this paper introduces a pioneering contribution in the form of a text-to-waveform music generation model, underpinned by the utilization of diffusion models.

Music Generation Text-to-Music Generation

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages

1 code implementation13 Dec 2022 Yekun Chai, Shuohuan Wang, Chao Pang, Yu Sun, Hao Tian, Hua Wu

Extensive results show that ERNIE-Code outperforms previous multilingual LLMs for PL or NL across a wide range of end tasks of code intelligence, including multilingual code-to-text, text-to-code, code-to-code, and text-to-text generation.

Code Summarization Language Modelling +2

Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards

no code implementations21 Oct 2022 Yekun Chai, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Derivative-free prompt learning has emerged as a lightweight alternative to prompt tuning, which only requires model inference to optimize the prompts.

RefineCap: Concept-Aware Refinement for Image Captioning

no code implementations8 Sep 2021 Yekun Chai, Shuo Jin, Junliang Xing

Automatically translating images to texts involves image scene understanding and language modeling.

Descriptive Image Captioning +3

Improving Sequence Generative Adversarial Networks with Feature Statistics Alignment

no code implementations1 Jan 2021 Yekun Chai, Qiyue Yin, Junge Zhang

Generative Adversarial Networks (GAN) are facing great challenges in synthesizing sequences of discrete elements, such as mode dropping and unstable training.

Binary Classification

Neural Text Classification by Jointly Learning to Cluster and Align

no code implementations24 Nov 2020 Yekun Chai, Haidong Zhang, Shuo Jin

Distributional text clustering delivers semantically informative representations and captures the relevance between each word and semantic clustering centroids.

Clustering General Classification +4

Highway Transformer: Self-Gating Enhanced Self-Attentive Networks

1 code implementation ACL 2020 Yekun Chai, Shuo Jin, Xinwen Hou

Self-attention mechanisms have made striking state-of-the-art (SOTA) progress in various sequence learning tasks, standing on the multi-headed dot product attention by attending to all the global contexts at different locations.

How to Evaluate Word Representations of Informal Domain?

1 code implementation12 Nov 2019 Yekun Chai, Naomi Saphra, Adam Lopez

Diverse word representations have surged in most state-of-the-art natural language processing (NLP) applications.

Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.