1 code implementation • 5 May 2022 • Shaojie Jiang, Ruqing Zhang, Svitlana Vakulenko, Maarten de Rijke
The cross-entropy objective has proved to be an all-purpose training objective for autoregressive language models (LMs).
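The token-level cross-entropy objective mentioned here can be sketched as the mean negative log-probability the model assigns to each ground-truth next token. This is a minimal illustrative implementation; the vocabulary size and logits below are made up, not taken from the paper.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(logits, targets):
    """logits: (seq_len, vocab) scores; targets: (seq_len,) int token ids.
    Returns mean negative log-probability of the target tokens."""
    probs = softmax(logits)
    return float(-np.log(probs[np.arange(len(targets)), targets]).mean())

rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 10))   # 4 positions, vocabulary of 10 tokens
targets = np.array([3, 1, 7, 2])
loss = cross_entropy(logits, targets)
```

With all-zero logits the predictive distribution is uniform, so the loss reduces to log(vocab_size), a common sanity check when implementing this objective.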
1 code implementation • 26 Mar 2020 • Shaojie Jiang, Thomas Wolf, Christof Monz, Maarten de Rijke
We hypothesize that the deeper reason is that the training corpora contain hard tokens that are more difficult for a generative model to learn than others; once training has finished, these hard tokens remain under-learned, so repetitive generations are more likely to happen.
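One way to operationalize the idea of under-learned tokens is to track per-token training loss and flag tokens whose average loss stays high. This is a hypothetical sketch, not the paper's method; the token names, loss values, and the 1.0 threshold are all invented for illustration.

```python
def mean(xs):
    return sum(xs) / len(xs)

# Hypothetical per-token losses recorded over several training epochs.
token_losses = {
    "the": [0.20, 0.12, 0.08],          # easy token: loss drops quickly
    "serendipity": [3.10, 2.85, 2.90],  # hard token: loss stays high
}

# Flag tokens still above an (assumed) loss threshold as under-learned.
LOSS_THRESHOLD = 1.0
hard_tokens = {tok for tok, losses in token_losses.items()
               if mean(losses) > LOSS_THRESHOLD}
```

In a real training loop the per-token losses would come from the model's cross-entropy at each position, aggregated by token id.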
2 code implementations • 25 Feb 2019 • Shaojie Jiang, Pengjie Ren, Christof Monz, Maarten de Rijke
Specifically, we first analyze the influence of the commonly used Cross-Entropy (CE) loss function, and find that the CE loss function prefers high-frequency tokens, which results in low-diversity responses.
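The frequency bias described here can be illustrated with a context-free (unigram) model: the CE-optimal distribution is simply the empirical token frequency, so frequent generic tokens absorb most of the probability mass. The toy corpus below is invented for illustration and is not from the paper.

```python
from collections import Counter
import math

# A tiny invented corpus dominated by a few high-frequency tokens.
corpus = ["i", "do", "not", "know", "i", "do", "not", "i", "do", "i"]
counts = Counter(corpus)
total = len(corpus)

# For a unigram model, the CE-minimizing probabilities are the
# empirical frequencies, so "i" and "do" get the most mass.
p_opt = {tok: c / total for tok, c in counts.items()}

# Average CE under this optimum equals the empirical entropy of the corpus.
ce = -sum((counts[t] / total) * math.log(p_opt[t]) for t in counts)
```

This is why decoding from a CE-trained model tends toward high-frequency, low-diversity responses unless the objective or decoding procedure is modified.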
no code implementations • WS 2018 • Shaojie Jiang, Maarten de Rijke
Sequence-to-sequence (Seq2Seq) models have been shown to be very effective for response generation.
no code implementations • CVPR 2016 • Jifeng Ning, Jimei Yang, Shaojie Jiang, Lei Zhang, Ming-Hsuan Yang
Structured support vector machine (SSVM) based methods have demonstrated encouraging performance in recent object tracking benchmarks.