Conversational Models

Meena

Introduced by Adiwardana et al. in Towards a Human-like Open-Domain Chatbot

Meena is a multi-turn, open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. The 2.6B-parameter neural network is trained simply to minimize the perplexity of the next token. It is a seq2seq model with the Evolved Transformer as its main architecture. Each training example comes from a multi-turn conversation: the input sequence is all turns of the context, and the output sequence is the response.
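To make the training setup described above concrete, here is a minimal Python sketch of two ideas: splitting a conversation into (context, response) pairs, and computing perplexity from per-token probabilities. It is an illustration only, not the authors' pipeline. The function names `make_training_pairs` and `perplexity` and the sample conversation are hypothetical, and the probabilities are invented numbers standing in for a model's output.

```python
import math


def make_training_pairs(turns):
    """Split a conversation into (context, response) pairs.

    Hypothetical helper: every turn after the first is treated as a
    response, and all turns before it form the context.
    """
    pairs = []
    for i in range(1, len(turns)):
        context = turns[:i]   # all earlier turns
        response = turns[i]   # the turn to be predicted
        pairs.append((context, response))
    return pairs


def perplexity(token_probs):
    """Perplexity = exp(mean negative log-likelihood per token).

    token_probs holds the probability the model assigned to each
    correct next token (made-up values in this sketch).
    """
    if not token_probs:
        raise ValueError("token_probs must not be empty")
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(mean_nll)


if __name__ == "__main__":
    conversation = ["Hi!", "Hello, how are you?", "Fine, thanks."]
    for context, response in make_training_pairs(conversation):
        print(context, "->", response)

    # A model that assigns higher probability to the correct tokens
    # has lower perplexity.
    print(perplexity([0.5, 0.4, 0.6]))
    print(perplexity([0.1, 0.05, 0.2]))
```

Minimizing perplexity is equivalent to minimizing the average negative log-likelihood of the correct next token, which is why a model that assigns uniform probability 1/V over V candidates has perplexity exactly V.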

Source: Towards a Human-like Open-Domain Chatbot

Tasks


Task         Papers  Share
Chatbot      2       66.67%
Specificity  1       33.33%

