1 code implementation • ACL (dialdoc) 2021 • Yan Xu, Etsuko Ishii, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung
Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users’ needs.
no code implementations • 25 Nov 2024 • Wang Bill Zhu, Deqing Fu, Kai Sun, Yi Lu, Zhaojiang Lin, Seungwhan Moon, Kanika Narang, Mustafa Canim, Yue Liu, Anuj Kumar, Xin Luna Dong
We hypothesize that a user's visual history with images reflecting their daily life, offers valuable insights into their interests and preferences, and can be leveraged for personalization.
no code implementations • 7 Mar 2024 • Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon
We have developed the SnapNTell Dataset, distinct from traditional VQA datasets: (1) it encompasses a wide range of categorized entities, each represented by images and explicitly named in the answers; (2) it features QA pairs that require extensive knowledge for accurate responses.
1 code implementation • 16 Feb 2024 • Zekun Li, Zhiyu Zoey Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook
We also show that by fine-tuning on a small collection of diverse task-oriented dialogues, we can equip modestly sized models, specifically a 13B parameter LLaMA2-Chat model, with function-calling capabilities and DST performance comparable to ChatGPT while maintaining their chat capabilities.
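As a rough illustration of how a task-oriented dialogue turn can be serialized into a function-calling target for supervised fine-tuning, the sketch below uses a hypothetical schema and chat template; the system prompt, slot names, and JSON layout are assumptions, not the exact format used in the paper.

```python
# Hypothetical formatting of one dialogue turn into a (prompt, target) pair
# for function-calling fine-tuning. Schema and template are illustrative.
import json

def build_sft_example(system_prompt, dialogue_history, api_call):
    """Serialize one turn into prompt/target strings for supervised fine-tuning."""
    prompt = f"<<SYS>>{system_prompt}<</SYS>>\n"
    for speaker, utterance in dialogue_history:
        prompt += f"{speaker}: {utterance}\n"
    prompt += "Assistant:"
    # The target is a structured function call the model learns to emit,
    # which doubles as the dialogue state for DST evaluation.
    target = json.dumps(api_call, ensure_ascii=False)
    return prompt, target

example_prompt, example_target = build_sft_example(
    system_prompt="You can call find_restaurant(area, food, pricerange).",
    dialogue_history=[("User", "I need a cheap Italian place in the centre.")],
    api_call={
        "function": "find_restaurant",
        "arguments": {"area": "centre", "food": "italian", "pricerange": "cheap"},
    },
)
print(example_prompt)
print(example_target)
```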
1 code implementation • 27 Sep 2023 • Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar
We present Any-Modality Augmented Language Model (AnyMAL), a unified model that reasons over diverse input modality signals (i.e., text, image, video, audio, IMU motion sensor), and generates textual responses.
Ranked #9 on Video Question Answering on STAR Benchmark
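The general pattern behind such any-modality models, projecting a (typically frozen) modality encoder's output into the LLM's token-embedding space as a short soft-token prefix, can be sketched as below. The dimensions, single linear projector, and prefix length are illustrative assumptions, not AnyMAL's actual architecture or training recipe.

```python
# Minimal PyTorch sketch: map one modality embedding to a few "soft tokens"
# that are prepended to the LLM's text token embeddings.
import torch
import torch.nn as nn

class ModalityProjector(nn.Module):
    def __init__(self, modality_dim=512, llm_dim=4096, num_prefix_tokens=8):
        super().__init__()
        self.proj = nn.Linear(modality_dim, llm_dim * num_prefix_tokens)
        self.num_prefix_tokens = num_prefix_tokens
        self.llm_dim = llm_dim

    def forward(self, modality_embedding):            # (batch, modality_dim)
        x = self.proj(modality_embedding)              # (batch, llm_dim * k)
        return x.view(-1, self.num_prefix_tokens, self.llm_dim)

projector = ModalityProjector()
image_embedding = torch.randn(2, 512)                 # e.g. from a frozen image encoder
text_token_embeddings = torch.randn(2, 16, 4096)      # from the LLM's embedding table
prefix = projector(image_embedding)                   # (2, 8, 4096)
llm_inputs = torch.cat([prefix, text_token_embeddings], dim=1)
print(llm_inputs.shape)                               # torch.Size([2, 24, 4096])
```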
1 code implementation • 23 May 2023 • Hyundong Cho, Andrea Madotto, Zhaojiang Lin, Khyathi Raghavi Chandu, Satwik Kottur, Jing Xu, Jonathan May, Chinnadhurai Sankar
Dialogue systems are frequently updated to accommodate new services, but naively updating them by continually training with data for new services results in diminishing performance on previously learnt services.
no code implementations • 15 Nov 2022 • Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-Yi Lee, Yizhou Sun, Wei Wang
Recent studies find existing self-supervised speech encoders contain primarily acoustic rather than semantic information.
Automatic Speech Recognition (ASR)
2 code implementations • 26 Oct 2022 • Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Alireza Dirafzoon, Aparajita Saraf, Amy Bearman, Babak Damavandi
We present IMU2CLIP, a novel pre-training approach to align Inertial Measurement Unit (IMU) motion sensor recordings with video and text, by projecting them into the joint representation space of Contrastive Language-Image Pre-training (CLIP).
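A hedged sketch of the contrastive alignment idea described above: an IMU encoder is trained so that its outputs match the frozen CLIP embeddings of the time-aligned video (or text) with a symmetric InfoNCE loss. The encoder architecture and loss details here are simplified assumptions, not the paper's exact setup.

```python
# Contrastive alignment of an IMU encoder to a frozen CLIP embedding space.
import torch
import torch.nn as nn
import torch.nn.functional as F

class IMUEncoder(nn.Module):
    def __init__(self, in_channels=6, clip_dim=512):
        super().__init__()
        self.conv = nn.Conv1d(in_channels, 128, kernel_size=5, padding=2)
        self.head = nn.Linear(128, clip_dim)

    def forward(self, imu):                            # (batch, channels, time)
        h = F.relu(self.conv(imu)).mean(dim=-1)        # temporal average pooling
        return F.normalize(self.head(h), dim=-1)

def clip_style_contrastive_loss(imu_emb, clip_emb, temperature=0.07):
    # Symmetric InfoNCE over the in-batch similarity matrix.
    logits = imu_emb @ clip_emb.t() / temperature
    labels = torch.arange(logits.size(0))
    return (F.cross_entropy(logits, labels) + F.cross_entropy(logits.t(), labels)) / 2

encoder = IMUEncoder()
imu_windows = torch.randn(4, 6, 200)                        # accelerometer + gyroscope windows
clip_video_emb = F.normalize(torch.randn(4, 512), dim=-1)   # stand-in for frozen CLIP embeddings
loss = clip_style_contrastive_loss(encoder(imu_windows), clip_video_emb)
loss.backward()
```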
no code implementations • 14 Oct 2022 • Yejin Bang, Tiezheng Yu, Andrea Madotto, Zhaojiang Lin, Mona Diab, Pascale Fung
Therefore, we introduce a framework for value-aligned classification that performs prediction based on explicitly written human values in the command.
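At a high level, conditioning a prediction on explicitly written human values can be illustrated with a simple prompt template like the one below; the value statements, labels, and template wording are hypothetical examples, not the framework's actual commands.

```python
# Illustrative prompt that makes the target human values explicit in the command.
def build_value_aligned_prompt(values, text, labels):
    value_block = "\n".join(f"- {v}" for v in values)
    return (
        "Classify the statement according to the following values:\n"
        f"{value_block}\n\n"
        f"Statement: {text}\n"
        f"Answer with one of: {', '.join(labels)}."
    )

prompt = build_value_aligned_prompt(
    values=["People should not be harmed or threatened.",
            "Everyone deserves equal respect regardless of background."],
    text="Example input text to classify.",
    labels=["acceptable", "unacceptable"],
)
print(prompt)
```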
1 code implementation • CVPR 2022 • Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura
Speech-driven 3D facial animation is challenging due to the complex geometry of human faces and the limited availability of 3D audio-visual data.
Ranked #1 on 3D Face Animation on VOCASET
no code implementations • 4 Dec 2021 • Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura
The existing datasets are collected to cover as many different phonemes as possible instead of sentences, thus limiting the capability of the audio-based model to learn more diverse contexts.
2 code implementations • 15 Oct 2021 • Andrea Madotto, Zhaojiang Lin, Genta Indra Winata, Pascale Fung
A simple yet unexplored solution is prompt-based few-shot learning (Brown et al. 2020) which does not require gradient-based fine-tuning but instead uses a few examples in the LM context as the only source of learning.
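A minimal sketch of that prompt-based few-shot setup: a handful of input/output exemplars are placed in the language model's context and the frozen model completes the final query, with no gradient updates. The exemplars and template below are illustrative.

```python
# Build a few-shot prompt from k exemplars; the LM completes the final line.
def build_few_shot_prompt(exemplars, query):
    shots = "\n\n".join(f"Dialogue: {x}\nIntent: {y}" for x, y in exemplars)
    return f"{shots}\n\nDialogue: {query}\nIntent:"

prompt = build_few_shot_prompt(
    exemplars=[
        ("I want to book a table for two tonight.", "restaurant_reservation"),
        ("What's the weather like in Hong Kong tomorrow?", "weather_query"),
    ],
    query="Can you find me a cheap hotel near the station?",
)
# `prompt` would then be sent to a frozen language model for completion.
print(prompt)
```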
1 code implementation • EMNLP (MRL) 2021 • Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Rosanne Liu, Jason Yosinski, Pascale Fung
General-purpose language models have demonstrated impressive capabilities, performing on par with state-of-the-art approaches on a range of downstream natural language processing (NLP) tasks and benchmarks when inferring instructions from very few examples.
1 code implementation • EMNLP 2021 • Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba, Pascale Fung
Zero-shot transfer learning for dialogue state tracking (DST) enables us to handle a variety of task-oriented dialogue domains without the expense of collecting in-domain data.
1 code implementation • 7 Jun 2021 • Etsuko Ishii, Yan Xu, Genta Indra Winata, Zhaojiang Lin, Andrea Madotto, Zihan Liu, Peng Xu, Pascale Fung
Information-seeking dialogue systems, including knowledge identification and response generation, aim to respond to users with fluent, coherent, and informative responses based on users' needs.
1 code implementation • 5 Jun 2021 • Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Peng Xu, Feijun Jiang, Yuxiang Hu, Chen Shi, Pascale Fung
However, existing datasets for end-to-end ToD modeling are limited to a single language, hindering the development of robust end-to-end ToD systems for multilingual countries and regions.
1 code implementation • NAACL 2021 • Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho, Rajen Subba
Zero-shot cross-domain dialogue state tracking (DST) enables us to handle unseen domains without the expense of collecting in-domain data.
2 code implementations • 10 May 2021 • Zhaojiang Lin, Bing Liu, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Andrea Madotto, Eunjoon Cho, Rajen Subba
Zero-shot cross-domain dialogue state tracking (DST) enables us to handle task-oriented dialogue in unseen domains without the expense of collecting in-domain data.
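One common way to frame such zero-shot DST, sketched below, is to turn each slot of an unseen domain into a natural-language query over the dialogue history, so a model trained on other domains can fill it without in-domain data. The slot descriptions, template, and model choice are illustrative assumptions rather than the paper's exact prompts.

```python
# Frame a slot in an unseen domain as a natural-language query for a
# text-to-text model trained on source domains.
def slot_to_query(dialogue_history, domain, slot, slot_description):
    return (
        f"dialogue: {dialogue_history} "
        f"question: what is the {slot_description} "
        f"for the {domain} the user wants?"
    )

query = slot_to_query(
    dialogue_history="user: I'd like a taxi to the airport at 5pm.",
    domain="taxi",
    slot="leaveAt",
    slot_description="departure time",
)
# `query` would be fed to a seq2seq model (e.g. T5) fine-tuned on the source
# domains; the generated string is taken as the slot value.
print(query)
```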
no code implementations • NAACL (CALCS) 2021 • Genta Indra Winata, Samuel Cahyawijaya, Zihan Liu, Zhaojiang Lin, Andrea Madotto, Pascale Fung
Multilingual language models have shown decent performance in multilingual and cross-lingual natural language understanding tasks.
1 code implementation • EMNLP 2021 • Andrea Madotto, Zhaojiang Lin, Zhenpeng Zhou, Seungwhan Moon, Paul Crook, Bing Liu, Zhou Yu, Eunjoon Cho, Zhiguang Wang
Continual learning in task-oriented dialogue systems can allow us to add new domains and functionalities through time without incurring the high cost of a whole system retraining.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Andrea Madotto, Etsuko Ishii, Zhaojiang Lin, Sumanth Dathathri, Pascale Fung
These large conversational models provide little control over the generated responses, and this control is further limited in the absence of annotated conversational datasets for attribute specific generation that can be used for fine-tuning the model.
1 code implementation • EMNLP 2020 • Zihan Liu, Genta Indra Winata, Peng Xu, Zhaojiang Lin, Pascale Fung
Despite the promising results of current cross-lingual models for spoken language understanding systems, they still suffer from imperfect cross-lingual representation alignments between the source and target languages, which makes the performance sub-optimal.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Andrea Madotto, Samuel Cahyawijaya, Genta Indra Winata, Yan Xu, Zihan Liu, Zhaojiang Lin, Pascale Fung
In this paper, we propose a method to embed the KB, of any size, directly into the model parameters.
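As a toy illustration of the general idea of embedding a knowledge base into model parameters, KB records can be verbalized into training text so the model answers from its weights at inference time, with no retrieval component. The KB schema and templates below are hypothetical, and the paper's actual training procedure differs in detail.

```python
# Verbalize KB records into training text so the knowledge ends up in the
# model's parameters rather than an external retrieval store.
KB = [
    {"name": "Peony Kitchen", "food": "chinese", "area": "centre", "phone": "01223 350688"},
    {"name": "Curry Prince", "food": "indian", "area": "east", "phone": "01223 566388"},
]

def verbalize(record):
    return (
        f"{record['name']} serves {record['food']} food in the {record['area']} "
        f"area and its phone number is {record['phone']}."
    )

training_texts = [verbalize(r) for r in KB]
# These texts (or dialogues templated from them) are used to fine-tune the
# response generator, so the KB content lives in the parameters themselves.
for t in training_texts:
    print(t)
```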
1 code implementation • EMNLP 2020 • Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Pascale Fung
In this paper, we propose Minimalist Transfer Learning (MinTL) to simplify the system design process of task-oriented dialogue systems and alleviate the over-dependency on annotated data.
Ranked #15 on Multi-domain Dialogue State Tracking on MULTIWOZ 2.1
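A simplified sketch of the kind of design simplification MinTL-style systems enable: dialogue state tracking is treated as plain text-to-text generation, so any pre-trained seq2seq backbone can be plugged in without task-specific architecture changes. The serialization format below is an illustrative assumption, not the exact MinTL format.

```python
# Serialize a DST turn as a (source, target) text pair for a seq2seq model.
def make_dst_pair(prev_state, user_turn, new_state):
    source = f"belief: {prev_state} user: {user_turn}"
    target = new_state                      # the model learns to emit the updated state
    return source, target

source, target = make_dst_pair(
    prev_state="hotel area = north",
    user_turn="Actually, make it a cheap one in the centre.",
    new_state="hotel area = centre ; hotel pricerange = cheap",
)
# Pairs like this can fine-tune any pre-trained seq2seq model (e.g. T5 or BART).
print(source, "->", target)
```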
1 code implementation • 28 Aug 2020 • Andrea Madotto, Zhaojiang Lin, Yejin Bang, Pascale Fung
The dialogue skills can be triggered automatically via a dialogue manager, or manually, thus allowing high-level control of the generated responses.
no code implementations • 21 Aug 2020 • Peng Xu, Zihan Liu, Genta Indra Winata, Zhaojiang Lin, Pascale Fung
Most emotion recognition methods tackle the emotion understanding task by considering each emotion independently, ignoring their fuzzy nature and the interconnections among them.
Ranked #3 on Emotion Classification on SemEval 2018 Task 1E-c
no code implementations • 14 Aug 2020 • Andrea Madotto, Zihan Liu, Zhaojiang Lin, Pascale Fung
In this paper, we evaluate the priming few-shot ability of language models in the NLU, DST, DP and NLG tasks.
1 code implementation • ACL 2020 • Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu, Pascale Fung
An increasing number of people in the world today speak a mixed language as a result of being multilingual.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Zhaojiang Lin, Andrea Madotto, Pascale Fung
Fine-tuning pre-trained generative language models to down-stream language generation tasks has shown promising results.
2 code implementations • 28 Mar 2020 • Zhaojiang Lin, Genta Indra Winata, Peng Xu, Zihan Liu, Pascale Fung
Despite the great promise of Transformers in many sequence modeling tasks (e.g., machine translation), their deterministic nature hinders them from generalizing to high-entropy tasks such as dialogue response generation.
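One generic remedy for that determinism is to introduce a stochastic latent variable into the seq2seq model. Below is a minimal reparameterization-trick sketch of that idea; the dimensions and wiring are illustrative, not the Variational Transformer's exact architecture.

```python
# Sample a Gaussian latent with the reparameterization trick and mix it into
# the decoder input; the KL term regularizes the posterior toward the prior.
import torch
import torch.nn as nn

class LatentLayer(nn.Module):
    def __init__(self, hidden_dim=256, latent_dim=64):
        super().__init__()
        self.to_mu = nn.Linear(hidden_dim, latent_dim)
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)
        self.to_hidden = nn.Linear(latent_dim, hidden_dim)

    def forward(self, context):                       # (batch, hidden_dim)
        mu, logvar = self.to_mu(context), self.to_logvar(context)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # reparameterization
        kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=-1).mean()
        return self.to_hidden(z), kl                   # latent-conditioned decoder input, KL term

layer = LatentLayer()
encoder_context = torch.randn(4, 256)
decoder_init, kl_loss = layer(encoder_context)
print(decoder_init.shape, kl_loss.item())
```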
1 code implementation • EMNLP (NLP4ConvAI) 2021 • Zhaojiang Lin, Zihan Liu, Genta Indra Winata, Samuel Cahyawijaya, Andrea Madotto, Yejin Bang, Etsuko Ishii, Pascale Fung
Experimental results show that the multilingual trained models outperform the translation-pipeline and that they are on par with the monolingual models, with the advantage of having a single model across multiple languages.
1 code implementation • 4 Mar 2020 • Genta Indra Winata, Samuel Cahyawijaya, Zihan Liu, Zhaojiang Lin, Andrea Madotto, Peng Xu, Pascale Fung
The great variability and complex characteristics of accents create a major challenge for training a robust and accent-agnostic automatic speech recognition (ASR) system.
Audio and Speech Processing • Sound
no code implementations • 30 Jan 2020 • Zihan Liu, Genta Indra Winata, Samuel Cahyawijaya, Andrea Madotto, Zhaojiang Lin, Pascale Fung
To verify this hypothesis, we investigate whether making models insensitive to the word order of the source language can improve the adaptation performance in target languages.
no code implementations • 7 Jan 2020 • Andrea Madotto, Zhaojiang Lin, Chien-Sheng Wu, Jamin Shin, Pascale Fung
Dialogue systems require a great deal of different but complementary expertise to assist, inform, and entertain humans.
1 code implementation • 21 Nov 2019 • Zihan Liu, Genta Indra Winata, Zhaojiang Lin, Peng Xu, Pascale Fung
Recently, data-driven task-oriented dialogue systems have achieved promising performance in English.
no code implementations • 30 Oct 2019 • Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Pascale Fung
High-performing deep neural networks come at the cost of computational complexity that limits their practicality for deployment on portable devices.
1 code implementation • IJCNLP 2019 • Genta Indra Winata, Zhaojiang Lin, Jamin Shin, Zihan Liu, Pascale Fung
In countries that speak multiple main languages, mixing up different languages within a conversation is commonly called code-switching.
5 code implementations • IJCNLP 2019 • Zhaojiang Lin, Andrea Madotto, Jamin Shin, Peng Xu, Pascale Fung
Previous research on empathetic dialogue systems has mostly focused on generating responses given certain emotions.
1 code implementation • LREC 2020 • Chien-Sheng Wu, Andrea Madotto, Zhaojiang Lin, Peng Xu, Pascale Fung
User attributes provide rich and useful information for user understanding, yet structured and easy-to-use attributes are often sparsely populated.
no code implementations • WS 2019 • Genta Indra Winata, Zhaojiang Lin, Pascale Fung
In this paper, we propose Multilingual Meta-Embeddings (MME), an effective method to learn multilingual representations by leveraging monolingual pre-trained embeddings.
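A hedged PyTorch sketch of the meta-embedding idea: word vectors from several monolingual embedding spaces are projected into a shared space and combined with learned attention weights. The dimensions and the attention form are simplified assumptions rather than the exact MME formulation.

```python
# Attention-weighted combination of projected monolingual embeddings.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MetaEmbedding(nn.Module):
    def __init__(self, source_dims=(300, 300, 100), shared_dim=256):
        super().__init__()
        self.projections = nn.ModuleList([nn.Linear(d, shared_dim) for d in source_dims])
        self.scorer = nn.Linear(shared_dim, 1)

    def forward(self, embeddings):   # list of (batch, seq_len, dim_i) tensors
        projected = torch.stack(
            [proj(e) for proj, e in zip(self.projections, embeddings)], dim=2
        )                                                     # (batch, seq_len, num_sources, shared_dim)
        weights = F.softmax(self.scorer(projected), dim=2)    # attention over embedding sources
        return (weights * projected).sum(dim=2)               # (batch, seq_len, shared_dim)

meta = MetaEmbedding()
english = torch.randn(2, 10, 300)
chinese = torch.randn(2, 10, 300)
subword = torch.randn(2, 10, 100)
combined = meta([english, chinese, subword])
print(combined.shape)    # torch.Size([2, 10, 256])
```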
2 code implementations • 28 Jul 2019 • Zhaojiang Lin, Peng Xu, Genta Indra Winata, Farhad Bin Siddique, Zihan Liu, Jamin Shin, Pascale Fung
In this paper, we present an end-to-end empathetic conversation agent CAiRE.
no code implementations • 10 Jun 2019 • Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Jamin Shin, Yan Xu, Peng Xu, Pascale Fung
Detecting emotion from dialogue is a challenge that has not yet been extensively surveyed.
no code implementations • SEMEVAL 2019 • Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Jamin Shin, Yan Xu, Peng Xu, Pascale Fung
Detecting emotion from dialogue is a challenge that has not yet been extensively surveyed.
1 code implementation • ACL 2019 • Zhaojiang Lin, Andrea Madotto, Chien-Sheng Wu, Pascale Fung
Existing personalized dialogue models use human designed persona descriptions to improve dialogue consistency.
no code implementations • 29 Oct 2018 • Zhaojiang Lin, Genta Indra Winata, Pascale Fung
Existing models on open-domain comment generation are difficult to train, and they produce repetitive and uninteresting responses.