Past, Present, and Future: Conversational Emotion Recognition through Structural Modeling of Psychological Knowledge
Conversational Emotion Recognition (CER) is a task to predict the emotion of an utterance in the context of a conversation. Although modeling the conversational context and interactions between speakers has been studied broadly, it is important to consider the speaker’s psychological state, which controls the action and intention of the speaker. The state-of-the-art method introduces CommonSense Knowledge (CSK) to model psychological states in a sequential way (forwards and backwards). However, it ignores the structural psychological interactions between utterances. In this paper, we propose a pSychological-Knowledge-Aware Interaction Graph (SKAIG). In the locally connected graph, the targeted utterance will be enhanced with the information of action inferred from the past context and intention implied by the future context. The utterance is self-connected to consider the present effect from itself. Furthermore, we utilize CSK to enrich edges with knowledge representations and process the SKAIG with a graph transformer. Our method achieves state-of-the-art and competitive performance on four popular CER datasets.
PDF AbstractDatasets
Results from the Paper
Task | Dataset | Model | Metric Name | Metric Value | Global Rank | Uses Extra Training Data |
Benchmark |
---|---|---|---|---|---|---|---|
Emotion Recognition in Conversation | DailyDialog | Psychological | Macro F1 | 51.95 | # 3 | ||
Micro-F1 | 59.75 | # 8 | |||||
Emotion Recognition in Conversation | EmoryNLP | Psychological | Weighted-F1 | 38.88 | # 8 | ||
Emotion Recognition in Conversation | IEMOCAP | SKAIG | Weighted-F1 | 66.98 | # 16 | ||
Emotion Recognition in Conversation | MELD | Psychological | Weighted-F1 | 65.18 | # 17 |