Search Results for author: Kai Gao

Found 18 papers, 14 papers with code

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

1 code implementation • 16 Mar 2024 • Hanlei Zhang, Xin Wang, Hua Xu, Qianrui Zhou, Kai Gao, Jianhua Su, Jinyue Zhao, Wenrui Li, Yanting Chen

We believe that MIntRec2.0 will serve as a valuable resource, providing a pioneering foundation for research in human-machine conversational interactions, and significantly facilitating related applications.

Multimodal Intent Recognition

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

1 code implementation • 22 Dec 2023 • Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

To establish an optimal multimodal semantic environment for the text modality, we develop a modality-aware prompting module (MAP), which effectively aligns and fuses features from the text, video, and audio modalities using similarity-based modality alignment and a cross-modality attention mechanism.

Ranked #2 on Multimodal Intent Recognition on MIntRec

Contrastive Learning, Multimodal Intent Recognition

ORLA*: Mobile Manipulator-Based Object Rearrangement with Lazy A*

no code implementations • 24 Sep 2023 • Kai Gao, Yan Ding, Shiqi Zhang, Jingjin Yu

Effectively performing object rearrangement is an essential skill for mobile manipulators, e.g., setting up a dinner table or organizing a desk.

Object

A Forward and Backward Compatible Framework for Few-shot Class-incremental Pill Recognition

1 code implementation • 24 Apr 2023 • Jinghua Zhang, Li Liu, Kai Gao, Dewen Hu

In forward-compatible learning, we propose an innovative virtual class synthesis strategy and a Center-Triplet (CT) loss to enhance discriminative feature learning.

Few-Shot Class-Incremental Learning, Graph Attention, +4 more

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

1 code implementation • 16 Apr 2023 • Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services.

Clustering, Intent Discovery, +3 more

A Self-Adjusting Fusion Representation Learning Model for Unaligned Text-Audio Sequences

no code implementations • 12 Nov 2022 • Kaicheng Yang, Ruxuan Zhang, Hua Xu, Kai Gao

In this paper, a Self-Adjusting Fusion Representation Learning Model (SA-FRLM) is proposed to learn robust crossmodal fusion representations directly from the unaligned text and audio sequences.

Multimodal Sentiment Analysis, Representation Learning

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

1 code implementation • 22 Aug 2022 • Yihe Liu, Ziqi Yuan, Huisheng Mao, Zhiyun Liang, Wanqiuyue Yang, Yuanzhe Qiu, Tie Cheng, Xiaoteng Li, Hua Xu, Kai Gao

The designed modality mixup module can be regarded as an augmentation, which mixes the acoustic and visual modalities from different videos.

Multimodal Sentiment Analysis

Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation

no code implementations • Findings (NAACL) 2022 • Zhijing Wu, Hua Xu, Jingliang Fang, Kai Gao

However, it is a great challenge to learn a new domain incrementally without catastrophically forgetting previous knowledge.

Domain Adaptation, Incremental Learning, +1 more

Toward Efficient Task Planning for Dual-Arm Tabletop Object Rearrangement

no code implementations • 17 Jul 2022 • Kai Gao, Jingjin Yu

We investigate the problem of coordinating two robot arms to solve non-monotone tabletop multi-object rearrangement tasks.

Object, Scheduling

Deep Contrastive One-Class Time Series Anomaly Detection

1 code implementation • 4 Jul 2022 • Rui Wang, Chongwei Liu, Xudong Mou, Kai Gao, Xiaohui Guo, Pin Liu, Tianyu Wo, Xudong Liu

To overcome these shortcomings, the authors propose COCA, a deep Contrastive One-Class Anomaly detection method for time series, which follows the normality assumptions of contrastive learning (CL) and one-class classification.

Contrastive Learning, One-Class Classification, +2 more

M-SENA: An Integrated Platform for Multimodal Sentiment Analysis

3 code implementations • ACL 2022 • Huisheng Mao, Ziqi Yuan, Hua Xu, Wenmeng Yu, Yihe Liu, Kai Gao

The platform features a fully modular video sentiment analysis framework consisting of data management, feature extraction, model training, and result analysis modules.

Management, Multimodal Sentiment Analysis

Lazy Rearrangement Planning in Confined Spaces

2 code implementations • 19 Mar 2022 • Rui Wang, Kai Gao, Jingjin Yu, Kostas Bekris

Object rearrangement is important for many applications but remains challenging, especially in confined spaces, such as shelves, where objects cannot be accessed from above and they block reachability to each other.

Motion Planning

Consistent Representation Learning for Continual Relation Extraction

1 code implementation • Findings (ACL) 2022 • Kang Zhao, Hua Xu, Jiangong Yang, Kai Gao

Specifically, supervised contrastive learning based on a memory bank is first used to train each new task so that the model can effectively learn the relation representation.

Continual Relation Extraction, Contrastive Learning, +3 more

TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition

2 code implementations • ACL 2021 • Hanlei Zhang, Xiaoteng Li, Hua Xu, Panpan Zhang, Kang Zhao, Kai Gao

It is composed of two main modules: open intent detection and open intent discovery.

Intent Discovery, Intent Recognition, +3 more

Representation Iterative Fusion based on Heterogeneous Graph Neural Network for Joint Entity and Relation Extraction

1 code implementation • 8 May 2021 • Kang Zhao, Hua Xu, Yue Cheng, Xiaoteng Li, Kai Gao

Joint entity and relation extraction is an essential task in information extraction, which aims to extract all relational triples from unstructured text.

Ranked #2 on Relation Extraction on SemEval-2010 Task-8

Joint Entity and Relation Extraction, Relation, +2 more

A Structure-Aware Relation Network for Thoracic Diseases Detection and Segmentation

1 code implementation • 21 Apr 2021 • Jie Lian, Jingyu Liu, Shu Zhang, Kai Gao, Xiaoqing Liu, Dingwen Zhang, Yizhou Yu

Leveraging constant structure and disease relations extracted from domain knowledge, we propose a structure-aware relation network (SAR-Net) extending Mask R-CNN.

Instance Segmentation, Object Detection, +2 more

Uniform Object Rearrangement: From Complete Monotone Primitives to Efficient Non-Monotone Informed Search

2 code implementations • 28 Jan 2021 • Rui Wang, Kai Gao, Daniel Nakhimovich, Jingjin Yu, Kostas E. Bekris

DFSDP is extended to solve single-buffer, non-monotone instances, given a choice of an object and a buffer.

Object

Cross-Modal BERT for Text-Audio Sentiment Analysis

1 code implementation • ACM Multimedia 2020 • Kaicheng Yang, Hua Xu, Kai Gao

In this paper, we propose the Cross-Modal BERT (CM-BERT), which relies on the interaction of the text and audio modalities to fine-tune the pre-trained BERT model.

Ranked #3 on Multimodal Sentiment Analysis on MOSI

Multimodal Sentiment Analysis, Natural Language Inference, +1 more
