Search Results for author: Kai Gao

Found 18 papers, 14 papers with code

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

1 code implementation 16 Mar 2024 Hanlei Zhang, Xin Wang, Hua Xu, Qianrui Zhou, Kai Gao, Jianhua Su, Jinyue Zhao, Wenrui Li, Yanting Chen

We believe that MIntRec2.0 will serve as a valuable resource, providing a pioneering foundation for research in human-machine conversational interactions, and significantly facilitating related applications.

Multimodal Intent Recognition

Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition

1 code implementation 22 Dec 2023 Qianrui Zhou, Hua Xu, Hao Li, Hanlei Zhang, Xiaohan Zhang, Yifan Wang, Kai Gao

To establish an optimal multimodal semantic environment for the text modality, we develop a modality-aware prompting module (MAP), which effectively aligns and fuses features from the text, video, and audio modalities using similarity-based modality alignment and a cross-modality attention mechanism.

Contrastive Learning Multimodal Intent Recognition

ORLA*: Mobile Manipulator-Based Object Rearrangement with Lazy A*

no code implementations 24 Sep 2023 Kai Gao, Yan Ding, Shiqi Zhang, Jingjin Yu

Effectively performing object rearrangement is an essential skill for mobile manipulators, e.g., setting up a dinner table or organizing a desk.

Object

A Forward and Backward Compatible Framework for Few-shot Class-incremental Pill Recognition

1 code implementation 24 Apr 2023 Jinghua Zhang, Li Liu, Kai Gao, Dewen Hu

In forward-compatible learning, we propose an innovative virtual class synthesis strategy and a Center-Triplet (CT) loss to enhance discriminative feature learning.

Few-Shot Class-Incremental Learning Graph Attention +4

A Clustering Framework for Unsupervised and Semi-supervised New Intent Discovery

1 code implementation 16 Apr 2023 Hanlei Zhang, Hua Xu, Xin Wang, Fei Long, Kai Gao

New intent discovery is of great value to natural language processing, allowing for a better understanding of user needs and providing friendly services.

Clustering Intent Discovery +3

A Self-Adjusting Fusion Representation Learning Model for Unaligned Text-Audio Sequences

no code implementations 12 Nov 2022 Kaicheng Yang, Ruxuan Zhang, Hua Xu, Kai Gao

In this paper, a Self-Adjusting Fusion Representation Learning Model (SA-FRLM) is proposed to learn robust crossmodal fusion representations directly from the unaligned text and audio sequences.

Multimodal Sentiment Analysis Representation Learning

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

1 code implementation 22 Aug 2022 Yihe Liu, Ziqi Yuan, Huisheng Mao, Zhiyun Liang, Wanqiuyue Yang, Yuanzhe Qiu, Tie Cheng, Xiaoteng Li, Hua Xu, Kai Gao

The designed modality mixup module can be regarded as an augmentation, which mixes the acoustic and visual modalities from different videos.

Multimodal Sentiment Analysis

Toward Efficient Task Planning for Dual-Arm Tabletop Object Rearrangement

no code implementations 17 Jul 2022 Kai Gao, Jingjin Yu

We investigate the problem of coordinating two robot arms to solve non-monotone tabletop multi-object rearrangement tasks.

Object Scheduling

Deep Contrastive One-Class Time Series Anomaly Detection

1 code implementation 4 Jul 2022 Rui Wang, Chongwei Liu, Xudong Mou, Kai Gao, Xiaohui Guo, Pin Liu, Tianyu Wo, Xudong Liu

To overcome these shortcomings, a deep Contrastive One-Class Anomaly detection method for time series (COCA) is proposed, following the normality assumptions of CL and one-class classification.

Contrastive Learning One-Class Classification +2

M-SENA: An Integrated Platform for Multimodal Sentiment Analysis

3 code implementations ACL 2022 Huisheng Mao, Ziqi Yuan, Hua Xu, Wenmeng Yu, Yihe Liu, Kai Gao

The platform features a fully modular video sentiment analysis framework consisting of data management, feature extraction, model training, and result analysis modules.

Management Multimodal Sentiment Analysis

Lazy Rearrangement Planning in Confined Spaces

2 code implementations 19 Mar 2022 Rui Wang, Kai Gao, Jingjin Yu, Kostas Bekris

Object rearrangement is important for many applications but remains challenging, especially in confined spaces such as shelves, where objects cannot be accessed from above and block each other's reachability.

Motion Planning

Consistent Representation Learning for Continual Relation Extraction

1 code implementation Findings (ACL) 2022 Kang Zhao, Hua Xu, Jiangong Yang, Kai Gao

Specifically, supervised contrastive learning based on a memory bank is first used to train each new task so that the model can effectively learn the relation representation.

Continual Relation Extraction Contrastive Learning +3

A Structure-Aware Relation Network for Thoracic Diseases Detection and Segmentation

1 code implementation 21 Apr 2021 Jie Lian, Jingyu Liu, Shu Zhang, Kai Gao, Xiaoqing Liu, Dingwen Zhang, Yizhou Yu

Leveraging constant structure and disease relations extracted from domain knowledge, we propose a structure-aware relation network (SAR-Net) extending Mask R-CNN.

Instance Segmentation Object Detection +2

Cross-Modal BERT for Text-Audio Sentiment Analysis

1 code implementation ACM Multimedia 2020 Kaicheng Yang, Hua Xu, Kai Gao

In this paper, we propose the Cross-Modal BERT (CM-BERT), which relies on the interaction of the text and audio modalities to fine-tune the pre-trained BERT model.

Multimodal Sentiment Analysis Natural Language Inference +1
