Search Results for author: Zhiyuan Ma

Found 19 papers, 7 papers with code

Intention Reasoning Network for Multi-Domain End-to-end Task-Oriented Dialogue

no code implementations EMNLP 2021 Zhiyuan Ma, Jianjun Li, Zezheng Zhang, GuoHui Li, Yongjing Cheng

Based on such a mechanism, we further propose an intention reasoning network (IR-Net), which consists of joint and multi-hop reasoning, to obtain intention-aware representations of conceptual tokens that can be used to capture the concept shifts involved in task-oriented conversations, so as to effectively identify user’s intention and generate more accurate responses.

UniTranSeR: A Unified Transformer Semantic Representation Framework for Multimodal Task-Oriented Dialog System

no code implementations ACL 2022 Zhiyuan Ma, Jianjun Li, GuoHui Li, Yongjing Cheng

Specifically, we first embed the multimodal features into a unified Transformer semantic space to prompt inter-modal interactions, and then devise a feature alignment and intention reasoning (FAIR) layer to perform cross-modal entity alignment and fine-grained key-value reasoning, so as to effectively identify user’s intention for generating more accurate responses.

Entity Alignment

GLAF: Global-to-Local Aggregation and Fission Network for Semantic Level Fact Verification

no code implementations COLING 2022 Zhiyuan Ma, Jianjun Li, GuoHui Li, Yongjing Cheng

Accurate fact verification depends on performing fine-grained reasoning over crucial entities by capturing their latent logical relations hidden in multiple evidence clues, which is generally lacking in existing fact verification models.

Fact Verification

Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

1 code implementation 26 Mar 2024 Yabin Zhang, Wenjie Zhu, Hui Tang, Zhiyuan Ma, Kaiyang Zhou, Lei Zhang

In this paper, we introduce a versatile adaptation approach that can effectively work under all three settings.

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

1 code implementation 8 Feb 2024 Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei

We suspect this is due to a shortage of paired audio-4D data, which is crucial for the Transformer to effectively perform as a denoiser within the Diffusion framework.

Generative Multi-Modal Knowledge Retrieval with Large Language Models

no code implementations 16 Jan 2024 Xinwei Long, Jiali Zeng, Fandong Meng, Zhiyuan Ma, Kaiyan Zhang, BoWen Zhou, Jie Zhou

Knowledge retrieval with multi-modal queries plays a crucial role in supporting knowledge-intensive multi-modal applications.

Retrieval

LMD: Faster Image Reconstruction with Latent Masking Diffusion

1 code implementation 13 Dec 2023 Zhiyuan Ma, Zhihuan Yu, Jianjun Li, BoWen Zhou

Then, we combine the advantages of MAEs and DPMs to design a progressive masking diffusion model, which gradually increases the masking proportion by three different schedulers and reconstructs the latent features from simple to difficult, without sequentially performing denoising diffusion as in DPMs or using fixed high masking ratio as in MAEs, so as to alleviate the high training time-consumption predicament.

Denoising Image Reconstruction

AdapEdit: Spatio-Temporal Guided Adaptive Editing Algorithm for Text-Based Continuity-Sensitive Image Editing

1 code implementation 13 Dec 2023 Zhiyuan Ma, Guoli Jia, BoWen Zhou

With the great success of text-conditioned diffusion models in creative text-to-image generation, various text-driven image editing approaches have attracted the attentions of many researchers.

Text-to-Image Generation

Comparative Study on Semi-supervised Learning Applied for Anomaly Detection in Hydraulic Condition Monitoring System

no code implementations 5 Jun 2023 Yongqi Dong, KeJia Chen, Zhiyuan Ma

This study systematically compares semi-supervised learning methods applied for anomaly detection in hydraulic condition monitoring systems.

Anomaly Detection

OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering

1 code implementation CVPR 2023 Zhiyuan Ma, Xiangyu Zhu, GuoJun Qi, Zhen Lei, Lei Zhang

In this paper, we propose One-shot Talking face Avatar (OTAvatar), which constructs face avatars by a generalized controllable tri-plane rendering solution so that each personalized avatar can be constructed from only one portrait as the reference.

Comparative Study on Supervised versus Semi-supervised Machine Learning for Anomaly Detection of In-vehicle CAN Network

no code implementations 21 Jul 2022 Yongqi Dong, KeJia Chen, Yinxuan Peng, Zhiyuan Ma

To enhance the security of in-vehicle networks and promote the research in this area, based upon a large scale of CAN network traffic data with the extracted valuable features, this study comprehensively compared fully-supervised machine learning with semi-supervised machine learning methods for CAN message anomaly detection.

Anomaly Detection BIG-bench Machine Learning

Nearest Neighborhood-Based Deep Clustering for Source Data-absent Unsupervised Domain Adaptation

1 code implementation 27 Jul 2021 Song Tang, Yan Yang, Zhiyuan Ma, Norman Hendrich, Fanyu Zeng, Shuzhi Sam Ge, ChangShui Zhang, Jianwei Zhang

To reach this goal, we construct the nearest neighborhood for every target data and take it as the fundamental clustering unit by building our objective on the geometry.

Clustering Deep Clustering

Enriching Medcial Terminology Knowledge Bases via Pre-trained Language Model and Graph Convolutional Network

no code implementations 2 Sep 2019 Jiaying Zhang, Zhixing Zhang, Huanhuan Zhang, Zhiyuan Ma, Yangming Zhou, Ping He

Afterwards, both semantic and structure embeddings are combined to measure the relevancy between the terminology and the entity.

Language Modelling

NE-LP: Normalized Entropy and Loss Prediction based Sampling for Active Learning in Chinese Word Segmentation on EHRs

no code implementations 22 Aug 2019 Tingting Cai, Zhiyuan Ma, Hong Zheng, Yangming Zhou

Meanwhile, to minimize the computational cost of learning, we propose a joint model including a word segmenter and a loss prediction model.

Active Learning Chinese Word Segmentation

CBOWRA: A Representation Learning Approach for Medication Anomaly Detection

no code implementations 20 Aug 2019 Liang Zhao, Zhiyuan Ma, Yangming Zhou, Kai Wang, Shengping Liu, Ju Gao

Electronic health record is an important source for clinical researches and applications, and errors inevitably occur in the data, which could lead to severe damages to both patients and hospital services.

Anomaly Detection BIG-bench Machine Learning

Estimate the Warfarin Dose by Ensemble of Machine Learning Algorithms

no code implementations 10 Sep 2018 Zhiyuan Ma, Ping Wang, Zehui Gao, Ruobing Wang, Koroush Khalighi

Here, we present novel algorithms using stacked generalization frameworks to estimate the warfarin dose, within which different types of machine learning algorithms function together through a meta-machine learning model to maximize the prediction accuracy.

BIG-bench Machine Learning
