Search Results for author: Guoqing Zhao

Found 15 papers, 6 papers with code

Improving Zero-Shot Entity Linking Candidate Generation with Ultra-Fine Entity Type Information

1 code implementation COLING 2022 Xuhui Sui, Ying Zhang, Kehui Song, Baohang Zhou, Guoqing Zhao, Xin Wei, Xiaojie Yuan

Recently, the zero-shot entity linking task, which links mentions to unseen entities to challenge generalization ability, has become a research hotspot.

Entity Linking Entity Typing +1

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

no code implementations 19 Mar 2024 Zhigang Chen, Benjia Zhou, Jun Li, Jun Wan, Zhen Lei, Ning Jiang, Quan Lu, Guoqing Zhao

Although some approaches work towards gloss-free SLT through jointly training the visual encoder and translation network, these efforts still suffer from poor performance and inefficient use of the powerful Large Language Model (LLM).

Gloss-free Sign Language Translation Language Modelling +3

SELM: Speech Enhancement Using Discrete Tokens and Language Models

no code implementations 15 Dec 2023 Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Ning Jiang, Guoqing Zhao, Lei Xie

Given the intrinsic similarity between speech generation and speech enhancement, harnessing semantic information holds potential advantages for speech enhancement tasks.

Self-Supervised Learning Speech Enhancement

Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning

no code implementations 26 Oct 2023 Xinfa Zhu, Yuke Li, Yi Lei, Ning Jiang, Guoqing Zhao, Lei Xie

This paper aims to build an expressive TTS system for multi-speakers, synthesizing a target speaker's speech with multiple styles and emotions.

Contrastive Learning Expressive Speech Synthesis

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

no code implementations 7 Oct 2023 Ze Li, Yuke Lin, Ning Jiang, Xiaoyi Qin, Guoqing Zhao, Haiying Wu, Ming Li

Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for semi-supervised domain adaptation in speaker verification tasks.

Clustering Denoising +3

Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

1 code implementation 25 Sep 2023 Yuke Lin, Xiaoyi Qin, Ning Jiang, Guoqing Zhao, Ming Li

It is widely acknowledged that discriminative representation for speaker verification can be extracted from verbal speech.

Speaker Verification

VoxBlink: A Large Scale Speaker Verification Dataset on Camera

no code implementations 14 Aug 2023 Yuke Lin, Xiaoyi Qin, Guoqing Zhao, Ming Cheng, Ning Jiang, Haiyang Wu, Ming Li

In this paper, we introduce a large-scale and high-quality audio-visual speaker verification dataset, named VoxBlink.

Speaker Recognition Speaker Verification

TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding

1 code implementation COLING 2022 Zichen Liu, Xuyuan Liu, Yanlong Wen, Guoqing Zhao, Fen Xia, Xiaojie Yuan

However, most previous works ignore the decisive information contained in structured medical data in EHRs, which is hard to be captured from the noisy clinical notes.

MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition

no code implementations 22 Feb 2023 Jiaming Zhou, Shiwan Zhao, Ning Jiang, Guoqing Zhao, Yong Qin

Unsupervised domain adaptation (UDA) aims to improve the performance on the unlabeled target domain by transferring knowledge from the source to the target domain.

Automatic Speech Recognition (ASR) +3

MoSE: Modality Split and Ensemble for Multimodal Knowledge Graph Completion

1 code implementation 17 Oct 2022 Yu Zhao, Xiangrui Cai, Yike Wu, Haiwei Zhang, Ying Zhang, Guoqing Zhao, Ning Jiang

Based on these embeddings, in the inference phase, we first make modality-split predictions and then exploit various ensemble methods to combine the predictions with different weights, which models the modality importance dynamically.

Knowledge Graph Completion Relation +1

Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances

1 code implementation COLING 2022 Yike Wu, Yu Zhao, Shiwan Zhao, Ying Zhang, Xiaojie Yuan, Guoqing Zhao, Ning Jiang

In this work, we define the training instances with the same question type but different answers as "superficially similar instances", and attribute the language priors to the confusion of VQA model on such instances.

Attribute Question Answering +1

PPDL: Predicate Probability Distribution Based Loss for Unbiased Scene Graph Generation

no code implementations CVPR 2022 Wei Li, Haiwei Zhang, Qijie Bai, Guoqing Zhao, Ning Jiang, Xiaojie Yuan

However, the application value of SG on downstream tasks is severely limited by the predicate classification bias, which is caused by long-tailed data and presented as semantic bias of predicted relation predicates.

Graph Generation Predicate Classification +1

TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation

1 code implementation 25 Oct 2021 Wei Wang, Yu Zhou, Jiahao Lv, Dayan Wu, Guoqing Zhao, Ning Jiang, Weiping Wang

The research focus of scene text detection and recognition has shifted to arbitrary shape text in recent years, where the text shape representation is a fundamental problem.

Scene Text Detection Scene Text Recognition +2
