Search Results for author: Jingyu Li

Found 22 papers, 5 papers with code

Label-efficient Multi-organ Segmentation Method with Diffusion Model

no code implementations23 Feb 2024 Yongzhi Huang, Jinxin Zhu, Haseeb Hassan, Liyilei Su, Jingyu Li, Binding Huang

In this study, we present a label-efficient learning approach using a pre-trained diffusion model for multi-organ segmentation tasks in CT images.

Computed Tomography (CT) Denoising +2

Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss

no code implementations8 Jan 2024 Yusheng Tian, Jingyu Li, Tan Lee

Experimental results on a real case of tongue cancer patient confirm that the synthetic voice achieves comparable articulation quality to unimpaired natural speech, while effectively maintaining the target speaker's individuality.

Efficient Black-Box Speaker Verification Model Adaptation with Reprogramming and Backend Learning

no code implementations24 Sep 2023 Jingyu Li, Tan Lee

The development of deep neural networks (DNN) has significantly enhanced the performance of speaker verification (SV) systems in recent years.

Domain Adaptation Speaker Verification

SOOD: Towards Semi-Supervised Oriented Object Detection

1 code implementation CVPR 2023 Wei Hua, Dingkang Liang, Jingyu Li, Xiaolong Liu, Zhikang Zou, Xiaoqing Ye, Xiang Bai

Semi-Supervised Object Detection (SSOD), aiming to explore unlabeled data for boosting object detectors, has become an active task in recent years.

Object object-detection +4

Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification

no code implementations31 Oct 2022 Jingyu Li, Yusheng Tian, Tan Lee

The weights are imposed on the input features to improve the representation ability for speaker modeling.

Text-Independent Speaker Verification

Meta-Causal Feature Learning for Out-of-Distribution Generalization

no code implementations22 Aug 2022 Yuqing Wang, Xiangxian Li, Zhuang Qi, Jingyu Li, Xuelong Li, Xiangxu Meng, Lei Meng

Causal inference has become a powerful tool to handle the out-of-distribution (OOD) generalization problem, which aims to extract the invariant features.

Causal Inference Out-of-Distribution Generalization +1

What Makes for Automatic Reconstruction of Pulmonary Segments

1 code implementation7 Jul 2022 Kaiming Kuang, Li Zhang, Jingyu Li, Hongwei Li, Jiajun Chen, Bo Du, Jiancheng Yang

The automatic reconstruction of pulmonary segments by ImPulSe is accurate in metrics and visually appealing.

3D Reconstruction

Transport-Oriented Feature Aggregation for Speaker Embedding Learning

no code implementations26 Jun 2022 Yusheng Tian, Jingyu Li, Tan Lee

Pooling is needed to aggregate frame-level features into utterance-level representations for speaker modeling.

Speaker Verification

Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification

no code implementations15 Jun 2022 Jingyu Li, Yusheng Tian, Tan Lee

There is no reason to expect that these features are optimal for all different tasks, including speaker verification (SV).

Speaker Verification

EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification

no code implementations15 Jun 2022 Jingyu Li, Wei Liu, Tan Lee

This paper proposes a domain transfer network, named EDITnet, to alleviate the language-mismatch problem on speaker embeddings without requiring speaker labels.

Self-Supervised Learning Speaker Verification +1

An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech

no code implementations25 May 2022 Wei Liu, Jingyu Li, Tan Lee

The performance of child speech recognition is generally less satisfactory compared to adult speech due to limited amount of training data.

Attribute Automatic Speech Recognition +4

Improving Text-Independent Speaker Verification with Auxiliary Speakers Using Graph

no code implementations20 Sep 2021 Jingyu Li, Si-Ioi Ng, Tan Lee

Given the embeddings from a pair of input utterances, a graph model is designed to incorporate additional information from a group of embeddings representing the so-called auxiliary speakers.

Text-Independent Speaker Verification

Detection of Consonant Errors in Disordered Speech Based on Consonant-vowel Segment Embedding

no code implementations16 Jun 2021 Si-Ioi Ng, Cymie Wing-Yee Ng, Jingyu Li, Tan Lee

This paper investigates a neural network based approach to detecting consonant errors in disordered speech using consonant-vowel (CV) diphone segment in comparison to using consonant monophone segment.

Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks

no code implementations ICCV 2019 Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo

ResNeXt, still suffers from the sub-optimal performance due to manually defining the number of groups as a constant over all of the layers.

Switchable Normalization for Learning-to-Normalize Deep Representation

no code implementations22 Jul 2019 Ping Luo, Ruimao Zhang, Jiamin Ren, Zhanglin Peng, Jingyu Li

Analyses of SN are also presented to answer the following three questions: (a) Is it useful to allow each normalization layer to select its own normalizer?

SSN: Learning Sparse Switchable Normalization via SparsestMax

1 code implementation CVPR 2019 Wenqi Shao, Tianjian Meng, Jingyu Li, Ruimao Zhang, Yudian Li, Xiaogang Wang, Ping Luo

Unlike $\ell_1$ and $\ell_0$ constraints that impose difficulties in optimization, we turn this constrained optimization problem into feed-forward computation by proposing SparsestMax, which is a sparse version of softmax.

SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification

no code implementations16 Jul 2018 Ruimao Zhang, Hongbin Sun, Jingyu Li, Yuying Ge, Liang Lin, Ping Luo, Xiaogang Wang

To address the above issues, we present a novel and practical deep architecture for video person re-identification termed Self-and-Collaborative Attention Network (SCAN).

Video-Based Person Re-Identification

Differentiable Learning-to-Normalize via Switchable Normalization

3 code implementations ICLR 2019 Ping Luo, Jiamin Ren, Zhanglin Peng, Ruimao Zhang, Jingyu Li

We hope SN will help ease the usage and understand the normalization techniques in deep learning.

Cannot find the paper you are looking for? You can Submit a new open access paper.