no code implementations • 17 Aug 2023 • Liang Wang, Nan Zhang, Xiaoyang Qu, Jianzong Wang, Jiguang Wan, Guokuan Li, Kaiyu Hu, Guilin Jiang, Jing Xiao
In this paper, we introduce EdgeMA, a practical and efficient video analytics system designed to adapt models to shifts in real-world video streams over time, addressing the data drift problem.
no code implementations • 27 Jun 2023 • Liang Wang, Kai Lu, Nan Zhang, Xiaoyang Qu, Jianzong Wang, Jiguang Wan, Guokuan Li, Jing Xiao
This paper proposes Shoggoth, an efficient edge-cloud collaborative architecture, for boosting inference performance on real-time video of changing scenes.
no code implementations • 27 Jun 2023 • Chenghao Liu, Xiaoyang Qu, Jianzong Wang, Jing Xiao
To address local forgetting caused by new classes of new tasks and global forgetting brought by non-i. i. d (non-independent and identically distributed) class imbalance across different local clients, we proposed an Enhancer distillation method to modify the imbalance between old and new knowledge and repair the non-i. i. d.
no code implementations • 17 Mar 2023 • Jinggang Chen, Xiaoyang Qu, Junjie Li, Jianzong Wang, Jiguang Wan, Jing Xiao
Out-of-distribution (OOD) detection aims at enhancing standard deep neural networks to distinguish anomalous inputs from original training data.
no code implementations • 14 Mar 2023 • Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, Jing Xiao
Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision.
no code implementations • 15 Oct 2022 • Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, Jing Xiao
Unsupervised representation learning for speech audios attained impressive performances for speech recognition tasks, particularly when annotated speech is limited.
no code implementations • 7 Oct 2022 • Jianhan Wu, Jianzong Wang, Shijing Si, Xiaoyang Qu, Jing Xiao
Most existing methods encode the texture of the whole reference human image into a latent space, and then utilize a decoder to synthesize the image texture of the target pose.
no code implementations • 30 Sep 2022 • Denghao Li, Yuqiao Zeng, Jianzong Wang, Lingwei Kong, Zhangcheng Huang, Ning Cheng, Xiaoyang Qu, Jing Xiao
Buddhism is an influential religion with a long-standing history and profound philosophy.
no code implementations • 30 Sep 2022 • Chendong Zhao, Jianzong Wang, Wen qi Wei, Xiaoyang Qu, Haoqian Wang, Jing Xiao
For multi-head attention in Transformer ASR, it is not easy to model monotonic alignments in different heads.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 21 Sep 2022 • Shijing Si, Jianzong Wang, xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao
Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been widely applied in many scenarios.
no code implementations • 26 May 2022 • Nan Zhang, Jianzong Wang, Zhenhou Hong, Chendong Zhao, Xiaoyang Qu, Jing Xiao
Therefore, we propose an approach to derive utterance-level speaker embeddings via a Transformer architecture that uses a novel loss function named diffluence loss to integrate the feature information of different Transformer layers.
no code implementations • 26 May 2022 • Jianzong Wang, Shijing Si, Zhitao Zhu, Xiaoyang Qu, Zhenhou Hong, Jing Xiao
The experiments on four programming languages (Java, C, Python, and JavaScript) show that CPR can generate causal graphs for reasonable interpretations and boost the performance of bug fixing in automatic program repair.
1 code implementation • 26 May 2022 • Zhenhou Hong, Jianzong Wang, Xiaoyang Qu, Chendong Zhao, Wei Tao, Jing Xiao
However, Quantum Neural Network (QNN) running on low-qubit quantum devices would be difficult since it is based on Variational Quantum Circuit (VQC), which requires many qubits.
no code implementations • 24 May 2022 • Chendong Zhao, Jianzong Wang, Leilai Li, Xiaoyang Qu, Jing Xiao
In this work, we propose a novel task-adaptive module which is easy to plant into any metric-based few-shot learning frameworks.
no code implementations • 21 Feb 2022 • Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, Jing Xiao
In this paper, we aim to evaluate and enhance the robustness of G2P models.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
no code implementations • 10 Jul 2021 • Shijing Si, Jianzong Wang, Xiaoyang Qu, Ning Cheng, Wenqi Wei, Xinghua Zhu, Jing Xiao
This paper investigates a novel task of talking face video generation solely from speeches.
no code implementations • 9 Jul 2021 • Zhenhou Hong, Jianzong Wang, Xiaoyang Qu, Jie Liu, Chendong Zhao, Jing Xiao
Text to speech (TTS) is a crucial task for user interaction, but TTS model training relies on a sizable set of high-quality original datasets.
no code implementations • 23 Feb 2021 • Xiaoyang Qu, Jianzong Wang, Jing Xiao
We add an activation regularizer and a virtual interpolation method to improve the data generation efficiency.
no code implementations • 13 Aug 2020 • Xiaoyang Qu, Jianzong Wang, Jing Xiao
We borrow the idea of neural architecture search(NAS) for the textindependent speaker verification task.
Neural Architecture Search
Text-Independent Speaker Verification