Search Results for author: Tianyi Xu

Found 14 papers, 4 papers with code

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

no code implementations | 20 Aug 2024 | Tianyi Xu, Kaixun Huang, Pengcheng Guo, Yu Zhou, Longtao Huang, Hui Xue, Lei Xie

Pre-trained multilingual speech foundation models, like Whisper, have shown impressive performance across different languages.

Task-Aware Dynamic Transformer for Efficient Arbitrary-Scale Image Super-Resolution

1 code implementation | 16 Aug 2024 | Tianyi Xu, Yiji Zhou, Xiaotao Hu, Kai Zhang, Anran Zhang, Xingye Qiu, Jun Xu

The TARC predicts the inference paths within the feature extraction backbone, specifically selecting MSTBs based on the input images and SR scales.

Image Super-Resolution

4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters

1 code implementation | 15 Nov 2023 | Yijie Zhou, Chao Li, Jin Liang, Tianyi Xu, Xin Liu, Jun Xu

The illumination of improperly exposed photographs has been widely corrected using deep convolutional neural networks or Transformers.

4k, 8k

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

no code implementations | 7 Oct 2023 | Kaixun Huang, Ao Zhang, BinBin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

However, unlike shallow fusion methods that directly bias the posterior of the ASR model, deep biasing methods implicitly integrate contextual information, making it challenging to control the degree of bias.

Automatic Speech Recognition (ASR)

A First Order Meta Stackelberg Method for Robust Federated Learning

no code implementations | 23 Jun 2023 | Yunian Pan, Tao Li, Henger Li, Tianyi Xu, Zizhan Zheng, Quanyan Zhu

Previous research has shown that federated learning (FL) systems are exposed to an array of security risks.

Federated Learning, Meta-Learning

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks

no code implementations | 6 Jun 2023 | Jianrong Wang, Yaxin Zhao, Li Liu, Tianyi Xu, Qi Li, Sen Li

Given an audio clip and a reference face image, the goal of talking head generation is to generate a high-fidelity talking head video.

Talking Head Generation

MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information

1 code implementation | 4 Jun 2023 | Jianrong Wang, Yuchen Huo, Li Liu, Tianyi Xu, Qi Li, Sen Li

Audio-visual speech recognition (AVSR) gains increasing attention from researchers as an important part of human-computer interaction.

Audio-Visual Speech Recognition, Speech Recognition

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

no code implementations | 1 Jun 2023 | Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

By incorporating additional contextual information, deep biasing methods have emerged as a promising solution for speech recognition of personalized words.

Speech Recognition

Online Learning for Adaptive Probing and Scheduling in Dense WLANs

no code implementations | 27 Dec 2022 | Tianyi Xu, Ding Zhang, Zizhan Zheng

The problem is challenging even when the link rate distributions are pre-known (the offline setting) due to the necessity of balancing the information gains from probing and the cost of reducing the data transmission opportunity.

Scheduling

Joint AP Probing and Scheduling: A Contextual Bandit Approach

no code implementations | 6 Aug 2021 | Tianyi Xu, Ding Zhang, Parth H. Pathak, Zizhan Zheng

In contrast to traditional link scheduling problems under uncertainty, we assume that in each time step, the device can probe a subset of links before deciding which one to use.

Decision Making, Scheduling

Attention-based Residual Speech Portrait Model for Speech to Face Generation

no code implementations | 9 Jul 2020 | Jianrong Wang, Xiaosheng Hu, Li Liu, Wei Liu, Mei Yu, Tianyi Xu

Given a speaker's speech, it is interesting to see if it is possible to generate this speaker's face.

Decoder, Face Generation
