Search Results for author: Tianyi Xu

Found 11 papers, 3 papers with code

4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters

1 code implementation • 15 Nov 2023 • Yijie Zhou, Chao Li, Jin Liang, Tianyi Xu, Xin Liu, Jun Xu

The illumination of improperly exposed photographs has been widely corrected using deep convolutional neural networks or Transformers.

4k 8k

Paper
Code

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition

no code implementations • 7 Oct 2023 • Kaixun Huang, Ao Zhang, BinBin Zhang, Tianyi Xu, Xingchen Song, Lei Xie

However, unlike shallow fusion methods that directly bias the posterior of the ASR model, deep biasing methods implicitly integrate contextual information, making it challenging to control the degree of bias.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Lightweight Improved Residual Network for Efficient Inverse Tone Mapping

1 code implementation • 8 Jul 2023 • Liqi Xue, Tianyi Xu, Yongbao Song, Yan Liu, Lei Zhang, XianTong Zhen, Jun Xu

But the majority of media images on the internet remain in 8-bit standard dynamic range (SDR) format.

Image Reconstruction inverse tone mapping +2

Paper
Code

A First Order Meta Stackelberg Method for Robust Federated Learning

no code implementations • 23 Jun 2023 • Yunian Pan, Tao Li, Henger Li, Tianyi Xu, Zizhan Zheng, Quanyan Zhu

Previous research has shown that federated learning (FL) systems are exposed to an array of security risks.

Federated Learning Meta-Learning +1

Paper
Add Code

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks

no code implementations • 6 Jun 2023 • Jianrong Wang, Yaxin Zhao, Li Liu, Tianyi Xu, Qi Li, Sen Li

Given an audio clip and a reference face image, the goal of the talking head generation is to generate a high-fidelity talking head video.

Talking Head Generation

Paper
Add Code

MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information

1 code implementation • 4 Jun 2023 • Jianrong Wang, Yuchen Huo, Li Liu, Tianyi Xu, Qi Li, Sen Li

Audio-visual speech recognition (AVSR) gains increasing attention from researchers as an important part of human-computer interaction.

Audio-Visual Speech Recognition speech-recognition +1

Paper
Code

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

no code implementations • 1 Jun 2023 • Tianyi Xu, Zhanheng Yang, Kaixun Huang, Pengcheng Guo, Ao Zhang, Biao Li, Changru Chen, Chao Li, Lei Xie

By incorporating additional contextual information, deep biasing methods have emerged as a promising solution for speech recognition of personalized words.

speech-recognition Speech Recognition

Paper
Add Code

Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

no code implementations • 21 May 2023 • Kaixun Huang, Ao Zhang, Zhanheng Yang, Pengcheng Guo, Bingshen Mu, Tianyi Xu, Lei Xie

In this study, we introduce a contextual phrase prediction network for an attention-based deep bias method.

speech-recognition Speech Recognition

Paper
Add Code

Online Learning for Adaptive Probing and Scheduling in Dense WLANs

no code implementations • 27 Dec 2022 • Tianyi Xu, Ding Zhang, Zizhan Zheng

The problem is challenging even when the link rate distributions are pre-known (the offline setting) due to the necessity of balancing the information gains from probing and the cost of reducing the data transmission opportunity.

Scheduling

Paper
Add Code

Joint AP Probing and Scheduling: A Contextual Bandit Approach

no code implementations • 6 Aug 2021 • Tianyi Xu, Ding Zhang, Parth H. Pathak, Zizhan Zheng

In contrast to traditional link scheduling problems under uncertainty, we assume that in each time step, the device can probe a subset of links before deciding which one to use.

Decision Making Scheduling

Paper
Add Code

Attention-based Residual Speech Portrait Model for Speech to Face Generation

no code implementations • 9 Jul 2020 • Jianrong Wang, Xiaosheng Hu, Li Liu, Wei Liu, Mei Yu, Tianyi Xu

Given a speaker's speech, it is interesting to see if it is possible to generate this speaker's face.

Face Generation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.