Search Results for author: Ke Hu

Found 27 papers, 4 papers with code

Feature Norm Regularized Federated Learning: Transforming Skewed Distributions into Global Insights

1 code implementation • 12 Dec 2023 • Ke Hu, Weidong Qiu, Peng Tang

Our comprehensive analysis reveals that FNR-FL not only accelerates convergence but also significantly surpasses other contemporary federated learning algorithms in test accuracy, particularly under feature distribution skew scenarios.

Federated Learning

Improving Joint Speech-Text Representations Without Alignment

no code implementations • 11 Aug 2023 • Cal Peyser, Zhong Meng, Ke Hu, Rohit Prabhavalkar, Andrew Rosenberg, Tara N. Sainath, Michael Picheny, Kyunghyun Cho

The last year has seen astonishing progress in text-prompted image generation premised on the idea of a cross-modal representation space in which the text and image domains are represented jointly.

Speech Recognition

Mixture-of-Expert Conformer for Streaming Multilingual ASR

no code implementations • 25 May 2023 • Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Francoise Beaufays

We evaluate the proposed model on a set of 12 languages, and achieve an average 11.9% relative improvement in WER over the baseline.

Automatic Speech Recognition, speech-recognition, +1

A Deliberation-based Joint Acoustic and Text Decoder

no code implementations • 23 Mar 2023 • Sepand Mavandadi, Tara N. Sainath, Ke Hu, Zelin Wu

We propose a new two-pass E2E speech recognition model that improves ASR performance by training on a combination of paired data and unpaired text data.

speech-recognition, Speech Recognition

Improving Deliberation by Text-Only and Semi-Supervised Training

no code implementations • 29 Jun 2022 • Ke Hu, Tara N. Sainath, Yanzhang He, Rohit Prabhavalkar, Trevor Strohman, Sepand Mavandadi, Weiran Wang

Text-only and semi-supervised training based on audio-only data has gained popularity recently due to the wide availability of unlabeled text and speech data.

Language Modelling

Hybrid CNN Based Attention with Category Prior for User Image Behavior Modeling

no code implementations • 5 May 2022 • Xin Chen, Qingtao Tang, Ke Hu, Yue Xu, Shihang Qiu, Jia Cheng, Jun Lei

In Meituan, one of the largest e-commerce platforms in China, an item is typically displayed with its image, and whether a user clicks the item is usually influenced by that image, which implies that users' image behaviors are helpful for understanding their visual preferences and improving the accuracy of CTR prediction.

Click-Through Rate Prediction

Streaming Align-Refine for Non-autoregressive Deliberation

no code implementations • 15 Apr 2022 • Weiran Wang, Ke Hu, Tara N. Sainath

We propose a streaming non-autoregressive (non-AR) decoding algorithm to deliberate the hypothesis alignment of a streaming RNN-T model.

Continual Learning for CTR Prediction: A Hybrid Approach

no code implementations • 18 Jan 2022 • Ke Hu, Yi Qi, Jianqiang Huang, Jia Cheng, Jun Lei

To address this problem, we formulate CTR prediction as a continual learning task and propose COLF, a hybrid COntinual Learning Framework for CTR prediction, which has a memory-based modular architecture that is designed to adapt, learn and give predictions continuously when faced with non-stationary drifting click data streams.

Click-Through Rate Prediction, Continual Learning

AutoHEnsGNN: Winning Solution to AutoGraph Challenge for KDD Cup 2020

1 code implementation • 25 Nov 2021 • Jin Xu, Mingjian Chen, Jianqiang Huang, Xingyuan Tang, Ke Hu, Jian Li, Jia Cheng, Jun Lei

Graph Neural Networks (GNNs) have become increasingly popular and achieved impressive results in many graph-based applications.

Graph Classification, Node Classification

Polarized skylight orientation determination artificial neural network

no code implementations • 6 Jul 2021 • Huaju Liang, Hongyang Bai, Ke Hu, Xinbo Lv

This paper proposes an artificial neural network to determine orientation using polarized skylight.

Deep Position-wise Interaction Network for CTR Prediction

1 code implementation • 10 Jun 2021 • Jianqiang Huang, Ke Hu, Qingtao Tang, Mingjian Chen, Yi Qi, Jia Cheng, Jun Lei

Click-through rate (CTR) prediction plays an important role in online advertising and recommender systems.

Click-Through Rate Prediction, Position, +1

Transformer Based Deliberation for Two-Pass Speech Recognition

no code implementations • 27 Jan 2021 • Ke Hu, Ruoming Pang, Tara N. Sainath, Trevor Strohman

In this work, we explore using transformer layers instead of long short-term memory (LSTM) layers for deliberation rescoring.

speech-recognition, Speech Recognition, +1

Textual Echo Cancellation

no code implementations • 13 Aug 2020 • Shaojin Ding, Ye Jia, Ke Hu, Quan Wang

In this paper, we propose Textual Echo Cancellation (TEC) - a framework for cancelling the text-to-speech (TTS) playback echo from overlapping speech recordings.

Acoustic echo cancellation, speech-recognition, +1

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

no code implementations • 28 Mar 2020 • Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao

Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i.e., word error rate (WER), and latency, i.e., the time the hypothesis is finalized after the user stops speaking.

Sentence

Deliberation Model Based Two-Pass End-to-End Speech Recognition

no code implementations • 17 Mar 2020 • Ke Hu, Tara N. Sainath, Ruoming Pang, Rohit Prabhavalkar

End-to-end (E2E) models have made rapid progress in automatic speech recognition (ASR) and perform competitively relative to conventional models.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models

no code implementations • 21 Jun 2019 • Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak

Contextual automatic speech recognition, i.e., biasing recognition towards a given context (e.g., a user's playlists or contacts), is challenging in end-to-end (E2E) models.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +1

Adversarial Training for Multilingual Acoustic Modeling

no code implementations • 17 Jun 2019 • Ke Hu, Hasim Sak, Hank Liao

In this work, we apply the domain adversarial network to encourage the shared layers of a multilingual model to learn language-invariant features.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +2

Attaining the Unattainable? Reassessing Claims of Human Parity in Neural Machine Translation

1 code implementation • WS 2018 • Antonio Toral, Sheila Castilho, Ke Hu, Andy Way

We reassess a recent study (Hassan et al., 2018) that claimed that machine translation (MT) has reached human parity for the translation of news from Chinese into English, using pairwise ranking and considering three variables that were not taken into account in that previous study: the language in which the source side of the test set was originally written, the translation proficiency of the evaluators, and the provision of inter-sentential context.

Machine Translation, Translation
