Search Results for author: Xiaodan Zhuang

Found 6 papers, 0 papers with code

Zero-shot Event Detection using Multi-modal Fusion of Weakly Supervised Concepts

no code implementations • CVPR 2014 • Shuang Wu, Sravanthi Bondugula, Florian Luisier, Xiaodan Zhuang, Pradeep Natarajan

Current state-of-the-art systems for visual content analysis require large training sets for each class of interest, and performance degrades rapidly with fewer examples.

Attribute Event Detection

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

no code implementations • 4 Oct 2019 • Zhen Huang, Tim Ng, Leo Liu, Henry Mason, Xiaodan Zhuang, Daben Liu

The most popular way to train very deep CNNs is to use shortcut connections (SC) together with batch normalization (BN).

Inference Optimization, Speech Recognition +1

Frame-level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems

no code implementations • 7 Dec 2020 • Xinwei Li, Yuanyuan Zhang, Xiaodan Zhuang, Daben Liu

We demonstrate that f-SpecAugment is more effective than utterance-level SpecAugment for deep CNN-based hybrid models.

Data Augmentation

Exploring Retraining-Free Speech Recognition for Intra-sentential Code-Switching

no code implementations • 27 Aug 2021 • Zhen Huang, Xiaodan Zhuang, Daben Liu, Xiaoqiang Xiao, Yuchen Zhang, Sabato Marco Siniscalchi

To achieve such an ambitious goal, new mechanisms for foreign pronunciation generation and language model (LM) enrichment have been devised.

Language Modelling, Speech Recognition +1

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

no code implementations • 2 Nov 2022 • Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

This work studies the use of attention masking in transformer transducer based speech recognition for building a single configurable model for different deployment scenarios.

Speech Recognition

Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition

no code implementations • 18 Apr 2023 • Maurits Bleeker, Pawel Swietojanski, Stefan Braun, Xiaodan Zhuang

By including approximate nearest neighbour phrases (ANN-P) in the context list, we encourage the learned representation to disambiguate between similar, but not identical, biasing phrases.

Speech Recognition
