Search Results for author: Sanjeev Satheesh

Found 11 papers, 3 papers with code

Topic Compositional Neural Language Model

no code implementations • 28 Dec 2017 • Wenlin Wang, Zhe Gan, Wenqi Wang, Dinghan Shen, Jiaji Huang, Wei Ping, Sanjeev Satheesh, Lawrence Carin

The TCNLM learns the global semantic coherence of a document via a neural topic model, and the probability of each learned latent topic is further used to build a Mixture-of-Experts (MoE) language model, where each expert (corresponding to one topic) is a recurrent neural network (RNN) that accounts for learning the local structure of a word sequence.

Language Modelling

Paper
Add Code

Robust Speech Recognition Using Generative Adversarial Networks

no code implementations • 5 Nov 2017 • Anuroop Sriram, Heewoo Jun, Yashesh Gaur, Sanjeev Satheesh

This paper describes a general, scalable, end-to-end framework that uses the generative adversarial network (GAN) objective to enable robust speech recognition.

Generative Adversarial Network Robust Speech Recognition +1

Paper
Add Code

Cold Fusion: Training Seq2Seq Models Together with Language Models

no code implementations • ICLR 2018 • Anuroop Sriram, Heewoo Jun, Sanjeev Satheesh, Adam Coates

Sequence-to-sequence (Seq2Seq) models with attention have excelled at tasks which involve generating natural language sentences such as machine translation, image captioning and speech recognition.

Image Captioning Language Modelling +4

Paper
Add Code

Exploring Neural Transducers for End-to-End Speech Recognition

no code implementations • 24 Jul 2017 • Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu

In this work, we perform an empirical comparison among the CTC, RNN-Transducer, and attention-based Seq2Seq models for end-to-end speech recognition.

Language Modelling speech-recognition +1

Paper
Add Code

Reducing Bias in Production Speech Models

no code implementations • 11 May 2017 • Eric Battenberg, Rewon Child, Adam Coates, Christopher Fougner, Yashesh Gaur, Jiaji Huang, Heewoo Jun, Ajay Kannan, Markus Kliegl, Atul Kumar, Hairong Liu, Vinay Rao, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu

Replacing hand-engineered pipelines with end-to-end deep learning systems has enabled strong results in applications like speech and object recognition.

Object Recognition

Paper
Add Code

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling

no code implementations • ICML 2017 • Hairong Liu, Zhenyao Zhu, Xiangang Li, Sanjeev Satheesh

These methods suffer from two major drawbacks: 1) the set of basic units is fixed, such as the set of words, characters or phonemes in speech recognition, and 2) the decomposition of target sequences is fixed.

Computational Efficiency speech-recognition +2

Paper
Add Code

Active Learning for Speech Recognition: the Power of Gradients

no code implementations • 10 Dec 2016 • Jiaji Huang, Rewon Child, Vinay Rao, Hairong Liu, Sanjeev Satheesh, Adam Coates

For speech recognition, confidence scores and other likelihood-based active learning methods have been shown to be effective.

Active Learning Informativeness +2

Paper
Add Code

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

35 code implementations • 8 Dec 2015 • Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu

We show that an end-to-end deep learning approach can be used to recognize either English or Mandarin Chinese speech--two vastly different languages.

Ranked #1 on Accented Speech Recognition on VoxForge American-Canadian

Accented Speech Recognition Noisy Speech Recognition

76,594

Paper
Code

Deep Speech: Scaling up end-to-end speech recognition

24 code implementations • 17 Dec 2014 • Awni Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng

We present a state-of-the-art speech recognition system developed using end-to-end deep learning.

Ranked #2 on Accented Speech Recognition on VoxForge American-Canadian

Accented Speech Recognition

24,279

Paper
Code

ImageNet Large Scale Visual Recognition Challenge

12 code implementations • 1 Sep 2014 • Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, Li Fei-Fei

The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images.

General Classification Image Classification +4

718

Paper
Code

Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition

no code implementations • NeurIPS 2011 • Jia Deng, Sanjeev Satheesh, Alexander C. Berg, Fei Li

We present a novel approach to efficiently learn a label tree for large scale classification with many classes.

Classification General Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.