Multi-Modal Pre-Training for Automated Speech Recognition

no code implementations • 12 Oct 2021 • David M. Chan, Shalini Ghosh, Debmalya Chakrabarty, Björn Hoffmeister

Traditionally, research in automated speech recognition has focused on local-first encoding of audio representations to predict the spoken phonemes in an utterance.

Language Modelling, Self-Supervised Learning

Active Learning for Video Description With Cluster-Regularized Ensemble Ranking

no code implementations • 27 Jul 2020 • David M. Chan, Sudheendra Vijayanarasimhan, David A. Ross, John Canny

Automatic video captioning aims to train models to generate text descriptions for all segments in a video; however, the most effective approaches require large amounts of manual annotation, which is slow and expensive.

Active Learning, Video Captioning

Exploring Exploration: Comparing Children with RL Agents in Unified Environments

1 code implementation • 6 May 2020 • Eliza Kosoy, Jasmine Collins, David M. Chan, Sandy Huang, Deepak Pathak, Pulkit Agrawal, John Canny, Alison Gopnik, Jessica B. Hamrick

Research in developmental psychology consistently shows that children explore the world thoroughly and efficiently, and that this exploration allows them to learn.

Diagnostic Visualization for Deep Neural Networks Using Stochastic Gradient Langevin Dynamics

1 code implementation • 11 Dec 2018 • Biye Jiang, David M. Chan, Tianhao Zhang, John F. Canny

Finally, we show that diagnostic visualization using LDAM leads to a novel insight into the parameter averaging method for deep net training.

t-SNE-CUDA: GPU-Accelerated t-SNE and its Applications to Modern Data

1 code implementation • 31 Jul 2018 • David M. Chan, Roshan Rao, Forrest Huang, John F. Canny

Modern datasets and models are notoriously difficult to explore and analyze due to their inherent high dimensionality and massive numbers of samples.

Dimensionality Reduction

