Search Results for author: Junteng Jia

Found 14 papers, 7 papers with code

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs

no code implementations12 Nov 2023 Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

In this work, we extend the instruction-tuned Llama-2 model with end-to-end general-purpose speech processing and reasoning abilities while maintaining the wide range of original LLM capabilities, without using any carefully curated paired data.

Question Answering

Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model

no code implementations22 Sep 2023 Jiamin Xie, Ke Li, Jinxi Guo, Andros Tjandra, Yuan Shangguan, Leda Sari, Chunyang Wu, Junteng Jia, Jay Mahadeokar, Ozlem Kalinli

In this work, we propose the use of an adaptive masking approach in two scenarios for pruning a multilingual ASR model efficiently, each resulting in sparse monolingual models or a sparse multilingual model (named as Dynamic ASR Pathways).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

no code implementations5 Sep 2023 Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra

Results demonstrate that our TODM Supernet either matches or surpasses the performance of manually tuned models by up to a relative of 3% better in word error rate (WER), while efficiently keeping the cost of training many models at a small constant.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Prompting Large Language Models with Speech Recognition Abilities

no code implementations21 Jul 2023 Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

Furthermore, we perform ablation studies to investigate whether the LLM can be completely frozen during training to maintain its original capabilities, scaling up the audio encoder, and increasing the audio encoder striding to generate fewer embeddings.

Abstractive Text Summarization Automatic Speech Recognition +3

Multi-Head State Space Model for Speech Recognition

no code implementations21 May 2023 Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales

State space models (SSMs) have recently shown promising results on small-scale sequence and language modelling tasks, rivalling and outperforming many attention-based approaches.

Language Modelling speech-recognition +1

Anchored Speech Recognition with Neural Transducers

no code implementations20 Oct 2022 Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli

In this paper, we investigate anchored speech recognition to make neural transducers robust to background speech.

speech-recognition Speech Recognition

Understanding Non-linearity in Graph Neural Networks from the Bayesian-Inference Perspective

1 code implementation22 Jul 2022 Rongzhe Wei, Haoteng Yin, Junteng Jia, Austin R. Benson, Pan Li

Graph neural networks (GNNs) have shown superiority in many prediction tasks over graphs due to their impressive capability of capturing nonlinear relations in graph-structured data.

Bayesian Inference Node Classification

Federated Domain Adaptation for ASR with Full Self-Supervision

no code implementations30 Mar 2022 Junteng Jia, Jay Mahadeokar, Weiyi Zheng, Yuan Shangguan, Ozlem Kalinli, Frank Seide

Cross-device federated learning (FL) protects user privacy by collaboratively training a model on user devices, therefore eliminating the need for collecting, storing, and manually labeling user data.

Automatic Speech Recognition (ASR) Data Augmentation +2

Graph Belief Propagation Networks

1 code implementation6 Jun 2021 Junteng Jia, Cenk Baykal, Vamsi K. Potluru, Austin R. Benson

With the wide-spread availability of complex relational data, semi-supervised node classification in graphs has become a central machine learning problem.

Classification Node Classification

A Unifying Generative Model for Graph Learning Algorithms: Label Propagation, Graph Convolutions, and Combinations

1 code implementation19 Jan 2021 Junteng Jia, Austin R. Benson

Semi-supervised learning on graphs is a widely applicable problem in network science and machine learning.

Graph Learning

Residual Correlation in Graph Neural Network Regression

2 code implementations19 Feb 2020 Junteng Jia, Austin R. Benson

A graph neural network transforms features in each vertex's neighborhood into a vector representation of the vertex.

regression

Neural Jump Stochastic Differential Equations

2 code implementations NeurIPS 2019 Junteng Jia, Austin R. Benson

Many time series are effectively generated by a combination of deterministic continuous flows along with discrete jumps sparked by stochastic events.

Point Processes Time Series +1

Graph-based Semi-Supervised & Active Learning for Edge Flows

1 code implementation17 May 2019 Junteng Jia, Michael T. Schaub, Santiago Segarra, Austin R. Benson

The first strategy selects edges to minimize the reconstruction error bound and works well on flows that are approximately divergence-free.

Active Learning

Detecting Core-Periphery Structure in Spatial Networks

1 code implementation20 Aug 2018 Junteng Jia, Austin R. Benson

The core-periphery structure, which decompose a network into a densely-connected core and a sparsely-connected periphery, constantly emerges from spatial networks such as traffic, biological and social networks.

Social and Information Networks Physics and Society

Cannot find the paper you are looking for? You can Submit a new open access paper.