Search Results for author: Sourav Bhattacharya

Found 15 papers, 2 papers with code

SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding

1 code implementation12 Jul 2023 Titouan Parcollet, Rogier Van Dalen, Shucong Zhang, Sourav Bhattacharya

Unfortunately, token mixing with self-attention takes quadratic time in the length of the speech utterance, slowing down inference as well as training and increasing memory consumption.

speech-recognition Speech Recognition

Cross-Attention is all you need: Real-Time Streaming Transformers for Personalised Speech Enhancement

no code implementations8 Nov 2022 Shucong Zhang, Malcolm Chadwick, Alberto Gil C. P. Ramos, Sourav Bhattacharya

Personalised speech enhancement (PSE), which extracts only the speech of a target user and removes everything else from a recorded audio clip, can potentially improve users' experiences of audio AI modules deployed in the wild.

Speech Enhancement

Defensive Tensorization

no code implementations26 Oct 2021 Adrian Bulat, Jean Kossaifi, Sourav Bhattacharya, Yannis Panagakis, Timothy Hospedales, Georgios Tzimiropoulos, Nicholas D Lane, Maja Pantic

We propose defensive tensorization, an adversarial defence technique that leverages a latent high-order factorization of the network.

Audio Classification Image Classification

Conditioning Sequence-to-sequence Networks with Learned Activations

no code implementations ICLR 2022 Alberto Gil Couto Pimentel Ramos, Abhinav Mehrotra, Nicholas Donald Lane, Sourav Bhattacharya

Conditional neural networks play an important role in a number of sequence-to-sequence modeling tasks, including personalized sound enhancement (PSE), speaker dependent automatic speech recognition (ASR), and generative modeling such as text-to-speech synthesis.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Monotonicity of the over-rotation intervals for bimodal maps

no code implementations4 Mar 2021 Sourav Bhattacharya, Alexander Blokh

We show that the connectedness of the set of parameters for which the over-rotation interval of a bimodal interval map is constant.

Dynamical Systems

Fermionic Bell violation in the presence of background electromagnetic fields in the cosmological de Sitter spacetime

no code implementations23 Feb 2021 Md Sabir Ali, Sourav Bhattacharya, Shankhadeep Chakrabortty, Shagun Kaushal

Using the squeezed state expansion, we then demonstrate the Bell violations for the vacuum and some maximally entangled initial states.

High Energy Physics - Theory General Relativity and Quantum Cosmology

Unicritical Laminations

no code implementations20 Jan 2021 Sourav Bhattacharya, Alexander Blokh, Dierk Schleicher

In the end we verify the \emph{Fatou conjecture} for the unicritical laminations and extend the \emph{Lavaurs algorithm} onto $\mathrm{UML}_d$.

Dynamical Systems Primary: 54F20, Secondary: 30C35

Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems

no code implementations11 Aug 2020 Ravichander Vipperla, Sangjun Park, Kihyun Choo, Samin Ishtiaq, Kyoungbo Min, Sourav Bhattacharya, Abhinav Mehrotra, Alberto Gil C. P. Ramos, Nicholas D. Lane

LPCNet is an efficient vocoder that combines linear prediction and deep neural network modules to keep the computational complexity low.

Defensive Tensorization: Randomized Tensor Parametrization for Robust Neural Networks

no code implementations25 Sep 2019 Adrian Bulat, Jean Kossaifi, Sourav Bhattacharya, Yannis Panagakis, Georgios Tzimiropoulos, Nicholas D. Lane, Maja Pantic

As deep neural networks become widely adopted for solving most problems in computer vision and audio-understanding, there are rising concerns about their potential vulnerability.

Adversarial Defense Audio Classification +1

MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors

no code implementations21 Aug 2019 Royson Lee, Stylianos I. Venieris, Łukasz Dudziak, Sourav Bhattacharya, Nicholas D. Lane

In recent years, convolutional networks have demonstrated unprecedented performance in the image restoration task of super-resolution (SR).

Cloud Computing Image Restoration +2

Understanding Opportunities for Efficiency in Single-image Super Resolution Networks

no code implementations ICLR 2019 Royson Lee, Nic Lane, Marko Stankovic, Sourav Bhattacharya

A successful application of convolutional architectures is to increase the resolution of single low-resolution images -- a image restoration task called super-resolution (SR).

Image Restoration Image Super-Resolution

Towards Using Unlabeled Data in a Sparse-coding Framework for Human Activity Recognition

no code implementations25 Dec 2013 Sourav Bhattacharya, Petteri Nurmi, Nils Hammerla, Thomas Plötz

We propose a sparse-coding framework for activity recognition in ubiquitous and mobile computing that alleviates two fundamental problems of current supervised learning approaches.

Human Activity Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.