Search Results for author: Visar Berisha

Found 23 papers, 7 papers with code

Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer

1 code implementation • NeurIPS 2023 • Jianwei Zhang, Suren Jayasuriya, Visar Berisha

A good supervised embedding for a specific machine learning task is only sensitive to changes in the label of interest and is invariant to other confounding factors.

Speaker Verification

Paper
Code

Requirements for Mass Adoption of Assistive Listening Technology by the General Public

no code implementations • 4 Mar 2023 • Thomas B. Kaufmann, Mehdi Foroogozar, Julie Liss, Visar Berisha

Assistive listening systems (ALSs) dramatically increase speech intelligibility and reduce listening effort.

Paper
Add Code

Smoothly Giving up: Robustness for Simple Models

no code implementations • 17 Feb 2023 • Tyler Sypherd, Nathan Stromberg, Richard Nock, Visar Berisha, Lalitha Sankar

There is a growing need for models that are interpretable and have reduced energy and computational cost (e. g., in health care analytics and federated learning).

Federated Learning regression

Paper
Add Code

Active Sequential Two-Sample Testing

no code implementations • 30 Jan 2023 • Weizhi Li, Karthikeyan Natesan Ramamurthy, Prad Kadambi, Pouria Saidi, Gautam Dasarathy, Visar Berisha

The classification model is adaptively updated and then used to guide an active query scheme called bimodal query to label sample features in the regions with high dependency between the feature variables and the label variables.

Two-sample testing valid +1

Paper
Add Code

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection

1 code implementation • 17 Nov 2022 • Jianwei Zhang, Julie Liss, Suren Jayasuriya, Visar Berisha

In this paper, we propose a deep learning framework for generating acoustic feature embeddings sensitive to vocal quality and robust across different corpora.

Cross-corpus

Paper
Code

TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning Library

1 code implementation • 17 Oct 2022 • Sean Kinahan, Julie Liss, Visar Berisha

The DIVA model is a computational model of speech motor control that combines a simulation of the brain regions responsible for speech production with a model of the human vocal tract.

Paper
Code

Does human speech follow Benford's Law?

no code implementations • 24 Mar 2022 • Leo Hsu, Visar Berisha

Researchers have observed that the frequencies of leading digits in many man-made and naturally occurring datasets follow a logarithmic curve, with digits that start with the number 1 accounting for $\sim 30\%$ of all numbers in the dataset and digits that start with the number 9 accounting for $\sim 5\%$ of all numbers in the dataset.

Paper
Add Code

Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation

no code implementations • 18 Mar 2022 • Vikram C. Mathad, Julie M. Liss, Kathy Chapman, Nancy Scherer, Visar Berisha

Spectro-temporal dynamics of consonant-vowel (CV) transition regions are considered to provide robust cues related to articulation.

Paper
Add Code

A label-efficient two-sample test

1 code implementation • 17 Nov 2021 • Weizhi Li, Gautam Dasarathy, Karthikeyan Natesan Ramamurthy, Visar Berisha

Two-sample tests evaluate whether two samples are realizations of the same distribution (the null hypothesis) or two different distributions (the alternative hypothesis).

Two-sample testing Vocal Bursts Valence Prediction

Paper
Code

Restoring degraded speech via a modified diffusion model

no code implementations • 22 Apr 2021 • Jianwei Zhang, Suren Jayasuriya, Visar Berisha

We replace the mel-spectrum upsampler in DiffWave with a deep CNN upsampler, which is trained to alter the degraded speech mel-spectrum to match that of the original speech.

Paper
Add Code

Finding the Homology of Decision Boundaries with Active Learning

1 code implementation • NeurIPS 2020 • Weizhi Li, Gautam Dasarathy, Karthikeyan Natesan Ramamurthy, Visar Berisha

We theoretically analyze the proposed framework and show that the query complexity of our active learning algorithm depends naturally on the intrinsic complexity of the underlying manifold.

Active Learning Meta-Learning +2

Paper
Code

Comparing Fisher Information Regularization with Distillation for DNN Quantization

no code implementations • NeurIPS Workshop DL-IG 2020 • Prad Kadambi, Karthikeyan Natesan Ramamurthy, Visar Berisha

A large body of work addresses deep neural network (DNN) quantization and pruning to mitigate the high computational burden of deploying DNNs.

Knowledge Distillation Quantization

Paper
Add Code

Regularization via Structural Label Smoothing

no code implementations • 7 Jan 2020 • Weizhi Li, Gautam Dasarathy, Visar Berisha

Regularization is an effective way to promote the generalization performance of machine learning models.

Paper
Add Code

Robust Estimation of Hypernasality in Dysarthria with Acoustic Model Likelihood Features

no code implementations • 26 Nov 2019 • Michael Saxon, Ayush Tripathi, Yishan Jiao, Julie Liss, Visar Berisha

To demonstrate that the features derived from these acoustic models are specific to hypernasal speech, we evaluate them across different dysarthria corpora.

BIG-bench Machine Learning

Paper
Add Code

A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders

no code implementations • 4 Jun 2019 • Rohit Voleti, Julie M. Liss, Visar Berisha

Broadly speaking, the review is split into two categories: language features based on natural language processing and speech features based on speech signal processing.

Paper
Add Code

Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder

no code implementations • 24 Apr 2019 • Rohit Voleti, Stephanie Woolridge, Julie M. Liss, Melissa Milanovic, Christopher R. Bowie, Visar Berisha

Furthermore, the same feature set can be used to build a strong binary classifier to distinguish between healthy controls and a clinical group (AUC = 0. 96) and also between patients within the clinical group with schizophrenia and bipolar I disorder (AUC = 0. 83).

Paper
Add Code

Investigating the Effects of Word Substitution Errors on Sentence Embeddings

1 code implementation • 16 Nov 2018 • Rohit Voleti, Julie M. Liss, Visar Berisha

In this paper we investigate the effects of word substitution errors, such as those coming from automatic speech recognition errors (ASR), on several state-of-the-art sentence embedding methods.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Paper
Code

Triplet Network with Attention for Speaker Diarization

no code implementations • 4 Aug 2018 • Huan Song, Megan Willi, Jayaraman J. Thiagarajan, Visar Berisha, Andreas Spanias

In automatic speech processing systems, speaker diarization is a crucial front-end component to separate segments from different speakers.

Metric Learning speaker-diarization +1

Paper
Add Code

Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression

no code implementations • 19 Apr 2018 • Shihui Yin, Gaurav Srivastava, Shreyas K. Venkataramanaiah, Chaitali Chakrabarti, Visar Berisha, Jae-sun Seo

Deep learning algorithms have shown tremendous success in many recognition tasks; however, these algorithms typically include a deep neural network (DNN) structure and a large number of parameters, which makes it challenging to implement them on power/area-constrained embedded platforms.

Binarization

Paper
Add Code

Direct estimation of density functionals using a polynomial basis

no code implementations • 21 Feb 2017 • Alan Wisler, Visar Berisha, Andreas Spanias, Alfred O. Hero

Typically, estimating these quantities requires complete knowledge of the underlying distribution followed by multi-dimensional integration.

Density Estimation

Paper
Add Code

Reducing the Model Order of Deep Neural Networks Using Information Theory

no code implementations • 16 May 2016 • Ming Tu, Visar Berisha, Yu Cao, Jae-sun Seo

In this paper, we propose a method to compress deep neural networks by using the Fisher Information metric, which we estimate through a stochastic optimization method that keeps track of second-order information in the network.

General Classification Network Pruning +2

Paper
Add Code

Empirically Estimable Classification Bounds Based on a New Divergence Measure

no code implementations • 19 Dec 2014 • Visar Berisha, Alan Wisler, Alfred O. Hero, Andreas Spanias

Information divergence functions play a critical role in statistics and information theory.

Binary Classification Classification +2

Paper
Add Code

Empirical non-parametric estimation of the Fisher Information

1 code implementation • 6 Aug 2014 • Visar Berisha, Alfred O. Hero

Traditional approaches to estimating the FIM require estimating the probability distribution function (PDF), or its parameters, along with its gradient or Hessian.

Density Estimation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.