Search Results for author: Jennifer Williams

Found 24 papers, 5 papers with code

Exploratory Evaluation of Speech Content Masking

no code implementations • 8 Jan 2024 • Jennifer Williams, Karla Pizzi, Paul-Gauthier Noe, Sneha Das

Most recent speech privacy efforts have focused on anonymizing acoustic speaker attributes but there has not been as much research into protecting information from speech content.

Automatic Speech Recognition (ASR) +2

Protecting Publicly Available Data With Machine Learning Shortcuts

no code implementations • 30 Oct 2023 • Nicolas M. Müller, Maximilian Burgert, Pascal Debus, Jennifer Williams, Philip Sperl, Konstantin Böttinger

Machine-learning (ML) shortcuts or spurious correlations are artifacts in datasets that lead to very good training and test performance but severely limit the model's generalization capability.

Localized Shortcut Removal

no code implementations • 24 Nov 2022 • Nicolas M. Müller, Jochen Jacobs, Jennifer Williams, Konstantin Böttinger

This is often due to the existence of machine learning shortcuts - features in the data that are predictive but unrelated to the problem at hand.

Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE

no code implementations • 28 Mar 2022 • Shuvayanti Das, Jennifer Williams, Catherine Lai

We found that speech synthesis quality degrades after increasing the number of language switches within an utterance and decreasing the number of words.

Speech Synthesis, Voice Conversion

Same Cause; Different Effects in the Brain

1 code implementation • 21 Feb 2022 • Mariya Toneva, Jennifer Williams, Anand Bollu, Christoph Dann, Leila Wehbe

It is then natural to ask: "Is the activity in these different brain zones caused by the stimulus properties in the same way?"

Behavior measures are predicted by how information is encoded in an individual's brain

1 code implementation • 11 Dec 2021 • Jennifer Williams, Leila Wehbe

We hypothesize that individual differences in how information is encoded in the brain are task-specific and predict different behavior measures.

Revisiting Speech Content Privacy

no code implementations • 13 Oct 2021 • Jennifer Williams, Junichi Yamagishi, Paul-Gauthier Noe, Cassia Valentini Botinhao, Jean-Francois Bonastre

In this paper, we discuss an important aspect of speech privacy: protecting spoken content.

Human Perception of Audio Deepfakes

no code implementations • 20 Jul 2021 • Nicolas M. Müller, Karla Pizzi, Jennifer Williams

The recent emergence of deepfakes has brought manipulated and generated content to the forefront of machine learning research.

DeepFake Detection, Face Recognition +4

Exploring Disentanglement with Multilingual and Monolingual VQ-VAE

1 code implementation • 4 May 2021 • Jennifer Williams, Jason Fong, Erica Cooper, Junichi Yamagishi

This work examines the content and usefulness of disentangled phone and speaker representations from two separately trained VQ-VAE systems: one trained on multilingual data and another trained on monolingual data.

Disentanglement

Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm

1 code implementation • 21 Oct 2020 • Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi

Additionally, phones can be recognized from sub-phone VQ codebook indices in our semi-supervised VQ-VAE better than self-supervised with global conditions.

Speaker Diarization +1

Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features

no code implementations • 23 Sep 2019 • Jennifer Williams, Joanna Rownicka

Our system used convolutional neural networks (CNNs) and a representation of the speech audio that combined x-vector attack embeddings with signal processing features.

Recognizing Emotions in Video Using Multimodal DNN Feature Fusion

no code implementations • WS 2018 • Jennifer Williams, Steven Kleinegesse, Ramona Comanescu, Oana Radu

We present our system description of input-level multimodal fusion of audio, video, and text for recognition of emotions and their intensities for the 2018 First Grand Challenge on Computational Modeling of Human Multimodal Language.

Emotion Recognition, Machine Translation

DNN Multimodal Fusion Techniques for Predicting Video Sentiment

no code implementations • WS 2018 • Jennifer Williams, Ramona Comanescu, Oana Radu, Leimin Tian

Our work also improves feature selection for unimodal sentiment analysis, while proposing a novel and effective multimodal fusion architecture for this task.

Feature Selection, Multimodal Sentiment Analysis

A New Twitter Verb Lexicon for Natural Language Processing

no code implementations • LREC 2012 • Jennifer Williams, Graham Katz

We describe in-progress work on the creation of a new lexical resource that contains a list of 486 verbs annotated with quantified temporal durations for the events that they describe.

Game of Chess
