1 code implementation • 29 Jun 2022 • Yael Segal, Kasia Hitczenko, Matthew Goldrick, Adam Buchwald, Angela Roberts, Joseph Keshet
The segmentations predicted by the models are used to obtain measures of speech rate and sound duration.
1 code implementation • 31 Mar 2022 • Bronya R. Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S. Cole, Joseph Keshet
The classifier is implemented as a multi-headed fully-connected network trained to detect creaky voice, voicing, and pitch, where the last two are used to refine creak prediction.
no code implementations • 7 Mar 2021 • Tzeviya Sylvia Fuchs, Yael Segal, Joseph Keshet
In this paper, we propose a spoken term detection algorithm for simultaneous prediction and localization of in-vocabulary and out-of-vocabulary terms within an audio segment.
no code implementations • 14 Apr 2019 • Yael Segal, Tzeviya Sylvia Fuchs, Joseph Keshet
In this paper, we propose to apply object detection methods from the vision domain to the speech recognition domain by treating audio fragments as objects.