1 code implementation • 18 Mar 2024 • Tornike Karchkhadze, Hassan Salami Kavaki, Mohammad Rasool Izadi, Bryce Irvin, Mikolaj Kegler, Ari Hertz, Shuo Zhang, Marko Stamenovic
We introduce a new loss term to enhance Foley sound generation in AudioLDM without post-filtering.
1 code implementation • ICASSP 2022 • Viet Anh Trinh, Hassan Salami Kavaki, Michael I Mandel
We introduce ImportantAug, a technique to augment training data for speech classification and recognition models by adding noise to unimportant regions of the speech and not to important regions.
Ranked #1 on Keyword Spotting on Google Speech Commands (Google Speech Command-Musan metric)