1 code implementation • 15 Feb 2021 • Idan Achituve, Aviv Navon, Yochai Yemini, Gal Chechik, Ethan Fetaya
As a result, our method scales well with both the number of classes and data size.
1 code implementation • 5 Jun 2023 • Yochai Yemini, Aviv Shamsian, Lior Bracha, Sharon Gannot, Ethan Fetaya
We then condition a diffusion model on the video and use the extracted text through a classifier-guidance mechanism where a pre-trained ASR serves as the classifier.
no code implementations • 22 Oct 2020 • Yochai Yemini, Ethan Fetaya, Haggai Maron, Sharon Gannot
We use noisy and noiseless versions of a simulated reverberant dataset to test the proposed architecture.