Search Results for author: Ali Zare

Found 2 papers, 1 papers with code

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

2 code implementations21 Jul 2021 Yinghao Aaron Li, Ali Zare, Nima Mesgarani

We present an unsupervised non-parallel many-to-many voice conversion (VC) method using a generative adversarial network (GAN) called StarGAN v2.

Generative Adversarial Network Voice Conversion

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos

no code implementations27 Mar 2024 Ali Zare, Yulei Niu, Hammad Ayyubi, Shih-Fu Chang

(3) Annotation cost: Annotating instructional videos with step-level labels (i. e., timestamp) or sequence-level labels (i. e., action category) is demanding and labor-intensive, limiting its generalizability to large-scale datasets. In this work, we propose a new and practical setting, called adaptive procedure planning in instructional videos, where the procedure length is not fixed or pre-determined.

Cannot find the paper you are looking for? You can Submit a new open access paper.