Search Results for author: Gargi Gosh

Found 1 paper, 0 papers with code

FLAP: Fast Language-Audio Pre-training

no code implementations • 2 Nov 2023 • Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Gosh

We propose Fast Language-Audio Pre-training (FLAP), a self-supervised approach that efficiently and effectively learns aligned audio and language representations through masking, contrastive learning and reconstruction.

AudioCaps, Contrastive Learning
