no code implementations • 6 Sep 2023 • Arvind Krishna Sridhar, Yinyi Guo, Erik Visser, Rehana Mahfuz
Then, we propose a parameter efficient inference time faithful decoding algorithm that enables smaller audio captioning models with performance equivalent to larger models trained with more data.
no code implementations • 26 Mar 2020 • Eunjeong Koh, Fatemeh Saki, Yinyi Guo, Cheng-Yu Hung, Erik Visser
The neural adapter layer facilitates the target model to learn new sound events with minimal training data and maintaining the performance of the previously learned sound events similar to the source model.