no code implementations • ICCV 2023 • Taegyeong Lee, Jeonghun Kang, Hyeonyu Kim, Taehwan Kim
Representing wild sounds as images is an important but challenging task due to the lack of paired sound-image datasets and the significant differences between the two modalities.
1 code implementation • 29 Jun 2022 • Hyeonyu Kim, Jongeun Kim, Jeonghun Kang, Sanguk Park, Dongchan Park, Taehwan Kim
This technical report presents the second-place model for AQTC, a task newly introduced in the CVPR 2022 LOng-form VidEo Understanding (LOVEU) challenges.