1 code implementation • 19 Jun 2023 • Zengjie Song, Zhaoxiang Zhang
The framework of visually-guided sound source separation generally consists of three parts: visual feature extraction, multimodal feature fusion, and sound signal processing.
no code implementations • ICCV 2023 • Jingtao Wang, Zengjie Song, Yuxi Wang, Jun Xiao, Yuran Yang, Shuqi Mei, Zhaoxiang Zhang
Surrogate gradient (SG) is one of the most effective approaches for training spiking neural networks (SNNs).
1 code implementation • CVPR 2022 • Zengjie Song, Yuxi Wang, Junsong Fan, Tieniu Tan, Zhaoxiang Zhang
Sound source localization in visual scenes aims to localize objects emitting the sound in a given image.
1 code implementation • 22 Jan 2020 • Zengjie Song, Oluwasanmi Koyejo, Jiangshe Zhang
By exploring the real-valued space of the soft target representation, we are able to synthesize novel images with the designated properties.
no code implementations • 25 Dec 2019 • Zengjie Song, Oluwasanmi Koyejo, Jiangshe Zhang
By exploiting the real-valued space of the soft target representations, we are able to synthesize novel images with the designated properties.