no code implementations • 29 Feb 2024 • Vivek Singh, Shailza Sharma, Fabio Cuzzolin
This paper presents a novel feature-boosting network that gathers spatial context from multiple levels of feature extraction and computes the attention weights for each level of representation to generate the final class labels.
no code implementations • 20 Nov 2022 • Shailza Sharma, Abhinav Dhall, Vinay Kumar, Vivek Singh Bawa
In order to learn these fine spatio-temporal motion details, we propose a novel cross-modal audio-visual Video Face Hallucination Generative Adversarial Network (VFH-GAN).
no code implementations • 5 Oct 2021 • Shailza Sharma, Abhinav Dhall, Vinay Kumar
To explicitly encode the high frequency components, an auto encoder is proposed to generate high resolution coefficients of Discrete Cosine Transform (DCT).