no code implementations • 9 Dec 2024 • Mehdi Noroozi, Isma Hadji, Victor Escorcia, Anestis Zaganidis, Brais Martinez, Georgios Tzimiropoulos
To maintain high visual quality on such a low compute budget, we introduce a number of training strategies: (i) a novel conditioning mechanism on the low-resolution input, coined bidirectional conditioning, which tailors the SD model for the SR task.
no code implementations • 5 Dec 2024 • Yassine Ouali, Adrian Bulat, Alexandros Xenos, Anestis Zaganidis, Ioannis Maniadis Metaxas, Brais Martinez, Georgios Tzimiropoulos
Contrastively-trained Vision-Language Models (VLMs) like CLIP have become the de facto approach for discriminative vision-language representation learning.
no code implementations • 3 Nov 2020 • Enrique Sanchez, Adrian Bulat, Anestis Zaganidis, Georgios Tzimiropoulos
The second stage uses another dataset of randomly chosen labeled frames to train a regressor on top of our spatio-temporal model for estimating the AU intensity.
no code implementations • 2 Jul 2018 • Li Sun, Zhi Yan, Anestis Zaganidis, Cheng Zhao, Tom Duckett
Most existing semantic mapping approaches focus on improving semantic understanding of single frames, rather than 3D refinement of semantic maps (i.e., fusing semantic observations).