no code implementations • 19 May 2023 • Ioanna Diamanti, Antigoni Tsiami, Petros Koutras, Petros Maragos
We introduce ViDaS, a two-stream, fully convolutional Video, Depth-Aware Saliency network to address the problem of attention modeling "in-the-wild", via saliency prediction in videos.
no code implementations • 28 Aug 2020 • Niki Efthymiou, Panagiotis P. Filntisis, Petros Koutras, Antigoni Tsiami, Jack Hadfield, Gerasimos Potamianos, Petros Maragos
In this paper we present an integrated robotic system capable of participating in and performing a wide range of educational and entertainment tasks, in collaboration with one or more children.
1 code implementation • 21 Apr 2020 • Isidoros Marougkas, Petros Koutras, Nikos Kardaris, Georgios Retsinas, Georgia Chalvatzaki, Petros Maragos
We present a novel multi-attentional convolutional architecture to tackle the problem of real-time RGB-D 6D object pose tracking of single, known objects.
1 code implementation • CVPR 2020 • Antigoni Tsiami, Petros Koutras, Petros Maragos
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-temporal visual and auditory information in order to efficiently address the problem of saliency estimation in videos.
1 code implementation • 15 Feb 2019 • Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Koutras, Athanasia Zlatintsi, Petros Maragos
Detecting visual relationships, i.e., <Subject, Predicate, Object> triplets, is a challenging Scene Understanding task approached in the past via linguistic priors or spatial information in a single feature branch.
1 code implementation • 7 Jan 2019 • Panagiotis P. Filntisis, Niki Efthymiou, Petros Koutras, Gerasimos Potamianos, Petros Maragos
In this paper we address the problem of multi-cue affect recognition in challenging scenarios such as child-robot interaction.
no code implementations • 3 Dec 2018 • Petros Koutras, Petros Maragos
In this work we propose a multi-task spatio-temporal network, called SUSiNet, that can jointly tackle the spatio-temporal problems of saliency estimation, action recognition and video summarization.
Ranked #66 on Action Recognition on HMDB-51 (using extra training data)
no code implementations • 1 Dec 2018 • Jack Hadfield, Georgia Chalvatzaki, Petros Koutras, Mehdi Khamassi, Costas S. Tzafestas, Petros Maragos
In this work we tackle the problem of child engagement estimation while children freely interact with a robot in their room.
no code implementations • 1 Dec 2018 • Georgia Chalvatzaki, Petros Koutras, Jack Hadfield, Xanthi S. Papageorgiou, Costas S. Tzafestas, Petros Maragos
In this work, we present a novel framework for on-line human gait stability prediction for elderly users of an intelligent robotic rollator using Long Short-Term Memory (LSTM) networks, fusing multimodal RGB-D and Laser Range Finder (LRF) data from non-wearable sensors.
1 code implementation • CVPR 2018 • Giorgos Bouritsas, Petros Koutras, Athanasia Zlatintsi, Petros Maragos
Despite the availability of a huge amount of video data accompanied by descriptive texts, it is not always easy to exploit the information contained in natural language in order to automatically recognize video concepts.