Search Results for author: Petros Koutras

Found 10 papers, 5 papers with code

ViDaS Video Depth-aware Saliency Network

no code implementations • 19 May 2023 • Ioanna Diamanti, Antigoni Tsiami, Petros Koutras, Petros Maragos

We introduce ViDaS, a two-stream, fully convolutional Video, Depth-Aware Saliency network to address the problem of attention modeling ``in-the-wild", via saliency prediction in videos.

object-detection Object Detection +2

Paper
Add Code

ChildBot: Multi-Robot Perception and Interaction with Children

no code implementations • 28 Aug 2020 • Niki Efthymiou, Panagiotis P. Filntisis, Petros Koutras, Antigoni Tsiami, Jack Hadfield, Gerasimos Potamianos, Petros Maragos

In this paper we present an integrated robotic system capable of participating in and performing a wide range of educational and entertainment tasks, in collaboration with one or more children.

Paper
Add Code

How to track your dragon: A Multi-Attentional Framework for real-time RGB-D 6-DOF Object Pose Tracking

1 code implementation • 21 Apr 2020 • Isidoros Marougkas, Petros Koutras, Nikos Kardaris, Georgios Retsinas, Georgia Chalvatzaki, Petros Maragos

We present a novel multi-attentional convolutional architecture to tackle the problem of real-time RGB-D 6D object pose tracking of single, known objects.

Data Augmentation Object Tracking +3

Paper
Code

STAViS: Spatio-Temporal AudioVisual Saliency Network

1 code implementation • CVPR 2020 • Antigoni Tsiami, Petros Koutras, Petros Maragos

We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-temporal visual and auditory information in order to efficiently address the problem of saliency estimation in videos.

Saliency Prediction

Paper
Code

Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection

1 code implementation • 15 Feb 2019 • Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Koutras, Athanasia Zlatintsi, Petros Maragos

Detecting visual relationships, i. e. <Subject, Predicate, Object> triplets, is a challenging Scene Understanding task approached in the past via linguistic priors or spatial information in a single feature branch.

Relationship Detection Translation +1

Paper
Code

Fusing Body Posture with Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction

1 code implementation • 7 Jan 2019 • Panagiotis P. Filntisis, Niki Efthymiou, Petros Koutras, Gerasimos Potamianos, Petros Maragos

In this paper we address the problem of multi-cue affect recognition in challenging scenarios such as child-robot interaction.

Paper
Code

SUSiNet: See, Understand and Summarize it

no code implementations • 3 Dec 2018 • Petros Koutras, Petros Maragos

In this work we propose a multi-task spatio-temporal network, called SUSiNet, that can jointly tackle the spatio-temporal problems of saliency estimation, action recognition and video summarization.

Ranked #66 on Action Recognition on HMDB-51 (using extra training data)

Action Recognition Saliency Prediction +2

Paper
Add Code

A Deep Learning Approach for Multi-View Engagement Estimation of Children in a Child-Robot Joint Attention task

no code implementations • 1 Dec 2018 • Jack Hadfield, Georgia Chalvatzaki, Petros Koutras, Mehdi Khamassi, Costas S. Tzafestas, Petros Maragos

In this work we tackle the problem of child engagement estimation while children freely interact with a robot in their room.

Paper
Add Code

LSTM-based Network for Human Gait Stability Prediction in an Intelligent Robotic Rollator

no code implementations • 1 Dec 2018 • Georgia Chalvatzaki, Petros Koutras, Jack Hadfield, Xanthi S. Papageorgiou, Costas S. Tzafestas, Petros Maragos

In this work, we present a novel framework for on-line human gait stability prediction of the elderly users of an intelligent robotic rollator using Long Short Term Memory (LSTM) networks, fusing multimodal RGB-D and Laser Range Finder (LRF) data from non-wearable sensors.

Pose Estimation

Paper
Add Code

Multimodal Visual Concept Learning with Weakly Supervised Techniques

1 code implementation • CVPR 2018 • Giorgos Bouritsas, Petros Koutras, Athanasia Zlatintsi, Petros Maragos

Despite the availability of a huge amount of video data accompanied by descriptive texts, it is not always easy to exploit the information contained in natural language in order to automatically recognize video concepts.

Action Recognition Descriptive +4

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.