no code implementations • 20 Mar 2024 • Jingxi Chen, Brandon Y. Feng, Haoming Cai, Mingyang Xie, Christopher Metzler, Cornelia Fermuller, Yiannis Aloimonos
Through extensive experimentation, we demonstrate the capability of our approach to synthesize high-quality videos that effectively ``rewind'' time, showcasing the potential of combining event camera technology with generative models.
no code implementations • 4 Mar 2024 • Snehesh Shrestha, Yantian Zha, Saketh Banagiri, Ge Gao, Yiannis Aloimonos, Cornelia Fermuller
NatSGD serves as a foundational resource at the intersection of machine learning and HRI research, and we demonstrate its effectiveness in training robots to understand tasks through multimodal human commands, emphasizing the significance of jointly considering speech and gestures.
1 code implementation • 26 Jul 2023 • Daniel Deniz, Cornelia Fermuller, Eduardo Ros, Manuel Rodriguez-Alvarez, Francisco Barranco
In this study, we introduce an event-based dataset on fine-grained manipulation actions and perform an experimental study on the use of transformers for action prediction with events.
no code implementations • CVPR 2023 • Eadom Dessalene, Michael Maynord, Cornelia Fermuller, Yiannis Aloimonos
In this paper we introduce a rule-based, compositional, and hierarchical modeling of action using Therbligs as our atoms.
no code implementations • 31 May 2022 • Peter Sutor, Dehao Yuan, Douglas Summers-Stay, Cornelia Fermuller, Yiannis Aloimonos
This process can be performed iteratively and even on single neural networks by instead making a consensus of multiple classification hypervectors.
no code implementations • 1 Feb 2021 • Eadom Dessalene, Chinmaya Devaraj, Michael Maynord, Cornelia Fermuller, Yiannis Aloimonos
Human actions involving hand manipulations are structured according to the making and breaking of hand-object contact, and human visual understanding of action is reliant on anticipation of contact as is demonstrated by pioneering work in cognitive science.
no code implementations • 1 Sep 2020 • Matthew Evanusa, Cornelia Fermuller, Yiannis Aloimonos
Here we show that a large, deep layered SNN with dynamical, chaotic activity mimicking the mammalian cortex with biologically-inspired learning rules, such as STDP, is capable of encoding information from temporal data.
no code implementations • 5 Jun 2020 • Eadom Dessalene, Michael Maynord, Chinmaya Devaraj, Cornelia Fermuller, Yiannis Aloimonos
We introduce Egocentric Object Manipulation Graphs (Ego-OMG) - a novel representation for activity modeling and anticipation of near future actions integrating three components: 1) semantic temporal structure of activities, 2) short-term dynamics, and 3) representations for appearance.
no code implementations • 25 Jan 2020 • John Kanu, Eadom Dessalene, Xiaomin Lin, Cornelia Fermuller, Yiannis Aloimonos
While traditional methods for instruction-following typically assume prior linguistic and perceptual knowledge, many recent works in reinforcement learning (RL) have proposed learning policies end-to-end, typically by training neural networks to map joint representations of observations and instructions directly to actions.
no code implementations • 18 Mar 2019 • Anton Mitrokhin, Chengxi Ye, Cornelia Fermuller, Yiannis Aloimonos, Tobi Delbruck
In addition to camera egomotion and a dense depth map, the network estimates pixel-wise independently moving object segmentation and computes per-object 3D translational velocities for moving objects.
no code implementations • 16 Nov 2018 • Konstantinos Zampogiannis, Cornelia Fermuller, Yiannis Aloimonos
In this paper, we introduce a non-rigid registration pipeline for pairs of unorganized point clouds that may be topologically different.
no code implementations • 13 Jul 2018 • Konstantinos Zampogiannis, Kanishka Ganguly, Cornelia Fermuller, Yiannis Aloimonos
When we physically interact with our environment using our hands, we touch objects and force them to move: contact and motion are defining properties of manipulation.
no code implementations • 8 Jul 2018 • Francisco Barranco, Cornelia Fermuller, Eduardo Ros
The potential of our approach is demonstrated in a multi-target tracking application using Kalman filters to smooth the trajectories.
1 code implementation • 1 Jul 2018 • Konstantinos Zampogiannis, Cornelia Fermuller, Yiannis Aloimonos
We introduce cilantro, an open-source C++ library for geometric and general-purpose point cloud data processing.
no code implementations • 12 Mar 2018 • Anton Mitrokhin, Cornelia Fermuller, Chethan Parameshwara, Yiannis Aloimonos
Event-based vision sensors, such as the Dynamic Vision Sensor (DVS), are ideally suited for real-time motion analysis.
no code implementations • 2 Aug 2017 • Chengxi Ye, Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos
We conclude this paper with the construction of a novel contractive neural network.
no code implementations • 14 Nov 2016 • Went Luan, Yezhou Yang, Cornelia Fermuller, John S. Baras
In this work, we present a fast target detection framework for real-world robotics applications.
no code implementations • 12 Sep 2016 • Ren Mao, John S. Baras, Yezhou Yang, Cornelia Fermuller
It is designed to adapt the original imitation trajectories, which are learned from demonstrations, to novel situations with various constraints.
no code implementations • 12 Sep 2016 • Wentao Luan, Yezhou Yang, Cornelia Fermuller, John Baras
We consider the problem of object recognition in 3D using an ensemble of attribute-based classifiers.
1 code implementation • 9 May 2016 • Chengxi Ye, Chen Zhao, Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos
LightNet is a lightweight, versatile and purely Matlab-based deep learning framework.
no code implementations • 29 Jan 2016 • Chengxi Ye, Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos
For robots that have the capability to interact with the physical environment through their end effectors, understanding the surrounding scenes is not merely a task of image classification or object recognition.
no code implementations • 10 Dec 2015 • Yezhou Yang, Yi Li, Cornelia Fermuller, Yiannis Aloimonos
In this paper we consider the problem of continuously discovering image contents by actively asking image based questions and subsequently answering the questions being asked.
no code implementations • IJCNLP 2015 • Yezhou Yang, Yiannis Aloimonos, Cornelia Fermuller, Eren Erdal Aksoy
In this paper we present a formal computational framework for modeling manipulation actions.
no code implementations • ICCV 2015 • Francisco Barranco, Ching L. Teo, Cornelia Fermuller, Yiannis Aloimonos
The bio-inspired, asynchronous event-based dynamic vision sensor records temporal changes in the luminance of the scene at high temporal resolution.
no code implementations • ICCV 2015 • Ching L. Teo, Cornelia Fermuller, Yiannis Aloimonos
Symmetry, as one of the key components of Gestalt theory, provides an important mid-level cue that serves as input to higher visual processes such as segmentation.
no code implementations • 10 Nov 2015 • Somak Aditya, Yezhou Yang, Chitta Baral, Cornelia Fermuller, Yiannis Aloimonos
Specifically, commonsense reasoning is applied on (a) detections obtained from existing perception methods on given images, (b) a "commonsense" knowledge base constructed using natural language processing of image annotations and (c) lexical ontological knowledge from resources such as WordNet.
no code implementations • CVPR 2015 • Yezhou Yang, Cornelia Fermuller, Yi Li, Yiannis Aloimonos
The grasp type provides crucial information about human action.
no code implementations • CVPR 2015 • Ching Teo, Cornelia Fermuller, Yiannis Aloimonos
To this end, we use several different local cues: shape, spectral properties of boundary patches, and semi-global grouping cues that are indicative of perceived depth.
no code implementations • CVPR 2013 • Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos
There is a small set of fundamental primitive action consequences that provides a systematic high-level classification of manipulation actions.