Search Results for author: Antonino Furnari

Found 40 papers, 21 papers with code

Eyes Wide Unshut: Unsupervised Mistake Detection in Egocentric Procedural Video by Detecting Unpredictable Gaze

no code implementations12 Jun 2024 Michele Mazzamuto, Antonino Furnari, Giovanni Maria Farinella

In this paper, we address the challenge of unsupervised mistake detection in egocentric procedural video through the analysis of gaze signals.

Mistake Detection

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

1 code implementation3 Jun 2024 Lorenzo Mur-Labadia, Ruben Martinez-Cantin, Josechu Guerrero, Giovanni Maria Farinella, Antonino Furnari

Short-Term object-interaction Anticipation consists of detecting the location of the next-active objects, the noun and verb categories of the interaction, and the time to contact from the observation of egocentric video.

Short-term Object Interaction Anticipation

Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos

1 code implementation3 Jun 2024 Luigi Seminara, Giovanni Maria Farinella, Antonino Furnari

Task graphs learned with our approach are also shown to significantly enhance online mistake detection in procedural egocentric videos, achieving notable gains of +19. 8% and +7. 5% on the Assembly101 and EPIC-Tent datasets.

Graph Learning Online Mistake Detection +1

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs

no code implementations5 Dec 2023 Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro, Mario Valerio Giuffrida, Giovanni Maria Farinella

Instead, we propose a novel methodology which performs the adaptation leveraging existing labeled exocentric videos and a new set of unlabeled, synchronized exocentric-egocentric video pairs, for which temporal action segmentation annotations do not need to be collected.

Action Segmentation Knowledge Distillation +2

Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?

1 code implementation5 Dec 2023 Rosario Leonardi, Antonino Furnari, Francesco Ragusa, Giovanni Maria Farinella

In this study, we investigate the effectiveness of synthetic data in enhancing egocentric hand-object interaction detection.

Hand-Object Interaction Detection

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

2 code implementations CVPR 2024 Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei HUANG, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge.

Video Understanding

ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios

2 code implementations26 Sep 2023 Francesco Ragusa, Rosario Leonardi, Michele Mazzamuto, Claudia Bonanno, Rosario Scavo, Antonino Furnari, Giovanni Maria Farinella

ENIGMA-51 is a new egocentric dataset acquired in an industrial scenario by 19 subjects who followed instructions to complete the repair of electrical boards using industrial tools (e. g., electric screwdriver) and equipments (e. g., oscilloscope).

Action Detection Human-Object Interaction Detection +3

Streaming egocentric action anticipation: An evaluation scheme and approach

no code implementations29 Jun 2023 Antonino Furnari, Giovanni Maria Farinella

We propose a streaming egocentric action evaluation scheme which assumes that predictions are performed online and made available only after the model has processed the current input segment, which depends on its runtime.

Action Anticipation Knowledge Distillation

StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipation

1 code implementation8 Apr 2023 Francesco Ragusa, Giovanni Maria Farinella, Antonino Furnari

Anticipation problem has been studied considering different aspects such as predicting humans' locations, predicting hands and objects trajectories, and forecasting actions and human-object interactions.

Human-Object Interaction Detection Object +1

Visual Object Tracking in First Person Vision

no code implementations27 Sep 2022 Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

Despite a few previous attempts to exploit trackers in the FPV domain, a methodical analysis of the performance of state-of-the-art trackers is still missing.

Human-Object Interaction Detection Object +2

MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain

no code implementations19 Sep 2022 Francesco Ragusa, Antonino Furnari, Giovanni Maria Farinella

To encourage research in this field, we present MECCANO, a multimodal dataset of egocentric videos to study humans behavior understanding in industrial-like settings.

Action Anticipation Action Recognition +1

Panoptic Segmentation using Synthetic and Real Data

no code implementations14 Apr 2022 Camillo Quattrocchi, Daniele Di Mauro, Antonino Furnari, Giovanni Maria Farinella

Motivated by this observation, we propose a pipeline which allows to generate synthetic images from 3D models of real environments and real objects.

object-detection Object Detection +2

Weakly Supervised Attended Object Detection Using Gaze Data as Annotations

no code implementations14 Apr 2022 Michele Mazzamuto, Francesco Ragusa, Antonino Furnari, Giovanni Signorello, Giovanni Maria Farinella

Since labeling large amounts of data to train a standard object detector is expensive in terms of costs and time, we propose a weakly supervised version of the task which leans only on gaze data and a frame-level label indicating the class of the attended object.

Object object-detection +1

Untrimmed Action Anticipation

no code implementations8 Feb 2022 Ivan Rodin, Antonino Furnari, Dimitrios Mavroeidis, Giovanni Maria Farinella

Experiments show that the performance of current models designed for trimmed action anticipation is very limited and more research on this task is required.

Action Anticipation Action Detection

Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation

1 code implementation2 Feb 2022 Marco Rosano, Antonino Furnari, Luigi Gulino, Corrado Santoro, Giovanni Maria Farinella

All the proposed navigation models have been trained with the Habitat simulator on a synthetic office environment and have been tested on the same real-world environment using a real robotic platform.

PointGoal Navigation Scene Understanding

Ego4D: Around the World in 3,000 Hours of Egocentric Video

8 code implementations CVPR 2022 Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei HUANG, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite.

De-identification Ethics

Towards Streaming Egocentric Action Anticipation

no code implementations11 Oct 2021 Antonino Furnari, Giovanni Maria Farinella

In contrast, in this paper, we propose a "streaming" egocentric action anticipation evaluation protocol which explicitly considers model runtime for performance assessment, assuming that predictions will be available only after the current video segment is processed, which depends on the processing time of a method.

Action Anticipation Knowledge Distillation

Is First Person Vision Challenging for Object Tracking?

no code implementations31 Aug 2021 Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

Our study extensively analyses the performance of recent visual trackers and baseline FPV trackers with respect to different aspects and considering a new performance measure.

Human-Object Interaction Detection Object +2

Predicting the Future from First Person (Egocentric) Vision: A Survey

no code implementations28 Jul 2021 Ivan Rodin, Antonino Furnari, Dimitrios Mavroedis, Giovanni Maria Farinella

Egocentric videos can bring a lot of information about how humans perceive the world and interact with the environment, which can be beneficial for the analysis of human behaviour.

Future prediction

A Survey on Human-aware Robot Navigation

no code implementations22 Jun 2021 Ronja Möller, Antonino Furnari, Sebastiano Battiato, Aki Härmä, Giovanni Maria Farinella

This paper is concerned with the navigation aspect of a socially-compliant robot and provides a survey of existing solutions for the relevant areas of research as well as an outlook on possible future directions.

Human Activity Recognition Robot Navigation

Is First Person Vision Challenging for Object Tracking?

no code implementations24 Nov 2020 Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella, Christian Micheloni

Despite a few previous attempts to exploit trackers in FPV applications, a methodical analysis of the performance of state-of-the-art visual trackers in this domain is still missing.

Human-Object Interaction Detection Object +2

On Embodied Visual Navigation in Real Environments Through Habitat

1 code implementation26 Oct 2020 Marco Rosano, Antonino Furnari, Luigi Gulino, Giovanni Maria Farinella

Visual navigation models based on deep learning can learn effective policies when trained on large amounts of visual observations through reinforcement learning.

Unsupervised Domain Adaptation Visual Navigation

Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video

2 code implementations4 May 2020 Antonino Furnari, Giovanni Maria Farinella

The experiments show that the proposed architecture is state-of-the-art in the domain of egocentric videos, achieving top performances in the 2019 EPIC-Kitchens egocentric action anticipation challenge.

Action Anticipation Action Recognition +3

The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines

2 code implementations29 Apr 2020 Dima Damen, Hazel Doughty, Giovanni Maria Farinella, Sanja Fidler, Antonino Furnari, Evangelos Kazakos, Davide Moltisanti, Jonathan Munro, Toby Perrett, Will Price, Michael Wray

Our dataset features 55 hours of video consisting of 11. 5M frames, which we densely labelled for a total of 39. 6K action segments and 454. 2K object bounding boxes.

Object

Knowledge Distillation for Action Anticipation via Label Smoothing

no code implementations16 Apr 2020 Guglielmo Camporese, Pasquale Coscia, Antonino Furnari, Giovanni Maria Farinella, Lamberto Ballan

Since multiple actions may equally occur in the future, we treat action anticipation as a multi-label problem with missing labels extending the concept of label smoothing.

Action Anticipation Autonomous Driving +2

EGO-CH: Dataset and Fundamental Tasks for Visitors BehavioralUnderstanding using Egocentric Vision

no code implementations3 Feb 2020 Francesco Ragusa, Antonino Furnari, Sebastiano Battiato, Giovanni Signorello, Giovanni Maria Farinella

Equipping visitors of a cultural site with a wearable device allows to easily collect information about their preferences which can be exploited to improve the fruition of cultural goods with augmented reality.

Object Recognition Retrieval

Egocentric Visitors Localization in Cultural Sites

no code implementations10 Apr 2019 Francesco Ragusa, Antonino Furnari, Sebastiano Battiato, Giovanni Signorello, Giovanni Maria Farinella

We consider the problem of localizing visitors in a cultural site from egocentric (first person) images.

Next-Active-Object prediction from Egocentric Videos

no code implementations10 Apr 2019 Antonino Furnari, Sebastiano Battiato, Kristen Grauman, Giovanni Maria Farinella

Although First Person Vision systems can sense the environment from the user's perspective, they are generally unable to predict his intentions and goals.

Object

Cannot find the paper you are looking for? You can Submit a new open access paper.