Search Results for author: Marco Cristani

Found 60 papers, 26 papers with code

Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning

1 code implementation ECCV 2020 Valentina Sanguineti, Pietro Morerio, Niccolò Pozzetti, Danilo Greco, Marco Cristani, Vittorio Murino

However, since 2D planar arrays are cumbersome and not as widespread as ordinary microphones, we propose that the richer information content of acoustic images can be distilled, through a self-supervised learning scheme, into more powerful audio and visual feature representations.

Cross-Modal Retrieval Representation Learning +3

Dif4FF: Leveraging Multimodal Diffusion Models and Graph Neural Networks for Accurate New Fashion Product Performance Forecasting

1 code implementation7 Dec 2024 Andrea Avogaro, Luigi Capogrosso, Franco Fummi, Marco Cristani

As a result, this paper proposes Dif4FF, a novel two-stage pipeline for New Fashion Product Performance Forecasting (NFPPF) that leverages the power of diffusion models conditioned on multimodal data related to specific clothes.

Collaborative Instance Navigation: Leveraging Agent Self-Dialogue to Minimize User Input

no code implementations2 Dec 2024 Francesco Taioli, Edoardo Zorzi, Gianni Franchi, Alberto Castellini, Alessandro Farinelli, Marco Cristani, Yiming Wang

Existing embodied instance goal navigation tasks, driven by natural language, assume human users to provide complete and nuanced instance descriptions prior to the navigation, which can be impractical in the real world as human instructions might be brief and ambiguous.

object-detection Object Detection

IoT-Based Coma Patient Monitoring System

no code implementations20 Nov 2024 Hailemicael Lulseged Yimer, Hailegabriel Dereje Degefa, Marco Cristani, Federico Cunico

Continuous monitoring of coma patients is essential but challenging, especially in developing countries with limited resources, staff, and infrastructure.

Learning based Ge'ez character handwritten recognition

no code implementations20 Nov 2024 Hailemicael Lulseged Yimer, Hailegabriel Dereje Degefa, Marco Cristani, Federico Cunico

Ge'ez, an ancient Ethiopic script of cultural and historical significance, has been largely neglected in handwriting recognition research, hindering the digitization of valuable manuscripts.

Handwriting Recognition

Upper-Body Pose-based Gaze Estimation for Privacy-Preserving 3D Gaze Target Detection

1 code implementation26 Sep 2024 Andrea Toaiari, Vittorio Murino, Marco Cristani, Cigdem Beyan

This paper presents a novel approach to tackle this problem by utilizing the person's upper-body pose and available depth maps to extract a 3D gaze direction and employing a multi-stage or an end-to-end pipeline to predict the gazed target.

Gaze Estimation Privacy Preserving

Multi-Camera Industrial Open-Set Person Re-Identification and Tracking

no code implementations5 Sep 2024 Federico Cunico, Marco Cristani

This work presents MICRO-TRACK, a Modular Industrial multi-Camera Re_identification and Open-set Tracking system that is real-time, scalable, and easy to integrate into existing industrial surveillance scenarios.

Person Re-Identification

SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation

1 code implementation1 Sep 2024 Luigi Capogrosso, Andrea Toaiari, Andrea Avogaro, Uzair Khan, Aditya Jivoji, Franco Fummi, Marco Cristani

Patterns of human motion in outdoor and indoor environments are substantially different due to the scope of the environment and the typical intentions of people therein.

Trajectory Forecasting

Self-Supervised Iterative Refinement for Anomaly Detection in Industrial Quality Control

no code implementations21 Aug 2024 Muhammad Aqeel, Shakiba Sharifi, Marco Cristani, Francesco Setti

This study introduces the Iterative Refinement Process (IRP), a robust anomaly detection methodology designed for high-stakes industrial quality control.

Anomaly Detection Defect Detection

Enhancing Split Computing and Early Exit Applications through Predefined Sparsity

1 code implementation16 Jul 2024 Luigi Capogrosso, Enrico Fraccaroli, Giulio Petrozziello, Francesco Setti, Samarjit Chakraborty, Franco Fummi, Marco Cristani

This paper introduces a novel approach to address this challenge by combining the concept of predefined sparsity with Split Computing (SC) and Early Exit (EE).

Action Recognition

MTL-Split: Multi-Task Learning for Edge Devices using Split Computing

1 code implementation8 Jul 2024 Luigi Capogrosso, Enrico Fraccaroli, Samarjit Chakraborty, Franco Fummi, Marco Cristani

However, how to partition such a multi-tasking DNN to be deployed within a SC framework has not been sufficiently studied.

Multi-Task Learning

Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection

1 code implementation4 Jul 2024 Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso

Usually, defect detection classifiers are trained on ground-truth data formed by normal samples (negative data) and samples with defects (positive data), where the latter are consistently fewer than normal samples.

Data Augmentation Defect Detection +1

I2EDL: Interactive Instruction Error Detection and Localization

no code implementations7 Jun 2024 Francesco Taioli, Stefano Rosa, Alberto Castellini, Lorenzo Natale, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Yiming Wang

We evaluate the proposed I2EDL on a dataset of instructions containing errors, and further devise a novel metric, the Success weighted by Interaction Number (SIN), to reflect both the navigation performance and the interaction effectiveness.

Vision and Language Navigation

Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection

1 code implementation1 Jun 2024 Luigi Capogrosso, Federico Girella, Francesco Taioli, Michele Dalla Chiara, Muhammad Aqeel, Franco Fummi, Francesco Setti, Marco Cristani

In general, defect detection classifiers are trained on ground-truth data formed by normal samples (negative data) and samples with defects (positive data), where the latter are consistently fewer than normal samples.

Data Augmentation Defect Detection +1

Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning

no code implementations13 Oct 2023 Geri Skenderi, Luigi Capogrosso, Andrea Toaiari, Matteo Denitto, Franco Fummi, Simone Melzi, Marco Cristani

In this paper, we propose a novel framework, dubbed Detaux, whereby a weakly supervised disentanglement procedure is used to discover a new unrelated auxiliary classification task, which allows us to go from a Single-Task Learning (STL) to a Multi-Task Learning (MTL) problem.

Auxiliary Learning Disentanglement +2

Graph-level Representation Learning with Joint-Embedding Predictive Architectures

1 code implementation27 Sep 2023 Geri Skenderi, Hang Li, Jiliang Tang, Marco Cristani

They aim to learn an energy-based model by predicting the latent representation of a target signal y from the latent representation of a context signal x. JEPAs bypass the need for negative and positive samples, traditionally required by contrastive learning while avoiding the overfitting issues associated with generative pretraining.

Contrastive Learning Data Augmentation +3

A Machine Learning-oriented Survey on Tiny Machine Learning

no code implementations21 Sep 2023 Luigi Capogrosso, Federico Cunico, Dong Seon Cheng, Franco Fummi, Marco Cristani

The emergence of Tiny Machine Learning (TinyML) has positively revolutionized the field of Artificial Intelligence by promoting the joint design of resource-constrained IoT hardware devices and their learning-based software architectures.

Model Optimization Survey

Split-Et-Impera: A Framework for the Design of Distributed Deep Learning Applications

1 code implementation22 Mar 2023 Luigi Capogrosso, Federico Cunico, Michele Lora, Marco Cristani, Franco Fummi, Davide Quaglia

Many recent pattern recognition applications rely on complex distributed architectures in which sensing and computational nodes interact together through a communication network.

A Masked Face Classification Benchmark on Low-Resolution Surveillance Images

1 code implementation23 Nov 2022 Federico Cunico, Andrea Toaiari, Marco Cristani

Results show that the richness of SF-MASK (real + synthetic images) leads all of the tested classifiers to perform better than exploiting comparative face mask datasets, on a fixed 1077 images testing set.

Classification Multi-class Classification

On the use of learning-based forecasting methods for ameliorating fashion business processes: A position paper

no code implementations9 Nov 2022 Geri Skenderi, Christian Joppi, Matteo Denitto, Marco Cristani

The fashion industry is one of the most active and competitive markets in the world, manufacturing millions of products and reaching large audiences every year.

Management Marketing +1

Leveraging commonsense for object localisation in partial scenes

no code implementations1 Nov 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Alessio Del Bue, Yiming Wang

With the proposed graph-based scene representation, we estimate the unknown position of the target object using a Graph Neural Network that implements a novel attentional message passing mechanism.

Graph Neural Network Object +1

Toward Smart Doors: A Position Paper

no code implementations23 Sep 2022 Luigi Capogrosso, Geri Skenderi, Federico Girella, Franco Fummi, Marco Cristani

In particular, a smart door system predicts the intention of people near the door based on the social context of the surrounding environment and then makes rational decisions about whether or not to open the door.

Position

I-SPLIT: Deep Network Interpretability for Split Computing

1 code implementation23 Sep 2022 Federico Cunico, Luigi Capogrosso, Francesco Setti, Damiano Carra, Franco Fummi, Marco Cristani

A neuron is important if its gradient with respect to the correct class decision is high.

Pose Forecasting in Industrial Human-Robot Collaboration

1 code implementation24 Jul 2022 Alessio Sampieri, Guido D'Amely, Andrea Avogaro, Federico Cunico, Geri Skenderi, Francesco Setti, Marco Cristani, Fabio Galasso

Pushing back the frontiers of collaborative robots in industrial environments, we propose a new Separable-Sparse Graph Convolutional Network (SeS-GCN) for pose forecasting.

Human Pose Forecasting

POP: Mining POtential Performance of new fashion products via webly cross-modal query expansion

1 code implementation22 Jul 2022 Christian Joppi, Geri Skenderi, Marco Cristani

We propose a data-centric pipeline able to generate exogenous observation data for the New Fashion Product Performance Forecasting (NFPPF) problem, i. e., predicting the performance of a brand-new clothing probe with no available past observations.

New Product Sales Forecasting Time Series +1

SHREC 2022 Track on Online Detection of Heterogeneous Gestures

no code implementations14 Jul 2022 Ariel Caputo, Marco Emporio, Andrea Giachetti, Marco Cristani, Guido Borghi, Andrea D'Eusanio, Minh-Quan Le, Hai-Dang Nguyen, Minh-Triet Tran, F. Ambellan, M. Hanik, E. Nava-Yazdani, C. von Tycowicz

This paper presents the outcomes of a contest organized to evaluate methods for the online recognition of heterogeneous gestures from sequences of 3D hand poses.

Mixed Reality

Under the Hood of Transformer Networks for Trajectory Forecasting

no code implementations22 Mar 2022 Luca Franco, Leonardo Placidi, Francesco Giuliari, Irtiza Hasan, Marco Cristani, Fabio Galasso

This paper proposes the first in-depth study of Transformer Networks (TF) and Bidirectional Transformers (BERT) for the forecasting of the individual motion of people, without bells and whistles.

Trajectory Forecasting

Spatial Commonsense Graph for Object Localisation in Partial Scenes

1 code implementation CVPR 2022 Francesco Giuliari, Geri Skenderi, Marco Cristani, Yiming Wang, Alessio Del Bue

The SCG is used to estimate the unknown position of the target object in two steps: first, we feed the SCG into a novel Proximity Prediction Network, a graph neural network that uses attention to perform distance prediction between the node representing the target object and the nodes representing the observed objects in the SCG; second, we propose a Localisation Module based on circular intersection to estimate the object position using all the predicted pairwise distances in order to be independent of any reference system.

Graph Neural Network Object +1

MovingFashion: a Benchmark for the Video-to-Shop Challenge

1 code implementation6 Oct 2021 Marco Godi, Christian Joppi, Geri Skenderi, Marco Cristani

Retrieving clothes which are worn in social media videos (Instagram, TikTok) is the latest frontier of e-fashion, referred to as "video-to-shop" in the computer vision literature.

Video-to-Shop

Well Googled is Half Done: Multimodal Forecasting of New Fashion Product Sales with Image-based Google Trends

1 code implementation20 Sep 2021 Geri Skenderi, Christian Joppi, Matteo Denitto, Marco Cristani

In particular, we propose a neural network-based approach, where an encoder learns a representation of the exogenous time series, while the decoder forecasts the sales based on the Google Trends encoding and the available visual and metadata information.

New Product Sales Forecasting Time Series +1

Infinite Feature Selection: A Graph-based Feature Filtering Approach

1 code implementation15 Jun 2020 Giorgio Roffo, Simone Melzi, Umberto Castellani, Alessandro Vinciarelli, Marco Cristani

We propose a filtering feature selection framework that considers subsets of features as paths in a graph, where a node is a feature and an edge indicates pairwise (customizable) relations among features, dealing with relevance and redundancy principles.

feature selection

The Visual Social Distancing Problem

no code implementations11 May 2020 Marco Cristani, Alessio Del Bue, Vittorio Murino, Francesco Setti, Alessandro Vinciarelli

One of the main and most effective measures to contain the recent viral outbreak is the maintenance of the so-called Social Distancing (SD).

An integrated light management system with real-time light measurement and human perception

no code implementations17 Apr 2020 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso

Illumination is important for well-being, productivity and safety across several environments, including offices, retail shops and industrial warehouses.

Management

Transformer Networks for Trajectory Forecasting

1 code implementation18 Mar 2020 Francesco Giuliari, Irtiza Hasan, Marco Cristani, Fabio Galasso

In particular, the TF model without bells and whistles yields the best score on the largest and most challenging trajectory forecasting benchmark of TrajNet.

Trajectory Forecasting

Texture Retrieval in the Wild through detection-based attributes

no code implementations29 Aug 2019 Christian Joppi, Marco Godi, Andrea Giachetti, Fabio Pellacini, Marco Cristani

Capturing the essence of a textile image in a robust way is important to retrieve it in a large repository, especially if it has been acquired in the wild (by taking a photo of the textile of interest).

Retrieval

Texel-Att: Representing and Classifying Element-based Textures by Attributes

1 code implementation29 Aug 2019 Marco Godi, Christian Joppi, Andrea Giachetti, Fabio Pellacini, Marco Cristani

It first individuates texels, characterizing them with individual attributes; subsequently, texels are grouped and characterized through layout attributes, which give the Texel-Att representation.

Attribute

Human-centric light sensing and estimation from RGBD images: The invisible light switch

no code implementations30 Jan 2019 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Alessio Del Bue, Fabio Galasso

ILS may therefore dim those luminaires, which are not seen by the user, resulting in an effective energy saving, especially in large open offices (where light may otherwise be ON everywhere for a single person).

RGBD2lux: Dense light intensity estimation with an RGBD sensor

no code implementations20 Sep 2018 Theodore Tsesmelis, Irtiza Hasan, Marco Cristani, Fabio Galasso, Alessio Del Bue

The proposed method uses both depth data and images from the sensor to provide a dense measure of light intensity in the field of view of the camera.

MX-LSTM: mixing tracklets and vislets to jointly forecast trajectories and head poses

no code implementations CVPR 2018 Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Alessio Del Bue, Fabio Galasso, Marco Cristani

Recent approaches on trajectory forecasting use tracklets to predict the future positions of pedestrians exploiting Long Short Term Memory (LSTM) architectures.

Trajectory Forecasting

Understanding Deep Architectures by Visual Summaries

2 code implementations27 Jan 2018 Marco Carletti, Marco Godi, Maedeh Aghaei, Francesco Giuliari, Marco Cristani

In deep learning, visualization techniques extract the salient patterns exploited by deep networks for image classification, focusing on single images; no effort has been spent in investigating whether these patterns are systematically related to precise semantic entities over multiple images belonging to a same class, thus failing to capture the very understanding of the image class the network has realized.

General Classification Image Classification

What your Facebook Profile Picture Reveals about your Personality

no code implementations3 Aug 2017 Cristina Segalin, Fabio Celli, Luca Polonio, Michal Kosinski, David Stillwell, Nicu Sebe, Marco Cristani, Bruno Lepri

We analyze the effectiveness of four families of visual features and we discuss some human interpretable patterns that explain the personality traits of the individuals.

General Classification

Looking Beyond Appearances: Synthetic Training Data for Deep CNNs in Re-identification

no code implementations11 Jan 2017 Igor Barros Barbosa, Marco Cristani, Barbara Caputo, Aleksander Rognhaugen, Theoharis Theoharis

Re-identification is generally carried out by encoding the appearance of a subject in terms of outfit, suggesting scenarios where people do not change their attire.

Infinite Feature Selection

1 code implementation ICCV 2015 Giorgio Roffo, Simone Melzi, Marco Cristani

Filter-based feature selection has become crucial in many classification settings, especially object recognition, recently faced with feature learning strategies that originate thousands of cues.

Classification feature selection +2

The S-Hock Dataset: Analyzing Crowds at the Stadium

no code implementations CVPR 2015 Davide Conigliaro, Paolo Rota, Francesco Setti, Chiara Bassetti, Nicola Conci, Nicu Sebe, Marco Cristani

In the dataset, a massive annotation has been carried out, focusing on the spectators at different levels of details: at a higher level, people have been labeled depending on the team they are supporting and the fact that they know the people close to them; going to the lower levels, standard pose information has been considered (regarding the head, the body) but also fine grained actions such as hands on hips, clapping hands etc.

Head Pose Estimation Sociology

Audio Surveillance: a Systematic Review

no code implementations27 Sep 2014 Marco Crocco, Marco Cristani, Andrea Trucco, Vittorio Murino

Despite surveillance systems are becoming increasingly ubiquitous in our living environment, automated surveillance, currently based on video sensory modality and machine intelligence, lacks most of the time the robustness and reliability required in several real applications.

Object Tracking

F-formation Detection: Individuating Free-standing Conversational Groups in Images

no code implementations9 Sep 2014 Francesco Setti, Chris Russell, Chiara Bassetti, Marco Cristani

Then, as a main contribution, we present a brand new method for the automatic detection of groups in still images, which is based on a graph-cuts framework for clustering individuals; in particular we are able to codify in a computational sense the sociological definition of F-formation, that is very useful to encode a group having only proxemic information: position and orientation of people.

Clustering

Free energy score space

no code implementations NeurIPS 2009 Alessandro Perina, Marco Cristani, Umberto Castellani, Vittorio Murino, Nebojsa Jojic

Score functions induced by generative models extract fixed-dimension feature vectors from different-length data observations by subsuming the process of data generation, projecting them in highly informative spaces called score spaces.

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.