Search Results for author: Silvio Giancola

Found 35 papers, 19 papers with code

SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap

1 code implementation • 17 Apr 2024 • Vladimir Somers, Victor Joos, Anthony Cioppa, Silvio Giancola, Seyed Abolfazl Ghasemzadeh, Floriane Magera, Baptiste Standaert, Amir Mohammad Mansourian, Xin Zhou, Shohreh Kasaei, Bernard Ghanem, Alexandre Alahi, Marc Van Droogenbroeck, Christophe De Vleeschouwer

This tracking and identification process is crucial for reconstructing the game state, defined by the athletes' positions and identities on a 2D top-view of the pitch (i.e., a minimap).

Camera Calibration

X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model

no code implementations • 7 Apr 2024 • Jan Held, Hani Itani, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck

The rapid advancement of artificial intelligence has led to significant improvements in automated decision-making.

Action Recognition Decision Making +4

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders

1 code implementation • 26 Mar 2024 • Alexandre Eymaël, Renaud Vandeghen, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck

In particular, SiamMAE recently introduced a Siamese network, training a shared-weight encoder from two frames of a video with a high asymmetric masking ratio (95%).

Self-Supervised Learning

Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s

no code implementations • 17 Dec 2023 • Maksim Makarenko, Qizhou Wang, Arturo Burguete-Lopez, Silvio Giancola, Bernard Ghanem, Luca Passone, Andrea Fratalocchi

The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1.2 Tb/s with hundreds of frequency bands and megapixel spatial resolution at video rates.

Semantic Segmentation Video Semantic Segmentation +1

SoccerNet 2023 Challenges Results

2 code implementations • 12 Sep 2023 • Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li, Zhijian Huang, Ziyu Meng

More information on the tasks, challenges, and leaderboards is available at https://www.soccer-net.org.

Action Spotting Camera Calibration +3

Learning Semantic Segmentation with Query Points Supervision on Aerial Images

no code implementations • 11 Sep 2023 • Santiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem

In this work, we present a weakly supervised learning algorithm to train semantic segmentation algorithms that only rely on query point annotations instead of full mask labels.

Image Segmentation Segmentation +3

SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries

no code implementations • 10 Apr 2023 • Hassan Mkhallati, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck

By providing broadcasters with a tool to summarize the content of their video with the same level of engagement as a live game, our method could help satisfy the needs of the numerous fans who follow their team but cannot necessarily watch the live game.

Dense Video Captioning

VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views

no code implementations • 10 Apr 2023 • Jan Held, Anthony Cioppa, Silvio Giancola, Abdullah Hamdi, Bernard Ghanem, Marc Van Droogenbroeck

The Video Assistant Referee (VAR) has revolutionized association football, enabling referees to review incidents on the pitch, make informed decisions, and ensure fairness.

Decision Making Fairness

Towards Active Learning for Action Spotting in Association Football Videos

no code implementations • 9 Apr 2023 • Silvio Giancola, Anthony Cioppa, Julia Georgieva, Johsan Billingham, Andreas Serner, Kerry Peek, Bernard Ghanem, Marc Van Droogenbroeck

In this paper, we propose an active learning framework that selects the most informative video samples to be annotated next, thus drastically reducing the annotation effort and accelerating the training of action spotting models to reach the highest accuracy at a faster pace.

Action Spotting Active Learning

MVTN: Learning Multi-View Transformations for 3D Understanding

1 code implementation • 27 Dec 2022 • Abdullah Hamdi, Faisal AlZahrani, Silvio Giancola, Bernard Ghanem

Multi-view projection techniques have shown themselves to be highly effective in achieving top-performing results in the recognition of 3D shapes.

3D Classification 3D Shape Classification +2

EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries

1 code implementation • ICCV 2023 • Jinjie Mai, Abdullah Hamdi, Silvio Giancola, Chen Zhao, Bernard Ghanem

Yet, we point out that the low number of camera poses caused by camera re-localization from previous VQ3D methods severely hinders their overall success rate.

3D Reconstruction Object +2

SegNeRF: 3D Part Segmentation with Neural Radiance Fields

no code implementations • 21 Nov 2022 • Jesus Zarzar, Sara Rojas, Silvio Giancola, Bernard Ghanem

The predicted semantic fields allow SegNeRF to achieve an average mIoU of $\textbf{30.30\%}$ for 2D novel view segmentation, and $\textbf{37.46\%}$ for 3D part segmentation, boasting competitive performance against point-based methods by using only a few posed images.

3D Part Segmentation 3D Reconstruction +2

Estimating more camera poses for ego-centric videos is essential for VQ3D

no code implementations • 18 Nov 2022 • Jinjie Mai, Chen Zhao, Abdullah Hamdi, Silvio Giancola, Bernard Ghanem

Visual queries 3D localization (VQ3D) is a task in the Ego4D Episodic Memory Benchmark.

Object Pose Estimation

SoccerNet 2022 Challenges Results

7 code implementations • 5 Oct 2022 • Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos

no code implementations • 14 Apr 2022 • Anthony Cioppa, Silvio Giancola, Adrien Deliege, Le Kang, Xin Zhou, Zhiyu Cheng, Bernard Ghanem, Marc Van Droogenbroeck

Tracking objects in soccer videos is extremely important to gather both player and team statistics, whether it is to estimate the total distance run, the ball possession or the team formation.

Benchmarking Multiple Object Tracking

3DeformRS: Certifying Spatial Deformations on Point Clouds

1 code implementation • CVPR 2022 • Gabriel Pérez S., Juan C. Pérez, Motasem Alfarra, Silvio Giancola, Bernard Ghanem

In this work, we propose 3DeformRS, a method to certify the robustness of point cloud Deep Neural Networks (DNNs) against real-world deformations.

Autonomous Driving

Real-time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders

1 code implementation • CVPR 2022 • Maksim Makarenko, Arturo Burguete-Lopez, Qizhou Wang, Fedor Getman, Silvio Giancola, Bernard Ghanem, Andrea Fratalocchi

Hyperspectral imaging has attracted significant attention to identify spectral signatures for image classification and automated pattern recognition in computer vision.

Image Classification Semantic Segmentation +1

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

1 code implementation • CVPR 2022 • Mattia Soldan, Alejandro Pardo, Juan León Alcázar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem

The recent and increasing interest in video-language research has driven the development of large-scale datasets that enable data-intensive machine learning techniques.

Ranked #2 on Natural Language Moment Retrieval on MAD

Moment Retrieval Natural Language Moment Retrieval

Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding

2 code implementations • 30 Nov 2021 • Abdullah Hamdi, Silvio Giancola, Bernard Ghanem

To this end, we introduce the concept of the multi-view point cloud (Voint cloud), representing each 3D point as a set of features extracted from several view-points.

3D Classification 3D Part Segmentation +3

SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation

1 code implementation • 10 May 2021 • Bing Li, Cheng Zheng, Silvio Giancola, Bernard Ghanem

We propose a novel scene flow estimation approach to capture and infer 3D motions from point clouds.

Scene Flow Estimation

Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting

no code implementations • 19 Apr 2021 • Anthony Cioppa, Adrien Deliège, Floriane Magera, Silvio Giancola, Olivier Barnich, Bernard Ghanem, Marc Van Droogenbroeck

Specifically, we distill a powerful commercial calibration tool in a recent neural network architecture on the large-scale SoccerNet dataset, composed of untrimmed broadcast videos of 500 soccer games.

Action Spotting Camera Calibration +1

Temporally-Aware Feature Pooling for Action Spotting in Soccer Broadcasts

1 code implementation • 14 Apr 2021 • Silvio Giancola, Bernard Ghanem

In this paper, we focus our analysis on action spotting in soccer broadcast, which consists in temporally localizing the main actions in a soccer game.

Ranked #7 on Action Spotting on SoccerNet-v2 (Average-mAP metric)

Action Spotting

SALA: Soft Assignment Local Aggregation for Parameter Efficient 3D Semantic Segmentation

no code implementations • 29 Dec 2020 • Hani Itani, Silvio Giancola, Ali Thabet, Bernard Ghanem

Since it is learnable, this mapping is allowed to be different per layer instead of being applied uniformly throughout the depth of the network.

3D Semantic Segmentation

MVTN: Multi-View Transformation Network for 3D Shape Recognition

2 code implementations • ICCV 2021 • Abdullah Hamdi, Silvio Giancola, Bernard Ghanem

MVTN exhibits clear performance gains in the tasks of 3D shape classification and 3D shape retrieval without the need for extra training supervision.

Ranked #1 on 3D Object Retrieval on ModelNet40

3D Classification 3D Object Retrieval +6

SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos

3 code implementations • 26 Nov 2020 • Adrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck

In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production.

Ranked #1 on Camera shot segmentation on SoccerNet-v2

Action Spotting Boundary Detection +5

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

1 code implementation • 23 Nov 2020 • Humam Alwassel, Silvio Giancola, Bernard Ghanem

Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.

Ranked #5 on Temporal Action Proposal Generation on ActivityNet-1.3

Action Classification Dense Video Captioning +2

LC-NAS: Latency Constrained Neural Architecture Search for Point Cloud Networks

no code implementations • 24 Aug 2020 • Guohao Li, Mengmeng Xu, Silvio Giancola, Ali Thabet, Bernard Ghanem

In this paper, we introduce a new NAS framework, dubbed LC-NAS, where we search for point cloud architectures that are constrained to a target latency.

Neural Architecture Search Point Cloud Classification +2

A Context-Aware Loss Function for Action Spotting in Soccer Videos

1 code implementation • CVPR 2020 • Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck, Rikke Gade, Thomas B. Moeslund

We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline.

Ranked #3 on Action Spotting on SoccerNet

Action Spotting Video Understanding

PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement

no code implementations • 27 Nov 2019 • Jesus Zarzar, Silvio Giancola, Bernard Ghanem

We integrate residual GCNs in a two-stage 3D object detection pipeline, where 3D object proposals are refined using a novel graph representation.

Ranked #14 on 3D Object Detection on KITTI Cars Hard

3D Object Detection Autonomous Driving +2

Efficient Bird Eye View Proposals for 3D Siamese Tracking

no code implementations • 25 Mar 2019 • Jesus Zarzar, Silvio Giancola, Bernard Ghanem

Successively, we refine our selection of 3D object candidates by exploiting the similarity capability of a 3D Siamese network.

Object Tracking Region Proposal

Leveraging Shape Completion for 3D Siamese Tracking

1 code implementation • CVPR 2019 • Silvio Giancola, Jesus Zarzar, Bernard Ghanem

We design a Siamese tracker that encodes model and candidate shapes into a compact latent representation.

3D Object Tracking Autonomous Vehicles +2

SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos

2 code implementations • 12 Apr 2018 • Silvio Giancola, Mohieddine Amine, Tarek Dghaily, Bernard Ghanem

A total of 6,637 temporal annotations are automatically parsed from online match reports at a one-minute resolution for three main classes of events (Goal, Yellow/Red Card, and Substitution).

Ranked #6 on Action Spotting on SoccerNet

Action Classification Action Detection +2

TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild

1 code implementation • ECCV 2018 • Matthias Müller, Adel Bibi, Silvio Giancola, Salman Al-Subaihi, Bernard Ghanem

In this work, we present TrackingNet, the first large-scale dataset and benchmark for object tracking in the wild.

Object object-detection +2

Integration of Absolute Orientation Measurements in the KinectFusion Reconstruction pipeline

no code implementations • 12 Feb 2018 • Silvio Giancola, Jens Schneider, Peter Wonka, Bernard S. Ghanem

We also present a technique to filter the pairs of 3D matched points based on the distribution of their distances.

3D Reconstruction

A Solution for Crime Scene Reconstruction using Time-of-Flight Cameras

no code implementations • 7 Aug 2017 • Silvio Giancola, Daniele Piron, Pasquale Poppa, Remo Sala

In this work, we propose a method for three-dimensional (3D) reconstruction of a wide crime scene, based on a Simultaneous Localization and Mapping (SLAM) approach.

3D Reconstruction Simultaneous Localization and Mapping
