Search Results for author: Silvio Giancola

Found 35 papers, 19 papers with code

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders

1 code implementation26 Mar 2024 Alexandre Eymaël, Renaud Vandeghen, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck

In particular, SiamMAE recently introduced a Siamese network, training a shared-weight encoder from two frames of a video with a high asymmetric masking ratio (95%).

Self-Supervised Learning

Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s

no code implementations17 Dec 2023 Maksim Makarenko, Qizhou Wang, Arturo Burguete-Lopez, Silvio Giancola, Bernard Ghanem, Luca Passone, Andrea Fratalocchi

The technology platform combines artificial intelligence hardware, processing information optically, with state-of-the-art machine vision networks, resulting in a data processing speed of 1. 2 Tb/s with hundreds of frequency bands and megapixel spatial resolution at video rates.

Semantic Segmentation Video Semantic Segmentation +1

SoccerNet 2023 Challenges Results

2 code implementations12 Sep 2023 Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li, Zhijian Huang, Ziyu Meng

More information on the tasks, challenges, and leaderboards are available on https://www. soccer-net. org.

Action Spotting Camera Calibration +3

Learning Semantic Segmentation with Query Points Supervision on Aerial Images

no code implementations11 Sep 2023 Santiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem

In this work, we present a weakly supervised learning algorithm to train semantic segmentation algorithms that only rely on query point annotations instead of full mask labels.

Image Segmentation Segmentation +3

SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries

no code implementations10 Apr 2023 Hassan Mkhallati, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck

By providing broadcasters with a tool to summarize the content of their video with the same level of engagement as a live game, our method could help satisfy the needs of the numerous fans who follow their team but cannot necessarily watch the live game.

Dense Video Captioning

VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views

no code implementations10 Apr 2023 Jan Held, Anthony Cioppa, Silvio Giancola, Abdullah Hamdi, Bernard Ghanem, Marc Van Droogenbroeck

The Video Assistant Referee (VAR) has revolutionized association football, enabling referees to review incidents on the pitch, make informed decisions, and ensure fairness.

Decision Making Fairness

Towards Active Learning for Action Spotting in Association Football Videos

no code implementations9 Apr 2023 Silvio Giancola, Anthony Cioppa, Julia Georgieva, Johsan Billingham, Andreas Serner, Kerry Peek, Bernard Ghanem, Marc Van Droogenbroeck

In this paper, we propose an active learning framework that selects the most informative video samples to be annotated next, thus drastically reducing the annotation effort and accelerating the training of action spotting models to reach the highest accuracy at a faster pace.

Action Spotting Active Learning

MVTN: Learning Multi-View Transformations for 3D Understanding

1 code implementation27 Dec 2022 Abdullah Hamdi, Faisal AlZahrani, Silvio Giancola, Bernard Ghanem

Multi-view projection techniques have shown themselves to be highly effective in achieving top-performing results in the recognition of 3D shapes.

3D Classification 3D Shape Classification +2

EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries

1 code implementation ICCV 2023 Jinjie Mai, Abdullah Hamdi, Silvio Giancola, Chen Zhao, Bernard Ghanem

Yet, we point out that the low number of camera poses caused by camera re-localization from previous VQ3D methods severally hinders their overall success rate.

3D Reconstruction Object +2

SegNeRF: 3D Part Segmentation with Neural Radiance Fields

no code implementations21 Nov 2022 Jesus Zarzar, Sara Rojas, Silvio Giancola, Bernard Ghanem

The predicted semantic fields allow SegNeRF to achieve an average mIoU of $\textbf{30. 30%}$ for 2D novel view segmentation, and $\textbf{37. 46%}$ for 3D part segmentation, boasting competitive performance against point-based methods by using only a few posed images.

3D Part Segmentation 3D Reconstruction +2

SoccerNet 2022 Challenges Results

7 code implementations5 Oct 2022 Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos

no code implementations14 Apr 2022 Anthony Cioppa, Silvio Giancola, Adrien Deliege, Le Kang, Xin Zhou, Zhiyu Cheng, Bernard Ghanem, Marc Van Droogenbroeck

Tracking objects in soccer videos is extremely important to gather both player and team statistics, whether it is to estimate the total distance run, the ball possession or the team formation.

Benchmarking Multiple Object Tracking

3DeformRS: Certifying Spatial Deformations on Point Clouds

1 code implementation CVPR 2022 Gabriel Pérez S., Juan C. Pérez, Motasem Alfarra, Silvio Giancola, Bernard Ghanem

In this work, we propose 3DeformRS, a method to certify the robustness of point cloud Deep Neural Networks (DNNs) against real-world deformations.

Autonomous Driving

Real-time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders

1 code implementation CVPR 2022 Maksim Makarenko, Arturo Burguete-Lopez, Qizhou Wang, Fedor Getman, Silvio Giancola, Bernard Ghanem, Andrea Fratalocchi

Hyperspectral imaging has attracted significant attention to identify spectral signatures for image classification and automated pattern recognition in computer vision.

Image Classification Semantic Segmentation +1

Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding

2 code implementations30 Nov 2021 Abdullah Hamdi, Silvio Giancola, Bernard Ghanem

To this end, we introduce the concept of the multi-view point cloud (Voint cloud), representing each 3D point as a set of features extracted from several view-points.

3D Classification 3D Part Segmentation +3

SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation

1 code implementation10 May 2021 Bing Li, Cheng Zheng, Silvio Giancola, Bernard Ghanem

We propose a novel scene flow estimation approach to capture and infer 3D motions from point clouds.

Scene Flow Estimation

Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting

no code implementations19 Apr 2021 Anthony Cioppa, Adrien Deliège, Floriane Magera, Silvio Giancola, Olivier Barnich, Bernard Ghanem, Marc Van Droogenbroeck

Specifically, we distill a powerful commercial calibration tool in a recent neural network architecture on the large-scale SoccerNet dataset, composed of untrimmed broadcast videos of 500 soccer games.

Action Spotting Camera Calibration +1

Temporally-Aware Feature Pooling for Action Spotting in Soccer Broadcasts

1 code implementation14 Apr 2021 Silvio Giancola, Bernard Ghanem

In this paper, we focus our analysis on action spotting in soccer broadcast, which consists in temporally localizing the main actions in a soccer game.

Ranked #7 on Action Spotting on SoccerNet-v2 (Average-mAP metric)

Action Spotting

SALA: Soft Assignment Local Aggregation for Parameter Efficient 3D Semantic Segmentation

no code implementations29 Dec 2020 Hani Itani, Silvio Giancola, Ali Thabet, Bernard Ghanem

Since it is learnable, this mapping is allowed to be different per layer instead of being applied uniformly throughout the depth of the network.

3D Semantic Segmentation

MVTN: Multi-View Transformation Network for 3D Shape Recognition

2 code implementations ICCV 2021 Abdullah Hamdi, Silvio Giancola, Bernard Ghanem

MVTN exhibits clear performance gains in the tasks of 3D shape classification and 3D shape retrieval without the need for extra training supervision.

3D Classification 3D Object Retrieval +6

SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos

3 code implementations26 Nov 2020 Adrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck

In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production.

Action Spotting Boundary Detection +5

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

1 code implementation23 Nov 2020 Humam Alwassel, Silvio Giancola, Bernard Ghanem

Extensive experiments show that using features trained with our novel pretraining strategy significantly improves the performance of recent state-of-the-art methods on three tasks: Temporal Action Localization, Action Proposal Generation, and Dense Video Captioning.

Action Classification Dense Video Captioning +2

LC-NAS: Latency Constrained Neural Architecture Search for Point Cloud Networks

no code implementations24 Aug 2020 Guohao Li, Mengmeng Xu, Silvio Giancola, Ali Thabet, Bernard Ghanem

In this paper, we introduce a new NAS framework, dubbed LC-NAS, where we search for point cloud architectures that are constrained to a target latency.

Neural Architecture Search Point Cloud Classification +2

PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement

no code implementations27 Nov 2019 Jesus Zarzar, Silvio Giancola, Bernard Ghanem

We integrate residual GCNs in a two-stage 3D object detection pipeline, where 3D object proposals are refined using a novel graph representation.

3D Object Detection Autonomous Driving +2

Efficient Bird Eye View Proposals for 3D Siamese Tracking

no code implementations25 Mar 2019 Jesus Zarzar, Silvio Giancola, Bernard Ghanem

Successively, we refine our selection of 3D object candidates by exploiting the similarity capability of a 3D Siamese network.

Object Tracking Region Proposal

SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos

2 code implementations12 Apr 2018 Silvio Giancola, Mohieddine Amine, Tarek Dghaily, Bernard Ghanem

A total of 6, 637 temporal annotations are automatically parsed from online match reports at a one minute resolution for three main classes of events (Goal, Yellow/Red Card, and Substitution).

Action Classification Action Detection +2

Integration of Absolute Orientation Measurements in the KinectFusion Reconstruction pipeline

no code implementations12 Feb 2018 Silvio Giancola, Jens Schneider, Peter Wonka, Bernard S. Ghanem

We also present a technique to filter the pairs of 3D matched points based on the distribution of their distances.

3D Reconstruction

A Solution for Crime Scene Reconstruction using Time-of-Flight Cameras

no code implementations7 Aug 2017 Silvio Giancola, Daniele Piron, Pasquale Poppa, Remo Sala

In this work, we propose a method for three-dimensional (3D) reconstruction of wide crime scene, based on a Simultaneous Localization and Mapping (SLAM) approach.

3D Reconstruction Simultaneous Localization and Mapping

Cannot find the paper you are looking for? You can Submit a new open access paper.