Search Results for author: Francesco Ferroni

Found 15 papers, 3 papers with code

Better Call SAL: Towards Learning to Segment Anything in Lidar

no code implementations19 Mar 2024 Aljoša Ošep, Tim Meinhardt, Francesco Ferroni, Neehar Peri, Deva Ramanan, Laura Leal-Taixé

We propose $\texttt{SAL}$ ($\texttt{S}$egment $\texttt{A}$nything in $\texttt{L}$idar) method consisting of a text-promptable zero-shot model for segmenting and classifying any object in Lidar, and a pseudo-labeling engine that facilitates model training without manual supervision.

Panoptic Segmentation

SeMoLi: What Moves Together Belongs Together

no code implementations29 Feb 2024 Jenny Seidenschwarz, Aljoša Ošep, Francesco Ferroni, Simon Lucey, Laura Leal-Taixé

Recent results suggest that heuristic-based clustering methods in conjunction with object trackers can be used to pseudo-label instances of moving objects and use these as supervisory signals to train 3D object detectors in Lidar data without manual supervision.

Clustering Object +5

Thinking Like an Annotator: Generation of Dataset Labeling Instructions

no code implementations24 Jun 2023 Nadine Chang, Francesco Ferroni, Michael J. Tarr, Martial Hebert, Deva Ramanan

In Labeling Instruction Generation, we take a reasonably annotated dataset and: 1) generate a set of examples that are visually representative of each category in the dataset; 2) provide a text label that corresponds to each of the examples.

Language Modelling Retrieval

Fast Neural Scene Flow

1 code implementation ICCV 2023 Xueqian Li, Jianqiao Zheng, Francesco Ferroni, Jhony Kaesemodel Pontes, Simon Lucey

Neural Scene Flow Prior (NSFP) is of significant interest to the vision community due to its inherent robustness to out-of-distribution (OOD) effects and its ability to deal with dense lidar points.

Autonomous Driving Self-supervised Scene Flow Estimation

Learning to Zoom and Unzoom

no code implementations CVPR 2023 Chittesh Thavamani, Mengtian Li, Francesco Ferroni, Deva Ramanan

In this work (LZU), we "learn to zoom" in on the input image, compute spatial features, and then "unzoom" to revert any deformations.

Autonomous Navigation Monocular 3D Object Detection +3

Priors are Powerful: Improving a Transformer for Multi-camera 3D Detection with 2D Priors

no code implementations31 Jan 2023 Di Feng, Francesco Ferroni

Transfomer-based approaches advance the recent development of multi-camera 3D detection both in academia and industry.

Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

no code implementations CVPR 2023 Xindi Wu, KwunFung Lau, Francesco Ferroni, Aljoša Ošep, Deva Ramanan

Moreover, we show that our retrieved maps can be used to update or expand existing maps and even show proof-of-concept results for visual localization and image retrieval from spatial graphs.

Autonomous Navigation Cross-Modal Retrieval +3

Far3Det: Towards Far-Field 3D Detection

no code implementations25 Nov 2022 Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong

We focus on the task of far-field 3D detection (Far3Det) of objects beyond a certain distance from an observer, e. g., $>$50m.

Autonomous Vehicles Philosophy

Flat Latent Manifolds for Human-machine Co-creation of Music

no code implementations23 Feb 2022 Nutan Chen, Djalel Benbouzid, Francesco Ferroni, Mathis Nitschke, Luciano Pinna, Patrick van der Smagt

We therefore consider a music-generating algorithm as a counterpart to a human musician, in a setting where reciprocal interplay is to lead to new experiences, both for the musician and the audience.

Music Generation

Learning Flat Latent Manifolds with VAEs

no code implementations ICML 2020 Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captured by the framework of Riemannian geometry.

Computational Efficiency

FLAT MANIFOLD VAES

no code implementations25 Sep 2019 Nutan Chen, Alexej Klushyn, Francesco Ferroni, Justin Bayer, Patrick van der Smagt

Latent-variable models represent observed data by mapping a prior distribution over some latent space to an observed space.

Fast Approximate Geodesics for Deep Generative Models

no code implementations19 Dec 2018 Nutan Chen, Francesco Ferroni, Alexej Klushyn, Alexandros Paraschos, Justin Bayer, Patrick van der Smagt

The length of the geodesic between two data points along a Riemannian manifold, induced by a deep generative model, yields a principled measure of similarity.

Cannot find the paper you are looking for? You can Submit a new open access paper.