1 code implementation • 21 Jul 2022 • Grant van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie
We thoroughly benchmark audiovisual classification performance and modality fusion experiments through the use of state-of-the-art transformer methods.
1 code implementation • 20 Jul 2022 • Elijah Cole, Kimberly Wilber, Grant van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha
Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels.
1 code implementation • 19 Jul 2022 • Justin Kay, Peter Kulits, Suzanne Stathatos, Siqi Deng, Erik Young, Sara Beery, Grant van Horn, Pietro Perona
We present the Caltech Fish Counting Dataset (CFC), a large-scale dataset for detecting, tracking, and counting fish in sonar videos.
no code implementations • 25 Oct 2021 • Devis Tuia, Benjamin Kellenberger, Sara Beery, Blair R. Costelloe, Silvia Zuffi, Benjamin Risse, Alexander Mathis, Mackenzie W. Mathis, Frank van Langevelde, Tilo Burghardt, Roland Kays, Holger Klinck, Martin Wikelski, Iain D. Couzin, Grant van Horn, Margaret C. Crofoot, Charles V. Stewart, Tanya Berger-Wolf
Data acquisition in animal ecology is rapidly accelerating due to inexpensive and accessible sensors such as smartphones, drones, satellites, audio recorders and bio-logging devices.
1 code implementation • CVPR 2021 • Grant van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha
In order to facilitate progress in this area we present two new natural world visual classification datasets, iNat2021 and NeWT.
no code implementations • 11 Apr 2019 • Sara Beery, Grant van Horn, Oisin Mac Aodha, Pietro Perona
Camera traps are a valuable tool for studying biodiversity, but research using this data is limited by the speed of human annotation.
2 code implementations • ECCV 2018 • Sara Beery, Grant van Horn, Pietro Perona
The challenge is learning recognition in a handful of locations, and generalizing animal detection and classification to new locations where no training data is available.
no code implementations • CVPR 2018 • Grant Van Horn, Steve Branson, Scott Loarie, Serge Belongie, Pietro Perona
We introduce a method for efficiently crowdsourcing multiclass annotations in challenging, real world image datasets.
no code implementations • 5 Sep 2017 • Grant Van Horn, Pietro Perona
We find that (a) peak classification performance on well-represented categories is excellent, (b) given enough data, classification performance suffers only minimally from an increase in the number of classes, (c) classification performance decays precipitously as the number of training examples decreases, (d) surprisingly, transfer learning is virtually absent in current methods.
11 code implementations • CVPR 2018 • Grant Van Horn, Oisin Mac Aodha, Yang song, Yin Cui, Chen Sun, Alex Shepard, Hartwig Adam, Pietro Perona, Serge Belongie
Existing image classification datasets used in computer vision tend to have a uniform distribution of images across object categories.
Ranked #7 on
Image Classification
on iNaturalist
no code implementations • CVPR 2017 • Steve Branson, Grant van Horn, Pietro Perona
We develop specialized models and algorithms for binary annotation, part keypoint annotation, and sets of bounding box annotations.
no code implementations • CVPR 2015 • Grant Van Horn, Steve Branson, Ryan Farrell, Scott Haber, Jessie Barry, Panos Ipeirotis, Pietro Perona, Serge Belongie
We worked with bird experts to measure the quality of popular datasets like CUB-200-2011 and ImageNet and found class label error rates of at least 4%.
no code implementations • 11 Jun 2014 • Steve Branson, Grant van Horn, Serge Belongie, Pietro Perona
We perform a detailed investigation of state-of-the-art deep convolutional feature implementations and fine-tuning feature learning for fine-grained classification.
no code implementations • CVPR 2014 • Catherine Wah, Grant van Horn, Steve Branson, Subhransu Maji, Pietro Perona, Serge Belongie
Current human-in-the-loop fine-grained visual categorization systems depend on a predefined vocabulary of attributes and parts, usually determined by experts.
Fine-Grained Visual Categorization
General Classification
+2