Search Results for author: Alexander Kirillov

Found 30 papers, 19 papers with code

R-MAE: Regions Meet Masked Autoencoders

1 code implementation • 8 Jun 2023 • Duy-Kien Nguyen, Vaibhav Aggarwal, Yanghao Li, Martin R. Oswald, Alexander Kirillov, Cees G. M. Snoek, Xinlei Chen

In this work, we explore regions as a potential visual analogue of words for self-supervised image representation learning.

Contrastive Learning Interactive Segmentation +4

104

Paper
Code

Segment Anything

18 code implementations • ICCV 2023 • Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation.

Ranked #2 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Event-based Object Segmentation Image Segmentation +3

124,353

Paper
Code

Point-Level Region Contrast for Object Detection Pre-Training

1 code implementation • CVPR 2022 • Yutong Bai, Xinlei Chen, Alexander Kirillov, Alan Yuille, Alexander C. Berg

In this work we present point-level region contrast, a self-supervised pre-training approach for the task of object detection.

Contrastive Learning Knowledge Distillation +2

Paper
Code

SLIP: Self-supervision meets Language-Image Pre-training

1 code implementation • 23 Dec 2021 • Norman Mu, Alexander Kirillov, David Wagner, Saining Xie

Across ImageNet and a battery of additional datasets, we find that SLIP improves accuracy by a large margin.

Multi-Task Learning Representation Learning +1

724

Paper
Code

Mask2Former for Video Instance Segmentation

5 code implementations • 20 Dec 2021 • Bowen Cheng, Anwesa Choudhuri, Ishan Misra, Alexander Kirillov, Rohit Girdhar, Alexander G. Schwing

We find Mask2Former also achieves state-of-the-art performance on video instance segmentation without modifying the architecture, the loss or even the training pipeline.

Ranked #14 on Video Instance Segmentation on YouTube-VIS validation

Image Segmentation Instance Segmentation +5

124,353

Paper
Code

Recognizing Scenes from Novel Viewpoints

no code implementations • 2 Dec 2021 • Shengyi Qian, Alexander Kirillov, Nikhila Ravi, Devendra Singh Chaplot, Justin Johnson, David F. Fouhey, Georgia Gkioxari

Humans can perceive scenes in 3D from a handful of 2D views.

Scene Recognition

Paper
Add Code

Masked-attention Mask Transformer for Universal Image Segmentation

6 code implementations • CVPR 2022 • Bowen Cheng, Ishan Misra, Alexander G. Schwing, Alexander Kirillov, Rohit Girdhar

While only the semantics of each task differ, current research focuses on designing specialized architectures for each task.

Ranked #3 on Semantic Segmentation on Mapillary val

Image Segmentation Instance Segmentation +3

124,353

Paper
Code

Per-Pixel Classification is Not All You Need for Semantic Segmentation

3 code implementations • NeurIPS 2021 • Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

Overall, the proposed mask classification-based method simplifies the landscape of effective approaches to semantic and panoptic segmentation tasks and shows excellent empirical results.

Ranked #4 on Semantic Segmentation on Mapillary val

Classification Panoptic Segmentation +1

124,353

Paper
Code

Pointly-Supervised Instance Segmentation

2 code implementations • CVPR 2022 • Bowen Cheng, Omkar Parkhi, Alexander Kirillov

Our experiments show that the new module is more suitable for the point-based supervision.

Instance Segmentation Object +3

28,589

Paper
Code

Boundary IoU: Improving Object-Centric Image Segmentation Evaluation

2 code implementations • CVPR 2021 • Bowen Cheng, Ross Girshick, Piotr Dollár, Alexander C. Berg, Alexander Kirillov

We perform an extensive analysis across different error types and object sizes and show that Boundary IoU is significantly more sensitive than the standard Mask IoU measure to boundary errors for large objects and does not over-penalize errors on smaller objects.

Image Segmentation Object +2

206

Paper
Code

On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness

1 code implementation • NeurIPS 2021 • Eric Mintun, Alexander Kirillov, Saining Xie

Invariance to a broad array of image corruptions, such as warping, noise, or color shifts, is an important aspect of building robust models in computer vision.

Paper
Code

Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details

2 code implementations • 1 Feb 2021 • Achal Dave, Piotr Dollár, Deva Ramanan, Alexander Kirillov, Ross Girshick

On one hand, this is desirable as it treats all classes equally.

Benchmarking object-detection +2

1,936

Paper
Code

TrackFormer: Multi-Object Tracking with Transformers

2 code implementations • CVPR 2022 • Tim Meinhardt, Alexander Kirillov, Laura Leal-Taixe, Christoph Feichtenhofer

The challenging task of multi-object tracking (MOT) requires simultaneous reasoning about track initialization, identity, and spatio-temporal trajectories.

Ranked #1 on Multi-Object Tracking on MOT17 (e2e-MOT metric)

Multi-Object Tracking Object +1

471

Paper
Code

Is Robustness Robust? On the interaction between augmentations and corruptions

no code implementations • 1 Jan 2021 • Eric Mintun, Alexander Kirillov, Saining Xie

Invariance to a broad array of image corruptions, such as warping, noise, or color shifts, is an important aspect of building robust models in computer vision.

Paper
Add Code

End-to-End Object Detection with Transformers

37 code implementations • ECCV 2020 • Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko

We present a new method that views object detection as a direct set prediction problem.

Ranked #21 on Panoptic Segmentation on COCO minival

Object Panoptic Segmentation +1

124,353

Paper
Code

PointRend: Image Segmentation as Rendering

14 code implementations • CVPR 2020 • Alexander Kirillov, Yuxin Wu, Kaiming He, Ross Girshick

We present a new method for efficient high-quality image segmentation of objects and scenes.

Ranked #3 on Instance Segmentation on COCO 2017 val

Image Segmentation Instance Segmentation +2

28,595

Paper
Code

Exploring Randomly Wired Neural Networks for Image Recognition

9 code implementations • ICCV 2019 • Saining Xie, Alexander Kirillov, Ross Girshick, Kaiming He

In this paper, we explore a more diverse set of connectivity patterns through the lens of randomly wired neural networks.

Ranked #118 on Neural Architecture Search on ImageNet

Image Classification Neural Architecture Search

2,109

Paper
Code

Panoptic Feature Pyramid Networks

12 code implementations • CVPR 2019 • Alexander Kirillov, Ross Girshick, Kaiming He, Piotr Dollár

In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks.

Ranked #4 on Panoptic Segmentation on Indian Driving Dataset

Instance Segmentation Panoptic Segmentation +2

28,593

Paper
Code

WSD algorithm based on a new method of vector-word contexts proximity calculation via epsilon-filtration

1 code implementation • 24 May 2018 • Alexander Kirillov, Natalia Krizhanovsky, Andrew Krizhanovsky

It is necessary to select the meaning of the word in the sentence automatically.

Sentence Word Sense Disambiguation

Paper
Code

Calculated attributes of synonym sets

no code implementations • 5 Mar 2018 • Andrew Krizhanovsky, Alexander Kirillov

Several geometric characteristics of the synset words are introduced: the interior of synset, the synset word rank and centrality.

Paper
Add Code

Panoptic Segmentation

9 code implementations • CVPR 2019 • Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, Piotr Dollár

We propose and study a task we name panoptic segmentation (PS).

Ranked #23 on Panoptic Segmentation on Cityscapes val (using extra training data)

Image Segmentation Instance Segmentation +4

404

Paper
Code

Joint Graph Decomposition & Node Labeling: Problem, Algorithms, Applications

no code implementations • CVPR 2017 • Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

In order to find feasible solutions efficiently, we define two local search algorithms that converge monotonously to a local optimum, offering a feasible solution at any time.

Combinatorial Optimization Multiple Object Tracking +2

Paper
Add Code

Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

no code implementations • 26 Feb 2017 • Omid Hosseini Jafari, Oliver Groth, Alexander Kirillov, Michael Ying Yang, Carsten Rother

Towards this end we propose a Convolutional Neural Network (CNN) architecture that fuses the state of the state-of-the-art results for depth estimation and semantic labeling.

Depth Estimation Depth Prediction +1

Paper
Add Code

Global Hypothesis Generation for 6D Object Pose Estimation

no code implementations • CVPR 2017 • Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother

Most modern approaches solve this task in three steps: i) Compute local features; ii) Generate a pool of pose-hypotheses; iii) Select and refine a pose from the pool.

6D Pose Estimation using RGB Object

Paper
Add Code

InstanceCut: from Edges to Instances with MultiCut

no code implementations • CVPR 2017 • Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother

This work addresses the task of instance-aware semantic segmentation.

Edge Detection Segmentation +1

Paper
Add Code

Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications

1 code implementation • 14 Nov 2016 • Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

In order to find feasible solutions efficiently, we define two local search algorithms that converge monotonously to a local optimum, offering a feasible solution at any time.

Combinatorial Optimization Multiple Object Tracking +2

230

Paper
Code

Joint M-Best-Diverse Labelings as a Parametric Submodular Minimization

no code implementations • NeurIPS 2016 • Alexander Kirillov, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

In particular, the joint M-best diverse labelings can be obtained by running a non-parametric submodular minimization (in the special case - max-flow) solver for M different values of $\gamma$ in parallel, for certain diversity measures.

Paper
Add Code

Inferring M-Best Diverse Labelings in a Single One

no code implementations • ICCV 2015 • Alexander Kirillov, Bogdan Savchynskyy, Dmitrij Schlesinger, Dmitry Vetrov, Carsten Rother

We consider the task of finding M-best diverse solutions in a graphical model.

Image Segmentation Semantic Segmentation

Paper
Add Code

M-Best-Diverse Labelings for Submodular Energies and Beyond

no code implementations • NeurIPS 2015 • Alexander Kirillov, Dmytro Shlezinger, Dmitry P. Vetrov, Carsten Rother, Bogdan Savchynskyy

In this work we show that the joint inference of $M$ best diverse solutions can be formulated as a submodular energy minimization if the original MAP-inference problem is submodular, hence fast inference techniques can be used.

Total Energy

Paper
Add Code

Joint Training of Generic CNN-CRF Models with Stochastic Optimization

no code implementations • 16 Nov 2015 • Alexander Kirillov, Dmitrij Schlesinger, Shuai Zheng, Bogdan Savchynskyy, Philip H. S. Torr, Carsten Rother

We propose a new CNN-CRF end-to-end learning framework, which is based on joint stochastic optimization with respect to both Convolutional Neural Network (CNN) and Conditional Random Field (CRF) parameters.

Stochastic Optimization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.