TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	Frame (fps)	27.3 (Titan Xp)	# 11
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	mask AP	34.6	# 15
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	AP50	53.8	# 11
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	AP75	36.9	# 11
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	APS	11.9	# 10
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	APM	36.8	# 10
Real-time Instance Segmentation	MSCOCO	YOLACT-550++ (ResNet-101-FPN)	APL	55.1	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/yolact-better-real-time-instance-segmentation/real-time-instance-segmentation-on-mscoco)](https://paperswithcode.com/sota/real-time-instance-segmentation-on-mscoco?p=yolact-better-real-time-instance-segmentation)`

YOLACT++: Better Real-time Instance Segmentation

3 Dec 2019 · Daniel Bolya, Chong Zhou, Fanyi Xiao, Yong Jae Lee

We present a simple, fully-convolutional model for real-time (>30 fps) instance segmentation that achieves competitive results on MS COCO evaluated on a single Titan Xp, which is significantly faster than any previous state-of-the-art approach. Moreover, we obtain this result after training on only one GPU. We accomplish this by breaking instance segmentation into two parallel subtasks: (1) generating a set of prototype masks and (2) predicting per-instance mask coefficients. Then we produce instance masks by linearly combining the prototypes with the mask coefficients. We find that because this process doesn't depend on repooling, this approach produces very high-quality masks and exhibits temporal stability for free. Furthermore, we analyze the emergent behavior of our prototypes and show they learn to localize instances on their own in a translation variant manner, despite being fully-convolutional. We also propose Fast NMS, a drop-in 12 ms faster replacement for standard NMS that only has a marginal performance penalty. Finally, by incorporating deformable convolutions into the backbone network, optimizing the prediction head with better anchor scales and aspect ratios, and adding a novel fast mask re-scoring branch, our YOLACT++ model can achieve 34.1 mAP on MS COCO at 33.5 fps, which is fairly close to the state-of-the-art approaches while still running at real-time.
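The two mechanisms the abstract names, linear mask assembly and Fast NMS, can be illustrated with a minimal sketch. This is not the code from dbolya/yolact: the function names `assemble_masks`, `iou`, and `fast_nms`, the nested-list data layout, and the sigmoid applied after the linear combination are assumptions made for illustration. The Fast NMS rule follows one reading of the paper's description (a box is dropped when any higher-scoring box overlaps it too much, even if that higher-scoring box was itself dropped). Pure Python is used only to keep the example self-contained; a practical version would likely use batched tensor operations on the GPU.

```python
import math


def assemble_masks(prototypes, coefficients):
    """Linearly combine k prototype masks with per-instance coefficients.

    prototypes:   H x W x k nested lists (one k-vector per pixel)
    coefficients: n x k nested lists (one k-vector per detected instance)
    returns:      n x H x W nested lists of soft mask values in (0, 1)
    """
    def sigmoid(z):
        return 1.0 / (1.0 + math.exp(-z))

    return [
        [[sigmoid(sum(p * c for p, c in zip(pixel, coeffs))) for pixel in row]
         for row in prototypes]
        for coeffs in coefficients
    ]


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def fast_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of kept boxes, highest score first.

    A box is kept only if its largest IoU with any higher-scoring box is at
    most the threshold. Suppressed boxes still act as suppressors, so this
    can remove slightly more boxes than sequential NMS.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for rank, i in enumerate(order):
        max_iou = max((iou(boxes[i], boxes[j]) for j in order[:rank]), default=0.0)
        if max_iou <= iou_threshold:
            keep.append(i)
    return keep


# Hypothetical toy input: three overlapping boxes and one far-away box.
example_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (2, 2, 12, 12), (50, 50, 60, 60)]
example_scores = [0.9, 0.8, 0.7, 0.6]
print(fast_nms(example_boxes, example_scores))  # prints [0, 3]
```

In the toy input, sequential NMS would keep the third box, because its only above-threshold overlap is with the second box, which has already been removed. The sketch drops it anyway, which is the kind of small accuracy trade-off the abstract refers to as a marginal performance penalty.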

Code

dbolya/yolact (official) · 4,922 stars

mindspore-ai/models · 334 stars

DataXujing/yolact_pytorch

anshkumar/yolact

KevinJia1212/yolact_cityscapes_550

See all 36 implementations

Tasks

Instance Segmentation

Real-time Instance Segmentation

Segmentation

Semantic Segmentation

Datasets

MS COCO

Cityscapes

ssd

SBD MSCOCO

Results from the Paper

Ranked #15 on Real-time Instance Segmentation on MSCOCO (using extra training data)

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet
