SegGPT: Segmenting Everything In Context

6 Apr 2023  ยท  Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang ยท

We present SegGPT, a generalist model for segmenting everything in context. We unify various segmentation tasks into a generalist in-context learning framework that accommodates different kinds of segmentation data by transforming them into the same format of images. The training of SegGPT is formulated as an in-context coloring problem with random color mapping for each data sample. The objective is to accomplish diverse tasks according to the context, rather than relying on specific colors. After training, SegGPT can perform arbitrary segmentation tasks in images or videos via in-context inference, such as object instance, stuff, part, contour, and text. SegGPT is evaluated on a broad range of tasks, including few-shot semantic segmentation, video object segmentation, semantic segmentation, and panoptic segmentation. Our results show strong capabilities in segmenting in-domain and out-of-domain targets, either qualitatively or quantitatively.

PDF Abstract

Results from the Paper


 Ranked #1 on Few-Shot Semantic Segmentation on PASCAL-5i (5-Shot) (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Few-Shot Semantic Segmentation COCO-20i (1-shot) SegGPT (ViT) Mean IoU 56.1 # 2
Few-Shot Semantic Segmentation COCO-20i (5-shot) SegGPT (ViT) Mean IoU 67.9 # 1
Few-Shot Semantic Segmentation FSS-1000 (1-shot) SegGPT (ViT) Mean IoU 85.6 # 19
Few-Shot Semantic Segmentation FSS-1000 (5-shot) SegGPT (ViT) Mean IoU 89.3 # 9
Few-Shot Semantic Segmentation PASCAL-5i (1-Shot) SegGPT (ViT) Mean IoU 83.2 # 1
Few-Shot Semantic Segmentation PASCAL-5i (5-Shot) SegGPT (ViT) Mean IoU 89.8 # 1

Methods


No methods listed for this paper. Add relevant methods here