FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

Many of the recent successful methods for video object segmentation (VOS) are overly complicated, heavily rely on fine-tuning on the first frame, and/or are slow, and are hence of limited practical use. In this work, we propose FEELVOS as a simple and fast method which does not rely on fine-tuning... (read more)

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Semi-Supervised Video Object Segmentation DAVIS 2016 FEELVOS Jaccard (Mean) 81.1 # 22
Jaccard (Recall) 90.5 # 22
Jaccard (Decay) 13.7 # 6
F-measure (Mean) 82.2 # 16
F-measure (Recall) 86.6 # 22
F-measure (Decay) 14.1 # 25
J&F 81.65 # 19
Semi-Supervised Video Object Segmentation DAVIS 2017 (test-dev) FEELVOS J&F 57.8 # 11
Jaccard (Mean) 55.1 # 11
Jaccard (Recall) 62.6 # 8
Jaccard (Decay) 29.8 # 19
F-measure (Mean) 60.4 # 12
F-measure (Recall) 68.5 # 9
F-measure (Decay) 33.5 # 18
Semi-Supervised Video Object Segmentation DAVIS 2017 (val) FEELVOS Jaccard (Mean) 69.1 # 12
Jaccard (Recall) 79.1 # 7
Jaccard (Decay) 17.5 # 12
F-measure (Mean) 74.0 # 13
F-measure (Recall) 83.8 # 7
F-measure (Decay) 20.1 # 14
J&F 71.55 # 12
Semi-Supervised Video Object Segmentation YouTube FEELVOS mIoU 0.821 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet