1 dataset result for Video Object Segmentation AND Audio

AVSBench is a pixel-level audio-visual segmentation benchmark that provides ground truth labels for sounding objects. The dataset is divided into three subsets: AVSBench-object (Single-source subset, Multi-sources subset) and AVSBench-semantic (Semantic-labels subset). Accordingly, three settings are studied:

10 PAPERS • NO BENCHMARKS YET

Datasets

1 dataset result for Video Object Segmentation AND Audio