no code implementations • 3 Dec 2019 • Sungkwon Choo, Wonkyo Seo, Nam Ik Cho
The two-stream fusion network again consists of motion and appearance stream networks, which extract long-term temporal and spatial information, respectively.
Foreground Segmentation Instance Segmentation +6