1 code implementation • ECCV 2020 • Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang, Ling Shao
In particular, we 1) propose a bifurcated backbone strategy (BBS) to split the multi-level features into teacher and student features, and 2) utilize a depth-enhanced module (DEM) to excavate informative parts of depth cues from the channel and spatial views.
2 code implementations • 14 May 2024 • Yingjie Zhai, Wenshuo Li, Yehui Tang, Xinghao Chen, Yunhe Wang
In this paper, we propose to squeeze the time axis of a video sequence into the channel dimension and present a lightweight video recognition network, term as \textit{SqueezeTime}, for mobile video understanding.
1 code implementation • 10 May 2024 • ZhenLiang Ni, Xinghao Chen, Yingjie Zhai, Yehui Tang, Yunhe Wang
Specifically, it achieves $43. 6\%$ mIoU on ADE20K with only $4. 0$ GFLOPs, which is $0. 9\%$ and $2. 5\%$ mIoU better than SeaFormer and SegNeXt but with about $38. 0\%$ fewer GFLOPs.
2 code implementations • 6 Jul 2020 • Yingjie Zhai, Deng-Ping Fan, Jufeng Yang, Ali Borji, Ling Shao, Junwei Han, Liang Wang
In particular, first, we propose to regroup the multi-level features into teacher and student features using a bifurcated backbone strategy (BBS).
Ranked #2 on RGB-D Salient Object Detection on RGBD135