2 code implementations • 25 Apr 2024 • Haotian Yan, Ming Wu, Chuang Zhang
VWA leverages the local window attention (LWA) and disentangles LWA into the query window and context window, allowing the context's scale to vary for the query to learn representations at multiple scales.
2 code implementations • 5 Jan 2022 • Haotian Yan, Chuang Zhang, Ming Wu
In this paper, we succeed in introducing multi-scale representations into semantic segmentation ViT via window attention mechanism and further improves the performance and efficiency.
Ranked #14 on Semantic Segmentation on DADA-seg
2 code implementations • 27 Apr 2021 • Haotian Yan, Zhe Li, Weijian Li, Changhu Wang, Ming Wu, Chuang Zhang
It is also worth pointing that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet.