no code implementations • 28 Feb 2022 • Xiuquan Du, Kunpeng Ma, Yuhui Song
Firstly, a coarse-grained patch attention module in the encoding is adopted to get a patch-based coarse-grained attention map in a multi-stage explicitly supervised way, enabling target spatial context saliency representation with a patch-based weighting technique that eliminates the effect of intra-class inconsistency.