no code implementations • 13 Nov 2020 • J. Tu, C. Chen, X. Huang, J. He, X. Guan
Based on this multi-modal information, the proposed DFR-ST constructs an appearance model for a multi-grained visual representation by a two-stream architecture and a spatio-temporal metric to provide complementary information.