Object Tracking by Jointly Exploiting Frame and Event Domain

Inspired by the complementarity between conventional frame-based and bio-inspired event-based cameras, we propose a multi-modal approach that fuses visual cues from the frame and event domains to enhance single-object tracking performance, especially under degraded conditions (e.g., scenes with high dynamic range, low light, or fast-moving objects). The proposed approach effectively and adaptively combines meaningful information from both domains. Its effectiveness comes from a newly designed attention module that enhances features through both self- and cross-domain attention; its adaptiveness is ensured by a specially designed weighting scheme that balances the contribution of the two domains. To exploit event-based visual cues in single-object tracking, we construct a large-scale frame-event dataset, which we then use to train a novel frame-event fusion model. Extensive experiments show that the proposed approach outperforms state-of-the-art frame-based tracking methods by at least 10.4% and 11.9% in terms of representative success rate and precision rate, respectively. In addition, the effectiveness of each key component of our approach is evidenced by a thorough ablation study.
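The abstract does not detail the fusion architecture; as a rough illustration only, the PyTorch sketch below shows one plausible reading of self- and cross-domain attention followed by adaptive domain weighting. The module name `CrossDomainFusion` and the gating MLP are hypothetical assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class CrossDomainFusion(nn.Module):
    """Hypothetical sketch: enhance frame/event features with self- and
    cross-domain attention, then fuse them with adaptive per-domain weights."""

    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.self_attn_f = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.self_attn_e = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn_f = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn_e = nn.MultiheadAttention(dim, heads, batch_first=True)
        # Gating MLP: produces one normalized weight per domain (assumed form
        # of the paper's "adaptive weighting scheme").
        self.gate = nn.Sequential(nn.Linear(2 * dim, 2), nn.Softmax(dim=-1))

    def forward(self, f: torch.Tensor, e: torch.Tensor) -> torch.Tensor:
        # f, e: (batch, tokens, dim) frame- and event-domain features.
        f = f + self.self_attn_f(f, f, f)[0]    # self-domain attention (frame)
        e = e + self.self_attn_e(e, e, e)[0]    # self-domain attention (event)
        f2 = f + self.cross_attn_f(f, e, e)[0]  # frame queries event cues
        e2 = e + self.cross_attn_e(e, f, f)[0]  # event queries frame cues
        # Adaptive weighting: pooled features decide each domain's share.
        pooled = torch.cat([f2.mean(dim=1), e2.mean(dim=1)], dim=-1)
        w = self.gate(pooled)                   # (batch, 2), rows sum to 1
        return w[:, 0, None, None] * f2 + w[:, 1, None, None] * e2

# Usage example with dummy features:
fusion = CrossDomainFusion(dim=256)
frame_feat = torch.randn(2, 64, 256)
event_feat = torch.randn(2, 64, 256)
fused = fusion(frame_feat, event_feat)  # (2, 64, 256)
```

The softmax gate guarantees the two domain weights stay positive and sum to one, so the fused feature degrades gracefully toward whichever domain is more informative (e.g., events under low light, frames in static scenes).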

ICCV 2021

Datasets


Introduced in the Paper:

FE108

Used in the Paper:

EV-IMO

Results from the Paper


Task             Dataset   Model        Metric               Value   Global Rank
Object Tracking  FE108     Multi-modal  Success Rate         63.4    #3
Object Tracking  FE108     Multi-modal  Averaged Precision   92.4    #3
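For context, these two metrics follow the standard single-object tracking protocol: success rate is the area under the curve of the fraction of frames whose predicted-vs-ground-truth IoU exceeds a sweep of overlap thresholds, and precision is the fraction of frames whose predicted box center lies within a pixel threshold (commonly 20 px) of the ground truth. Below is a minimal NumPy sketch, assuming boxes given as (x, y, w, h); this reflects the generic benchmark convention, not FE108's exact evaluation code.

```python
import numpy as np

def iou(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Row-wise IoU of axis-aligned boxes given as (x, y, w, h)."""
    x1 = np.maximum(a[:, 0], b[:, 0])
    y1 = np.maximum(a[:, 1], b[:, 1])
    x2 = np.minimum(a[:, 0] + a[:, 2], b[:, 0] + b[:, 2])
    y2 = np.minimum(a[:, 1] + a[:, 3], b[:, 1] + b[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    union = a[:, 2] * a[:, 3] + b[:, 2] * b[:, 3] - inter
    return inter / np.maximum(union, 1e-12)

def success_auc(pred: np.ndarray, gt: np.ndarray) -> float:
    """Area under the overlap-threshold curve (the 'success rate')."""
    thresholds = np.linspace(0.0, 1.0, 21)
    overlaps = iou(pred, gt)
    return float(np.mean([(overlaps > t).mean() for t in thresholds]))

def precision_at(pred: np.ndarray, gt: np.ndarray, thr: float = 20.0) -> float:
    """Fraction of frames whose box-center error is below `thr` pixels."""
    pred_center = pred[:, :2] + pred[:, 2:4] / 2
    gt_center = gt[:, :2] + gt[:, 2:4] / 2
    err = np.linalg.norm(pred_center - gt_center, axis=1)
    return float((err < thr).mean())
```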

Methods


No methods listed for this paper.