TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Continual Learning	ASC (19 tasks)	A-GEM	F1 - macro	0.7844	# 6
Class Incremental Learning	cifar100	A-GEM	10-stage average accuracy	45.76	# 8

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-lifelong-learning-with-a-gem/continual-learning-on-asc-19-tasks)](https://paperswithcode.com/sota/continual-learning-on-asc-19-tasks?p=efficient-lifelong-learning-with-a-gem)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/efficient-lifelong-learning-with-a-gem/class-incremental-learning-on-cifar100)](https://paperswithcode.com/sota/class-incremental-learning-on-cifar100?p=efficient-lifelong-learning-with-a-gem)`

Efficient Lifelong Learning with A-GEM

ICLR 2019 · Arslan Chaudhry, Marc'Aurelio Ranzato, Marcus Rohrbach, Mohamed Elhoseiny ·

In lifelong learning, the learner is presented with a sequence of tasks, incrementally building a data-driven prior which may be leveraged to speed up learning of a new task. In this work, we investigate the efficiency of current lifelong approaches, in terms of sample complexity, computational and memory cost. Towards this end, we first introduce a new and a more realistic evaluation protocol, whereby learners observe each example only once and hyper-parameter selection is done on a small and disjoint set of tasks, which is not used for the actual learning experience and evaluation. Second, we introduce a new metric measuring how quickly a learner acquires a new skill. Third, we propose an improved version of GEM (Lopez-Paz & Ranzato, 2017), dubbed Averaged GEM (A-GEM), which enjoys the same or even better performance as GEM, while being almost as computationally and memory efficient as EWC (Kirkpatrick et al., 2016) and other regularization-based methods. Finally, we show that all algorithms including A-GEM can learn even more quickly if they are provided with task descriptors specifying the classification tasks under consideration. Our experiments on several standard lifelong learning benchmarks demonstrate that A-GEM has the best trade-off between accuracy and efficiency.