TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Age And Gender Classification	Adience Age	MiVOLO-D1	Accuracy (5-fold)	68.69	# 3
Age And Gender Classification	Adience Gender	MiVOLO-D1	Accuracy (5-fold)	96.51	# 3
Age Estimation	AgeDB	MiVOLO-D1	MAE	5.55	# 1
Gender Prediction	AgeDB	MiVOLO-D1	Accuracy	98.3	# 1
Facial Attribute Classification	FairFace	MiVOLO-D1	gender-top1	95.73	# 2
Facial Attribute Classification	FairFace	MiVOLO-D1	age-top1	61.07	# 2
Age Estimation	IMDB-Clean	VOLO-D1 age&gender	Average mean absolute error	4.22	# 3
Age Estimation	IMDB-Clean	MiVOLO-D1	Average mean absolute error	4.09	# 2
Age Estimation	LAGENDA	MiVOLO-D1	MAE	3.99	# 2
Gender Prediction	LAGENDA	MiVOLO-D1	Accuracy	97.36	# 2
Age and Gender Estimation	LAGENDA age	MiVOLO-D1	MAE	3.99	# 2
Age and Gender Estimation	LAGENDA age	MiVOLO-D1	CS@5	71.27	# 2
Age and Gender Estimation	LAGENDA gender	MiVOLO-D1	Accuracy	97.36	# 1
Age Estimation	UTKFace	VOLO-D1 age&gender	MAE	4.23	# 3
Age Estimation	UTKFace	MiVOLO-D1	MAE	3.7	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-estimation-on-agedb)](https://paperswithcode.com/sota/age-estimation-on-agedb?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/gender-prediction-on-agedb)](https://paperswithcode.com/sota/gender-prediction-on-agedb?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-and-gender-estimation-on-lagenda-gender)](https://paperswithcode.com/sota/age-and-gender-estimation-on-lagenda-gender?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-estimation-on-utkface)](https://paperswithcode.com/sota/age-estimation-on-utkface?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/facial-attribute-classification-on-fairface)](https://paperswithcode.com/sota/facial-attribute-classification-on-fairface?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-estimation-on-imdb-clean)](https://paperswithcode.com/sota/age-estimation-on-imdb-clean?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-estimation-on-lagenda)](https://paperswithcode.com/sota/age-estimation-on-lagenda?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/gender-prediction-on-lagenda)](https://paperswithcode.com/sota/gender-prediction-on-lagenda?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-and-gender-estimation-on-lagenda-age)](https://paperswithcode.com/sota/age-and-gender-estimation-on-lagenda-age?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-and-gender-classification-on-adience-age)](https://paperswithcode.com/sota/age-and-gender-classification-on-adience-age?p=mivolo-multi-input-transformer-for-age-and)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mivolo-multi-input-transformer-for-age-and/age-and-gender-classification-on-adience)](https://paperswithcode.com/sota/age-and-gender-classification-on-adience?p=mivolo-multi-input-transformer-for-age-and)`

MiVOLO: Multi-input Transformer for Age and Gender Estimation

10 Jul 2023 · Maksim Kuprashevich, Irina Tolstykh ·

Age and gender recognition in the wild is a highly challenging task: apart from the variability of conditions, pose complexities, and varying image quality, there are cases where the face is partially or completely occluded. We present MiVOLO (Multi Input VOLO), a straightforward approach for age and gender estimation using the latest vision transformer. Our method integrates both tasks into a unified dual input/output model, leveraging not only facial information but also person image data. This improves the generalization ability of our model and enables it to deliver satisfactory results even when the face is not visible in the image. To evaluate our proposed model, we conduct experiments on four popular benchmarks and achieve state-of-the-art performance, while demonstrating real-time processing capabilities. Additionally, we introduce a novel benchmark based on images from the Open Images Dataset. The ground truth annotations for this benchmark have been meticulously generated by human annotators, resulting in high accuracy answers due to the smart aggregation of votes. Furthermore, we compare our model's age recognition performance with human-level accuracy and demonstrate that it significantly outperforms humans across a majority of age ranges. Finally, we grant public access to our models, along with the code for validation and inference. In addition, we provide extra annotations for used datasets and introduce our new benchmark.

PDF Abstract

Code

Add Remove Mark official

wildchlamydia/mivolo official

↳ Quickstart in

Spaces

227

Tasks

Add Remove

Age And Gender Classification

Age and Gender Estimation

Age Estimation

Facial Attribute Classification

Gender Prediction

Datasets

Introduced in the Paper:

LAGENDA

Used in the Paper:

UTKFace FairFace

Adience

AgeDB

IMDB-Clean

Results from the Paper

Edit

Ranked #1 on Age Estimation on AgeDB (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Age And Gender Classification	Adience Age	MiVOLO-D1	Accuracy (5-fold)	68.69	# 3	Compare
Age And Gender Classification	Adience Gender	MiVOLO-D1	Accuracy (5-fold)	96.51	# 3	Compare
Age Estimation	AgeDB	MiVOLO-D1	MAE	5.55	# 1	Compare
Gender Prediction	AgeDB	MiVOLO-D1	Accuracy	98.3	# 1	Compare
Facial Attribute Classification	FairFace	MiVOLO-D1	gender-top1	95.73	# 2	Compare
Facial Attribute Classification	FairFace	MiVOLO-D1	age-top1	61.07	# 2	Compare
Age Estimation	IMDB-Clean	VOLO-D1 age&gender	Average mean absolute error	4.22	# 3	Compare
Age Estimation	IMDB-Clean	MiVOLO-D1	Average mean absolute error	4.09	# 2	Compare
Age Estimation	LAGENDA	MiVOLO-D1	MAE	3.99	# 2	Compare
Gender Prediction	LAGENDA	MiVOLO-D1	Accuracy	97.36	# 2	Compare
Age and Gender Estimation	LAGENDA age	MiVOLO-D1	MAE	3.99	# 2	Compare
Age and Gender Estimation	LAGENDA age	MiVOLO-D1	CS@5	71.27	# 2	Compare
Age and Gender Estimation	LAGENDA gender	MiVOLO-D1	Accuracy	97.36	# 1	Compare
Age Estimation	UTKFace	VOLO-D1 age&gender	MAE	4.23	# 3	Compare
Age Estimation	UTKFace	MiVOLO-D1	MAE	3.7	# 1	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

MiVOLO: Multi-input Transformer for Age and Gender Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove