Artie Bias Corpus: An Open Dataset for Detecting Demographic Bias in Speech Applications

We describe the creation of the Artie Bias Corpus, an English dataset of expert-validated <audio, transcript> pairs with demographic tags for age, gender, and accent. We also release open software that can be used with the Artie Bias Corpus to detect demographic bias in Automatic Speech Recognition systems and can be extended to other speech technologies...
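One common way to look for demographic bias in a speech recognizer is to compare word error rate (WER) across groups defined by a demographic tag. The sketch below illustrates that general idea only. It is not the released software, and the record fields `transcript`, `hypothesis`, and `gender` are hypothetical names chosen for illustration, not the corpus's actual schema.

```python
from collections import defaultdict


def word_edit_distance(ref, hyp):
    """Levenshtein distance between two lists of words."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer_by_group(samples, tag):
    """Pooled WER per value of a demographic tag.

    `samples` is a list of dicts with hypothetical keys:
    "transcript" (reference), "hypothesis" (recognizer output), and `tag`.
    """
    errors = defaultdict(int)
    words = defaultdict(int)
    for s in samples:
        ref = s["transcript"].lower().split()
        hyp = s["hypothesis"].lower().split()
        errors[s[tag]] += word_edit_distance(ref, hyp)
        words[s[tag]] += len(ref)
    # Skip groups with no reference words to avoid division by zero.
    return {g: errors[g] / words[g] for g in words if words[g] > 0}
```

A large gap in WER between groups would be a signal worth investigating, although a real analysis would also need a significance test and enough samples per group.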
