no code implementations • 11 Oct 2023 • Andrew M. Bean, Karolina Korgul, Felix Krones, Robert McCraith, Adam Mahdi
For each question, we score each model on the top-1 accuracy and the distribution of probabilities assigned.
Question Answering