FiE: Building a Global Probability Space by Leveraging Early Fusion in Encoder for Open-Domain Question Answering

18 Nov 2022  ·  Akhil Kedia, Mohd Abbas Zaidi, Haejun Lee ·

Generative models have recently started to outperform extractive models in Open Domain Question Answering, largely by leveraging their decoder to attend over multiple encoded passages and combining their information. However, generative models tend to be larger than extractive models due to the need for a decoder, run slower during inference due to auto-regressive decoder beam search, and their generated output often suffers from hallucinations. We propose to extend transformer encoders with the ability to fuse information from multiple passages, using global representation to provide cross-sample attention over all tokens across samples. Furthermore, we propose an alternative answer span probability calculation to better aggregate answer scores in the global space of all samples. Using our proposed method, we outperform the current state-of-the-art method by $2.5$ Exact Match score on the Natural Question dataset while using only $25\%$ of parameters and $35\%$ of the latency during inference, and $4.4$ Exact Match on WebQuestions dataset. When coupled with synthetic data augmentation, we outperform larger models on the TriviaQA dataset as well. The latency and parameter savings of our method make it particularly attractive for open-domain question answering, as these models are often compute-intensive.

PDF Abstract

Results from the Paper


 Ranked #1 on Question Answering on WebQuestions (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Question Answering Natural Questions FiE EM 58.4 # 3
Question Answering TriviaQA FiE+PAQ EM 72.6 # 17
Question Answering WebQuestions FiE+PAQ EM 56.3 # 1
Question Answering WebQuestions FiE EM 52.4 # 2
Open-Domain Question Answering WebQuestions FiE Exact Match 52.4 # 3
Open-Domain Question Answering WebQuestions FiE+PAQ Exact Match 56.3 # 2

Methods


No methods listed for this paper. Add relevant methods here