Simple Applications of BERT for Ad Hoc Document Retrieval

26 Mar 2019  ·  Wei Yang, Haotian Zhang, Jimmy Lin ·

Following recent successes in applying BERT to question answering, we explore simple applications to ad hoc document retrieval. This required confronting the challenge posed by documents that are typically longer than the length of input BERT was designed to handle. We address this issue by applying inference on sentences individually, and then aggregating sentence scores to produce document scores. Experiments on TREC microblog and newswire test collections show that our approach is simple yet effective, as we report the highest average precision on these datasets by neural approaches that we are aware of.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Ad-Hoc Information Retrieval TREC Robust04 BERT FT(Microblog) MAP 0.3278 # 2
P@20 0.4287 # 5

Methods