1 code implementation • 14 Oct 2022 • Stefan Larson, Gordon Lim, Yutong Ai, David Kuang, Kevin Leach
Our new out-of-distribution benchmark consists of two types of documents: those that are not part of any of the 16 in-domain RVL-CDIP categories (RVL-CDIP-O), and those that are one of the 16 in-domain categories yet are drawn from a distribution different from that of the original RVL-CDIP dataset (RVL-CDIP-N).