no code implementations • 25 Oct 2023 • Tofik Ali, Partha Pratim Roy
The proposed model leverages transformer-based models to encode all the information present in a document image, including textual, visual, and layout information.
Document Classification Question Answering +2