no code implementations • 29 Oct 2019 • Safwan Shatnawi, Mohamed Medhat Gaber, Mihaela Cocea
Subject experts rated the quality of the system's answers on a subset of questions and their ratings were used to identify the most appropriate automatic semantic text similarity metric to use as a validation metric for all answers.