no code implementations • 5 Jun 2019 • Minh N. Vu, Truc D. Nguyen, NhatHai Phan, Ralucca Gera, My T. Thai
Given a classifier's prediction and the corresponding explanation on that prediction, c-Eval is the minimum-distortion perturbation that successfully alters the prediction while keeping the explanation's features unchanged.