1 code implementation • NeurIPS 2021 • Vihari Piratla, Soumen Chakrabarti, Sunita Sarawagi
Our goal is to evaluate the accuracy of a black-box classification model, not as a single aggregate on a given test data distribution, but as a surface over a large number of combinations of attributes characterizing multiple test data distributions.
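To make the contrast between a single aggregate and an accuracy surface concrete, here is a minimal sketch. It is not the authors' estimation method. It assumes every test example already carries a tuple of attribute values and a correctness flag for the black-box model's prediction, and it simply averages correctness per attribute combination. The names `accuracy_surface` and `aggregate_accuracy`, and the toy attributes (location, lighting), are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def accuracy_surface(records):
    """Empirical accuracy per attribute combination.

    records: iterable of (attributes, is_correct) pairs, where attributes
    is a hashable tuple of attribute values (assumed format, illustration only).
    """
    counts = defaultdict(lambda: [0, 0])  # combination -> [num_correct, num_total]
    for attrs, is_correct in records:
        counts[attrs][0] += int(is_correct)
        counts[attrs][1] += 1
    return {attrs: correct / total for attrs, (correct, total) in counts.items()}


def aggregate_accuracy(records):
    """Single aggregate accuracy over all records."""
    records = list(records)
    return sum(int(is_correct) for _, is_correct in records) / len(records)


# Toy data (made up): attributes are (location, lighting).
records = [
    (("indoor", "daylight"), True),
    (("indoor", "daylight"), True),
    (("indoor", "night"), False),
    (("indoor", "night"), False),
    (("outdoor", "night"), True),
    (("outdoor", "night"), False),
]

surface = accuracy_surface(records)
overall = aggregate_accuracy(records)

print("aggregate:", overall)
for attrs, acc in sorted(surface.items()):
    print(attrs, acc)
```

In this toy data the aggregate accuracy hides a wide spread across combinations, which is the motivation for reporting a surface. With many attributes, most combinations would have few or no labeled examples, so plain per-cell averaging like this would not be enough on its own.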