QuAcc: Using Quantification to Predict Classifier Accuracy Under Prior Probability Shift
AUTHORS: Lorenzo Volpi, Alejandro Moreo, Fabrizio Sebastiani
WORK PACKAGE: WP8 UbiQuity
URL:https://journals.sagepub.com/doi/abs/10.1177/17248035251338347
Keywords:
Abstract
Using cross-validation to predict the accuracy of a classifier on unseen data can be done reliably only in the absence of dataset shift, i.e., when the training data and the unseen data are IID. In this work we deal instead with the problem of predicting classifier accuracy on unseen data affected by prior probability shift (PPS), an important type of dataset shift. We propose QuAcc, a method built on top of “quantification” algorithms robust to PPS, i.e., algorithms devised for estimating the prevalence values of the classes in unseen data affected by PPS. QuAcc is based on the idea of viewing the cells of the contingency table (on which classifier accuracy is computed) as classes, and of estimating, via a quantification algorithm, their prevalence values on the unseen data labelled by the classifier. We perform systematic experiments in which we compare the prediction error incurred by QuAcc with that of state-of-the-art classifier accuracy prediction (CAP) methods.