{"publication_identifier":{"issn":["0920-5691"],"eissn":["1573-1405"]},"external_id":{"isi":["000494406800001"]},"author":[{"full_name":"Sun, Rémy","last_name":"Sun","first_name":"Rémy"},{"orcid":"0000-0001-8622-7887","last_name":"Lampert","first_name":"Christoph","full_name":"Lampert, Christoph","id":"40C20FD2-F248-11E8-B48F-1D18A9856A87"}],"publication":"International Journal of Computer Vision","has_accepted_license":"1","doi":"10.1007/s11263-019-01232-x","page":"970-995","_id":"6944","language":[{"iso":"eng"}],"citation":{"mla":"Sun, Rémy, and Christoph Lampert. “KS(Conf): A Light-Weight Test If a Multiclass Classifier Operates Outside of Its Specifications.” International Journal of Computer Vision, vol. 128, no. 4, Springer Nature, 2020, pp. 970–95, doi:10.1007/s11263-019-01232-x.","ieee":"R. Sun and C. Lampert, “KS(conf): A light-weight test if a multiclass classifier operates outside of its specifications,” International Journal of Computer Vision, vol. 128, no. 4. Springer Nature, pp. 970–995, 2020.","apa":"Sun, R., & Lampert, C. (2020). KS(conf): A light-weight test if a multiclass classifier operates outside of its specifications. International Journal of Computer Vision. Springer Nature. https://doi.org/10.1007/s11263-019-01232-x","chicago":"Sun, Rémy, and Christoph Lampert. “KS(Conf): A Light-Weight Test If a Multiclass Classifier Operates Outside of Its Specifications.” International Journal of Computer Vision. Springer Nature, 2020. https://doi.org/10.1007/s11263-019-01232-x.","ista":"Sun R, Lampert C. 2020. KS(conf): A light-weight test if a multiclass classifier operates outside of its specifications. International Journal of Computer Vision. 128(4), 970–995.","ama":"Sun R, Lampert C. KS(conf): A light-weight test if a multiclass classifier operates outside of its specifications. International Journal of Computer Vision. 2020;128(4):970-995. doi:10.1007/s11263-019-01232-x","short":"R. Sun, C. Lampert, International Journal of Computer Vision 128 (2020) 970–995."},"article_type":"original","ec_funded":1,"year":"2020","volume":128,"intvolume":" 128","project":[{"grant_number":"308036","name":"Lifelong Learning of Visual Scene Understanding","_id":"2532554C-B435-11E9-9278-68D0E5697425","call_identifier":"FP7"},{"name":"IST Austria Open Access Fund","_id":"B67AFEDC-15C9-11EA-A837-991A96BB2854"}],"ddc":["004"],"file_date_updated":"2020-07-14T12:47:45Z","isi":1,"date_updated":"2024-02-22T14:57:30Z","tmp":{"image":"/images/cc_by.png","name":"Creative Commons Attribution 4.0 International Public License (CC-BY 4.0)","short":"CC BY (4.0)","legal_code_url":"https://creativecommons.org/licenses/by/4.0/legalcode"},"month":"04","article_processing_charge":"Yes (via OA deal)","day":"01","license":"https://creativecommons.org/licenses/by/4.0/","type":"journal_article","file":[{"file_id":"7110","date_created":"2019-11-26T10:30:02Z","file_size":1715072,"date_updated":"2020-07-14T12:47:45Z","content_type":"application/pdf","creator":"dernst","access_level":"open_access","file_name":"2019_IJCV_Sun.pdf","checksum":"155e63edf664dcacb3bdc1c2223e606f","relation":"main_file"}],"abstract":[{"lang":"eng","text":"We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer vision systems for real-world applications, because the quality of a classifier’s predictions cannot be guaranteed if it operates out-of-specs. Previously proposed methods for out-of-specs detection make decisions on the level of single inputs. This, however, is insufficient to achieve low false positive rate and high false negative rates at the same time. In this work, we describe a new procedure named KS(conf), based on statistical reasoning. Its main component is a classical Kolmogorov–Smirnov test that is applied to the set of predicted confidence values for batches of samples. Working with batches instead of single samples allows increasing the true positive rate without negatively affecting the false positive rate, thereby overcoming a crucial limitation of single sample tests. We show by extensive experiments using a variety of convolutional network architectures and datasets that KS(conf) reliably detects out-of-specs situations even under conditions where other tests fail. It furthermore has a number of properties that make it an excellent candidate for practical deployment: it is easy to implement, adds almost no overhead to the system, works with any classifier that outputs confidence scores, and requires no a priori knowledge about how the data distribution could change."}],"department":[{"_id":"ChLa"}],"issue":"4","publication_status":"published","date_published":"2020-04-01T00:00:00Z","scopus_import":"1","publisher":"Springer Nature","user_id":"3E5EF7F0-F248-11E8-B48F-1D18A9856A87","date_created":"2019-10-14T09:14:28Z","oa_version":"Published Version","oa":1,"status":"public","related_material":{"link":[{"relation":"erratum","url":"https://doi.org/10.1007/s11263-019-01262-5"}],"record":[{"id":"6482","relation":"earlier_version","status":"public"}]},"quality_controlled":"1","title":"KS(conf): A light-weight test if a multiclass classifier operates outside of its specifications"}