The optimal input-independent baseline for binary classification:The Dutch Draw

Before any binary classification model is taken into practice, it is important to validate its performance on a proper test set. Without a frame of reference given by a baseline method, it is impossible to determine if a score is “good” or “bad.” The goal of this paper is to examine all baseline methods that are independent of feature values and determine which model is the “best” and why. By identifying which baseline models are optimal, a crucial selection decision in the evaluation process is simplified. We prove that the recently proposed Dutch Draw baseline is the best input-independent c... Mehr ...

Verfasser: Pries, Joris
van de Bijl, Etienne
Klein, Jan
Bhulai, Sandjai
van der Mei, Rob
Dokumenttyp: Artikel
Erscheinungsdatum: 2023
Reihe/Periodikum: Pries , J , van de Bijl , E , Klein , J , Bhulai , S & van der Mei , R 2023 , ' The optimal input-independent baseline for binary classification : The Dutch Draw ' , Statistica Neerlandica , vol. 77 , no. 4 , pp. 543-554 . https://doi.org/10.1111/stan.12297
Schlagwörter: baseline / benchmark / binary classification / evaluation / supervised learning
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-29046534
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://research.vu.nl/en/publications/4418e229-cb44-4f1f-b270-006a8f7e1598

Before any binary classification model is taken into practice, it is important to validate its performance on a proper test set. Without a frame of reference given by a baseline method, it is impossible to determine if a score is “good” or “bad.” The goal of this paper is to examine all baseline methods that are independent of feature values and determine which model is the “best” and why. By identifying which baseline models are optimal, a crucial selection decision in the evaluation process is simplified. We prove that the recently proposed Dutch Draw baseline is the best input-independent classifier (independent of feature values) for all order-invariant measures (independent of sequence order) assuming that the samples are randomly shuffled. This means that the Dutch Draw baseline is the optimal baseline under these intuitive requirements and should therefore be used in practice.