Print Email Facebook Twitter Nuclear discrepancy for single-shot batch active learning Title Nuclear discrepancy for single-shot batch active learning Author Viering, T.J. (TU Delft Pattern Recognition and Bioinformatics) Krijthe, J.H. (TU Delft Pattern Recognition and Bioinformatics) Loog, M. (TU Delft Pattern Recognition and Bioinformatics) Date 2019-09 Abstract Active learning algorithms propose what data should be labeled given a pool of unlabeled data. Instead of selecting randomly what data to annotate, active learning strategies aim to select data so as to get a good predictive model with as little labeled samples as possible. Single-shot batch active learners select all samples to be labeled in a single step, before any labels are observed.We study single-shot active learners that minimize generalization bounds to select a representative sample, such as the maximum mean discrepancy (MMD) active learner.We prove that a related bound, the discrepancy, provides a tighter worst-case bound. We study these bounds probabilistically, which inspires us to introduce a novel bound, the nuclear discrepancy (ND). The ND bound is tighter for the expected loss under optimistic probabilistic assumptions. Our experiments show that the MMD active learner performs better than the discrepancy in terms of the mean squared error, indicating that tighter worst case bounds do not imply better active learning performance. The proposed active learner improves significantly upon the MMD and discrepancy in the realizable setting and a similar trend is observed in the agnostic setting, showing the benefits of a probabilistic approach to active learning. Our study highlights that assumptions underlying generalization bounds can be equally important as bound-tightness, when it comes to active learning performance. Code for reproducing our experimental results can be found at https://github.com/tomviering/ NuclearDiscrepancy. Subject Active learningDiscrepancyKernel methodsMaximum mean discrepancy To reference this document use: http://resolver.tudelft.nl/uuid:e31a0297-93c8-48ee-9eab-d30481e76f93 DOI https://doi.org/10.1007/s10994-019-05817-y ISSN 0885-6125 Source Machine Learning, 108 (8-9), 1561-1599 Part of collection Institutional Repository Document type journal article Rights © 2019 T.J. Viering, J.H. Krijthe, M. Loog Files PDF Viering2019_Article_Nucle ... gle_sh.pdf 1007.32 KB Close viewer /islandora/object/uuid:e31a0297-93c8-48ee-9eab-d30481e76f93/datastream/OBJ/view