Computing the Testing Error without a Testing Set

Cited by: 33
Authors
Corneanu, Ciprian A. [1 ]
Escalera, Sergio [2 ]
Martinez, Aleix M. [3 ]
Affiliations
[1] Univ Barcelona, Tawny, Barcelona, Spain
[2] Univ Barcelona, CVC, Barcelona, Spain
[3] OSU, Amazon, Oklahoma City, OK USA
Source
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020
Keywords
DOI
10.1109/CVPR42600.2020.00275
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep Neural Networks (DNNs) have revolutionized computer vision. We now have DNNs that achieve top (performance) results in many problems, including object recognition, facial expression analysis, and semantic segmentation, to name but a few. The design of the DNNs that achieve top results is, however, non-trivial and mostly done by trial-and-error. That is, typically, researchers will derive many DNN architectures (i.e., topologies) and then test them on multiple datasets. However, there are no guarantees that the selected DNN will perform well in the real world. One can use a testing set to estimate the performance gap between the training and testing sets, but avoiding overfitting-to-the-testing-data is almost impossible. Using a sequestered testing dataset may address this problem, but this requires a constant update of the dataset, a very expensive venture. Here, we derive an algorithm to estimate the performance gap between training and testing that does not require any testing dataset. Specifically, we derive a number of persistent topology measures that identify when a DNN is learning to generalize to unseen samples. This allows us to compute the DNN's testing error on unseen samples, even when we do not have access to them. We provide extensive experimental validation on multiple networks and datasets to demonstrate the feasibility of the proposed approach.
Pages: 2674-2682
Number of pages: 9