On the trade-off between number of examples and precision of supervision in machine learning problems

被引:6
作者
Gnecco, Giorgio [1 ]
Nutarelli, Federico [1 ]
机构
[1] IMT Sch Adv Studies, Piazza S Francesco 19, I-55100 Lucca, Italy
关键词
Optimal supervision time; Linear regression; Variance control; Ordinary least squares; Large-sample approximation;
D O I
10.1007/s11590-019-01486-x
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We investigate linear regression problems for which one is given the additional possibility of controlling the conditional variance of the output given the input, by varying the computational time dedicated to supervise each example. For a given upper bound on the total computational time for supervision, we optimize the trade-off between the number of examples and their precision (the reciprocal of the conditional variance of the output), by formulating and solving suitable optimization problems, based on large-sample approximations of the outputs of the classical ordinary least squares and weighted least squares regression algorithms. Considering a specific functional form for that precision, we prove that there are cases in which "many but bad" examples provide a smaller generalization error than "few but good" ones, but also that the converse can occur, depending on the "returns to scale" of the precision with respect to the computational time assigned to supervise each example. Hence, the results of this study highlight that increasing the size of the dataset is not always beneficial, if one has the possibility to collect a smaller number of more reliable examples. We conclude presenting numerical results validating the theory, and discussing extensions of the proposed framework to other optimization problems.
引用
收藏
页码:1711 / 1733
页数:23
相关论文
共 18 条
[1]  
[Anonymous], 1998, STAT LEARNING THEORY
[2]  
[Anonymous], 2004, Kernel methods for pattern analysis
[3]   Recursive partitioning for heterogeneous causal effects [J].
Athey, Susan ;
Imbens, Guido .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7353-7360
[4]  
Bacigalupo A, 2018, J PHYS C P, V1092, P4
[5]   Optimal design of low-frequency band gaps in anti-tetrachiral lattice meta-materials [J].
Bacigalupo, Andrea ;
Gnecco, Giorgio ;
Lepidi, Marco ;
Gambarotta, Luigi .
COMPOSITES PART B-ENGINEERING, 2017, 115 :341-359
[6]   Optimal design of auxetic hexachiral metamaterials with local resonators [J].
Bacigalupo, Andrea ;
Lepidi, Marco ;
Gnecco, Giorgio ;
Gambarotta, Luigi .
SMART MATERIALS AND STRUCTURES, 2016, 25 (05)
[7]  
Barlow R., 1989, Statistics: A Guide to the Use of Statistical Method in the Physical Sciences
[8]  
Gnecco G., 2019, P 4 INT C INT NEUR N, P1
[9]  
Gnecco G, 2017, NEURAL COMPUT, V29, P2203, DOI [10.1162/neco_a_00976, 10.1162/NECO_a_00976]
[10]  
Greene H.W., 2008, ECONOMETRIC ANAL, V6th