Learning systems often describe a target class as a disjunction of conjunctions of conditions. Recent work has noted that small disjuncts, i.e., those supported by few training examples, typically have poor predictive accuracy. One model of this accuracy is provided by the Bayes-Laplace formula based on the number of training examples covered by the disjunct and the number of them belonging to the target class. However, experiments show that small disjunts associated with target classes of different relative frequencies tend to have different error rates. This note defines the context of a disjunct as the set of training examples that fail to satisfy at most one of its conditions. An empirical adaptation of the Bayes-Laplace formula is presented that also makes use of the relative frequency of the target class in this context. Trials are reported comparing the performance of the original formula and the adaptation in six learning tasks.
机构:
CALIF POLYTECH STATE UNIV SAN LUIS OBISPO, IRRIG TRAINING & RES CTR, SAN LUIS OBISPO, CA 93407 USACALIF POLYTECH STATE UNIV SAN LUIS OBISPO, IRRIG TRAINING & RES CTR, SAN LUIS OBISPO, CA 93407 USA
Clemmens, AJ
Burt, CM
论文数: 0引用数: 0
h-index: 0
机构:
CALIF POLYTECH STATE UNIV SAN LUIS OBISPO, IRRIG TRAINING & RES CTR, SAN LUIS OBISPO, CA 93407 USACALIF POLYTECH STATE UNIV SAN LUIS OBISPO, IRRIG TRAINING & RES CTR, SAN LUIS OBISPO, CA 93407 USA
机构:
Washington State Univ, Dept Chem, Pullman, WA 99164 USAWashington State Univ, Dept Chem, Pullman, WA 99164 USA
Feller, David
Peterson, Kirk A.
论文数: 0引用数: 0
h-index: 0
机构:
Washington State Univ, Dept Chem, Pullman, WA 99164 USAWashington State Univ, Dept Chem, Pullman, WA 99164 USA
Peterson, Kirk A.
Ruscic, Branko
论文数: 0引用数: 0
h-index: 0
机构:
Argonne Natl Lab, Chem Sci & Engn Div, Argonne, IL 60439 USA
Univ Chicago, Computat Inst, Chicago, IL 60637 USAWashington State Univ, Dept Chem, Pullman, WA 99164 USA