The aspect Bernoulli model: multiple causes of presences and absences

被引:19
作者
Bingham, Ella [1 ,2 ]
Kaban, Ata [3 ]
Fortelius, Mikael [4 ]
机构
[1] Univ Helsinki, Helsinki Inst Informat Technol, FIN-00014 Helsinki, Finland
[2] Aalto Univ, Helsinki 00014, Finland
[3] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[4] Univ Helsinki, Div Palaeontol, FIN-00014 Helsinki, Finland
关键词
Data mining; Probabilistic latent variable models; Multiple cause models; 0-1; data; SELECTION;
D O I
10.1007/s10044-007-0096-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a probabilistic multiple cause model for the analysis of binary (0-1) data. A distinctive feature of the aspect Bernoulli (AB) model is its ability to automatically detect and distinguish between "true absences" and "false absences" (both of which are coded as 0 in the data), and similarly, between "true presences" and "false presences" (both of which are coded as 1). This is accomplished by specific additive noise components which explicitly account for such non-content bearing causes. The AB model is thus suitable for noise removal and data explanatory purposes, including omission/addition detection. An important application of AB that we demonstrate is data-driven reasoning about palaeontological recordings. Additionally, results on recovering corrupted handwritten digit images and expanding short text documents are also given, and comparisons to other methods are demonstrated and discussed.
引用
收藏
页码:55 / 78
页数:24
相关论文
共 64 条
[1]  
Akaike H., 1992, Breakthroughs in Statistics, P610, DOI [10.1007/978-1-4612-0919-538, DOI 10.1007/978-1-4612-0919-538, DOI 10.1007/978-1-4612-1694-0_15]
[2]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[3]  
[Anonymous], 2004, Advances in Neural Information Processing Systems
[4]  
[Anonymous], TR98042 INT COMP SCI
[5]   Finding Minimum Entropy Codes [J].
Barlow, H. B. ;
Kaushal, T. P. ;
Mitchison, G. J. .
NEURAL COMPUTATION, 1989, 1 (03) :412-423
[6]   Matching words and pictures [J].
Barnard, K ;
Duygulu, P ;
Forsyth, D ;
de Freitas, N ;
Blei, DM ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1107-1135
[7]  
Barnard K, 2001, PROC CVPR IEEE, P434
[8]  
Barry JC, 2002, PALEOBIOLOGY, V28, P1, DOI 10.1666/0094-8373(2002)28[1:FAECIT]2.0.CO
[9]  
2
[10]  
Bernardo J. M., 1994, Bayesian theory, DOI 10.1002/9780470316870