Noise Analysis of Duplicated Data on Microarrays Using Mixture Distribution Modeling

被引:0
作者
Masaru Takeya
Takehiro Matsuda
Masao Iwamoto
Norimichi Tsumura
Toshiya Nakaguchi
Yoichi Miyake
机构
[1] Chiba University,Graduate School of Science and Technology
[2] National Institute of Agrobiological Sciences,Division of Plant Sciences
[3] National Institute of Agrobiological Sciences,Division of Genome and Biodiversity Research
来源
Optical Review | 2007年 / 14卷
关键词
cDNA microarray; mixture distribution model; duplicated data; circadian rhythms;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a technique for estimating gene expression values for duplicated data on cDNA microarrays. In the scatter plots, the distribution is constructed from a mixture of normal two-dimensional distributions, which represent fluctuations in gene expression values due to noise. An expectation-maximization (EM) algorithm is used for estimating the modeling parameters. The probability that duplicated data is shifted by noise is calculated using Bayesian estimation. Six data sets of rice cDNA microarray assays were used to test the proposed technique. Genes in the data sets were subjected to clustering based on probability of true value. Clustering successfully identified candidate genes regulated by circadian rhythms in rice.
引用
收藏
页码:97 / 104
页数:7
相关论文
共 81 条
[1]  
Holstege F. C.(1998)undefined Cell 95 717-undefined
[2]  
Jennings E. G.(1999)undefined Nat. Genet. 21 33-undefined
[3]  
Wyrich J. J.(1996)undefined Genome Res. 6 492-undefined
[4]  
Lee T. I.(2000)undefined Nucleic Acids Res. 28 e47-undefined
[5]  
Hengartner C. J.(2002)undefined J. Biomed. Opt. 7 507-undefined
[6]  
Green M. R.(2002)undefined Proc. Natl. Acad. Sci. U.S.A. 99 14031-undefined
[7]  
Golub T. R.(2003)undefined J. Comput. Biol. 10 433-undefined
[8]  
Lander E. S.(2000)undefined Proc. Natl. Acad. Sci. U.S.A. 97 9834-undefined
[9]  
Young R. A.(2000)undefined DNA Res. 7 367-undefined
[10]  
Brown P. O.(2004)undefined Bioinformatics 20 2016-undefined