GPMS: A Genetic Programming Based Approach to Multiple Alignment of Liquid Chromatography-Mass Spectrometry Data

被引:2
作者
Ahmed, Soha [1 ]
Zhang, Mengjie [1 ]
Peng, Lifeng [2 ]
机构
[1] Sch Engn & Comp Sci, Wellington, New Zealand
[2] Victoria Univ Wellington, Wellington 6140, New Zealand
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION | 2014年 / 8602卷
关键词
LC-MS; PROTEOMICS; MZMINE; SUITE;
D O I
10.1007/978-3-662-45523-4_74
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Alignment of samples from Liquid chromatography-mass spectrometry (LC-MS) measurements has a significant role in the detection of biomarkers and in metabolomic studies. The machine drift causes differences between LC-MS measurements, and an accurate alignment of the shifts introduced to the same peptide or metabolite is needed. In this paper, we propose the use of genetic programming (GP) for multiple alignment of LC-MS data. The proposed approach consists of two main phases. The first phase is the peak matching where the peaks from different LC-MS maps (peak lists) are matched to allow the calculation of the retention time deviation. The second phase is to use GP for multiple alignment of the peak lists with respect to a reference. In this paper, GP is designed to perform multiple-output regression by using a special node in the tree which divides the output of the tree into multiple outputs. Finally, the peaks that show the maximum correlation after dewarping the retention times are selected to form a consensus aligned map. The proposed approach is tested on one proteomics and two metabolomics LCMS datasets with different number of samples. The method is compared to several benchmark methods and the results show that the proposed approach outperforms these methods in three fractions of the protoemics dataset and the metabolomics dataset with a larger number of maps. Moreover, the results on the rest of the datasets are highly competitive with the other methods.
引用
收藏
页码:915 / 927
页数:13
相关论文
共 22 条
  • [1] Ahalpara D. P., 2010, GECCO 10 P 12 ANN C, P977
  • [2] [Anonymous], P DETC 04 ASME 2004
  • [3] A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS
    Bellew, Matthew
    Coram, Marc
    Fitzgibbon, Matthew
    Igra, Mark
    Randolph, Tim
    Wang, Pei
    May, Damon
    Eng, Jimmy
    Fang, Ruihua
    Lin, ChenWei
    Chen, Jinzhi
    Goodlett, David
    Whiteaker, Jeffrey
    Paulovich, Amanda
    McIntosh, Martin
    [J]. BIOINFORMATICS, 2006, 22 (15) : 1902 - 1909
  • [4] Heidi Vhmaa Ville R., 2011, INT J PROTEOMICS, P1
  • [5] MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data
    Katajamaa, M
    Miettinen, J
    Oresic, M
    [J]. BIOINFORMATICS, 2006, 22 (05) : 634 - 636
  • [6] TOPP -: the OpenMS proteomics pipeline
    Kohlbacher, Oliver
    Reinert, Knut
    Groepl, Clemens
    Lange, Eva
    Pfeifer, Nico
    Schulz-Trieglaff, Ole
    Sturm, Marc
    [J]. BIOINFORMATICS, 2007, 23 (02) : E191 - E197
  • [7] A geometric approach for the alignment of liquid chromatography -: mass spectrometry data
    Lange, Eva
    Groepl, Clemens
    Schulz-Trieglaff, Ole
    Leinenbach, Andreas
    Huber, Christian
    Reinert, Knut
    [J]. BIOINFORMATICS, 2007, 23 (13) : I273 - I281
  • [8] Critical assessment of alignment procedures for LC- MS proteomics and metabolomics measurements
    Lange, Eva
    Tautenhahn, Ralf
    Neumann, Steffen
    Groepl, Clemens
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [9] A software suite for the generation and comparison of peptide arrays from sets of data collected by liquid chromatography-mass spectrometry
    Li, XJ
    Yi, EC
    Kemp, CJ
    Zhang, H
    Aebersold, R
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (09) : 1328 - 1340
  • [10] Difference detection in LC-MS data for protein biomarker discovery
    Listgarten, Jennifer
    Neal, Radford M.
    Roweis, Sam T.
    Wong, Peter
    Emili, Andrew
    [J]. BIOINFORMATICS, 2007, 23 (02) : E198 - E204