Controlling the false discovery rate for feature selection in high-resolution NMR spectra

Cited by: 18
Authors
Kim, Seoung Bum [1 ]
Chen, Victoria C. P. [1 ]
Park, Youngja [2 ]
Ziegler, Thomas R. [2 ]
Jones, Dean P. [2 ]
Affiliations
[1] Department of Industrial and Manufacturing Systems Engineering, University of Texas at Arlington, Arlington, TX
[2] Clinical Biomarker Laboratory, Center for Clinical and Molecular Nutrition, Department of Medicine, Emory University, Atlanta, GA
Source
Statistical Analysis and Data Mining | 2008, Vol. 1, No. 2
Keywords
False discovery rate; Feature selection; Metabolomics; Nuclear magnetic resonance; Orthogonal signal correction
DOI
10.1002/sam.10005
Abstract
Successful implementation of feature selection in nuclear magnetic resonance (NMR) spectra not only improves classification ability, but also simplifies the entire modeling process and, thus, reduces computational and analytical efforts. Principal component analysis (PCA) and partial least squares (PLS) have been widely used for feature selection in NMR spectra. However, extracting meaningful metabolite features from the reduced dimensions obtained through PCA or PLS is complicated because these reduced dimensions are linear combinations of a large number of the original features. In this paper, we propose a multiple testing procedure controlling false discovery rate (FDR) as an efficient method for feature selection in NMR spectra. The procedure clearly compensates for the limitation of PCA and PLS and identifies individual metabolite features necessary for classification. In addition, we present orthogonal signal correction to improve classification and visualization by removing unnecessary variations in NMR spectra. Our experimental results with real NMR spectra showed that classification models constructed with the features selected by our proposed procedure yielded smaller misclassification rates than those with all features. © 2008 Wiley Periodicals, Inc.
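The abstract does not say which FDR-controlling procedure the authors use. As an illustration only, the sketch below assumes the widely used Benjamini-Hochberg step-up procedure, applied to one p-value per spectral feature (for example, from a two-group test at each chemical-shift position). The function name `benjamini_hochberg` and the level `q` are hypothetical choices for this sketch, not taken from the paper.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return the sorted indices of features selected at FDR level q.

    Assumes the Benjamini-Hochberg step-up rule: find the largest rank k
    such that the k-th smallest p-value is <= k * q / m, then select the
    k features with the smallest p-values.
    """
    m = len(p_values)
    # Feature indices ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * q / m:
            k = rank  # keep the largest rank that passes its threshold
    return sorted(order[:k])


# Hypothetical p-values for ten spectral features (illustrative only).
p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
selected = benjamini_hochberg(p, q=0.05)
```

Unlike PCA or PLS components, the returned indices refer directly to individual original features, which is the property the abstract emphasizes.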
Pages: 57 - 66
Page count: 9
Related papers
50 in total
  • [21] High resolution NMR spectra of fluorotrimethylsilane
    Wilson, W. W.
    Haiges, R.
    Christe, K. O.
    JOURNAL OF FLUORINE CHEMISTRY, 2023, 270
  • [22] In-Phase Ultra High-Resolution In Vivo NMR
    Fugariu, Ioana
    Bermel, Wolfgang
    Lane, Daniel
    Soong, Ronald
    Simpson, Andre J.
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2017, 56 (22) : 6324 - 6328
  • [23] Controlling Bayes directional false discovery rate in random effects model
    Sarkar, Sanat K.
    Zhou, Tianhui
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (03) : 682 - 693
  • [24] An exponentially weighted moving average chart controlling false discovery rate
    Lee, Sang-Ho
    Park, Jang-Ho
    Jun, Chi-Hyuck
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2014, 84 (08) : 1830 - 1840
  • [25] Classifying genes according to predefined patterns by controlling false discovery rate
    Park, Hae-Sang
    Jun, Chi-Hyuck
    Yoo, Joo-Yeon
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (09) : 11753 - 11759
  • [26] High-resolution NMR in the native state
    Baldus, Marc
    IUCRJ, 2017, 4 : 102 - 103
  • [27] Controlling the false discovery rate and increasing statistical power in ecological studies
    Waite, Thomas A.
    Campbell, Lesley G.
    ECOSCIENCE, 2006, 13 (04) : 439 - 442
  • [28] Use of high-resolution NMR spectra transformed by paramagnetic complexes for studying molecular structure
    Voronov, Vladimir K.
    IZVESTIYA VUZOV-PRIKLADNAYA KHIMIYA I BIOTEKHNOLOGIYA, 2019, 9 (02) : 183 - 193
  • [29] A new method for high-resolution NMR spectra in inhomogeneous fields with efficient solvent suppression
    Lin Mei-Jin
    Chen Xi
    Chen Zhi-Wei
    Chen Zhong
    CHINESE JOURNAL OF CHEMISTRY, 2007, 25 (06) : 751 - 755
  • [30] Toward high-resolution NMR spectroscopy of microscopic liquid samples
    Butler, Mark C.
    Mehta, Hardeep S.
    Chen, Ying
    Reardon, Patrick N.
    Renslow, Ryan S.
    Khbeis, Michael
    Irish, Duane
    Mueller, Karl T.
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2017, 19 (22) : 14256 - 14261