A novel storage method for near infrared spectroscopy chemometric models

被引:1
作者
Zhang, Zhi-Min [1 ]
Chen, Shan [1 ]
Liang, Yi-Zeng [1 ]
机构
[1] Cent South Univ, Res Ctr Modernizat Chinese Med, Coll Chem & Chem Engn, Changsha 410083, Hunan, Peoples R China
关键词
Chemometrics; Modeling; Extensible Markup Language; Model storage; Near infrared spectroscopy; Raman; WORLD-WIDE-WEB; CHEMICAL MARKUP; XML VOCABULARY; REPRESENTATION; PREDICTION; ROBUST;
D O I
10.1016/j.aca.2010.04.032
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Chemometric Modeling Markup Language (CMML) is developed by us for containing chemometrics models within one document through converting binary data into strings by base64 encode/decode algorithms to solve the interoperability issue in sharing chemometrics models. It provides a base functionality for storage of sampling, variable selection, pretreating, outlier and modeling parameters and data. With the help of base64 algorithm, the usability of CMML is in equilibrium with size by transforming the binary data into base64 encoded string. Due to the advantages of Extensible Markup Language (XML), models stored in CMML can be easily reused in various other software and programming languages as long as the programming language has XML parsing library. One can also use the XML Path Language (XPath) query language to select desired data from the CMML file effectively. The application of this language in near infrared spectroscopy model storage is implemented as a class in C++ language and available as open source software (http://code.google.com/p/cmml), and the implementations in other languages, such as MATLAB and R are in progress. (c) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:149 / 154
页数:6
相关论文
共 33 条
[11]  
DYCK M, 2009, XML PATH LANGUAGE XP
[12]   Calibration model transfer for near-infrared spectra based on canonical correlation analysis [J].
Fan, Wei ;
Liang, Yizeng ;
Yuan, Dalin ;
Wang, Jiajun .
ANALYTICA CHIMICA ACTA, 2008, 623 (01) :22-29
[13]   Chemical markup, XML, and the World-Wide Web. 3. Toward a signed semantic chemical web of trust [J].
Gkoutos, GV ;
Murray-Rust, P ;
Rzepa, HS ;
Wright, M .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (05) :1124-1130
[14]   Blue Obelisk - Interoperability in chemical informatics [J].
Guha, Rajarshi ;
Howard, Michael T. ;
Hutchison, Geoffrey R. ;
Murray-Rust, Peter ;
Rzepa, Henry ;
Steinbeck, Christoph ;
Wegner, Jorg ;
Willighagen, Egon L. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (03) :991-998
[15]   Chemical markup, XML, and the world wide web. 6. CMLReact, an XML vocabulary for chemical reactions [J].
Holliday, GL ;
Murray-Rust, P ;
Rzepa, HS .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (01) :145-157
[16]   The systems biology markup language (SBML):: a medium for representation and exchange of biochemical network models [J].
Hucka, M ;
Finney, A ;
Sauro, HM ;
Bolouri, H ;
Doyle, JC ;
Kitano, H ;
Arkin, AP ;
Bornstein, BJ ;
Bray, D ;
Cornish-Bowden, A ;
Cuellar, AA ;
Dronov, S ;
Gilles, ED ;
Ginkel, M ;
Gor, V ;
Goryanin, II ;
Hedley, WJ ;
Hodgman, TC ;
Hofmeyr, JH ;
Hunter, PJ ;
Juty, NS ;
Kasberger, JL ;
Kremling, A ;
Kummer, U ;
Le Novère, N ;
Loew, LM ;
Lucio, D ;
Mendes, P ;
Minch, E ;
Mjolsness, ED ;
Nakayama, Y ;
Nelson, MR ;
Nielsen, PF ;
Sakurada, T ;
Schaff, JC ;
Shapiro, BE ;
Shimizu, TS ;
Spence, HD ;
Stelling, J ;
Takahashi, K ;
Tomita, M ;
Wagner, J ;
Wang, J .
BIOINFORMATICS, 2003, 19 (04) :524-531
[17]  
JOSEFSSON S, 2003, 3548 RFC
[18]   Chemical markup, XML, and the world wide web. 7. CMLSpect, an XML vocabulary for spectral data [J].
Kuhn, Stefan ;
Helmus, Tobias ;
Lancashire, Robert J. ;
Murray-Rust, Peter ;
Rzepa, Henry S. ;
Steinbeck, Christoph ;
Willighagen, Egon L. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (06) :2015-2034
[19]  
Malhotra A, 2004, XML SCHEMA 2
[20]   Chemical Markup, XML, and the World Wide Web. 5. Applications of chemical metadata in RSS aggregators [J].
Murray-Rust, P ;
Rzepa, HS ;
Williamson, MJ ;
Willighagen, EL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (02) :462-469