Data flow modeling, data mining and QSAR in high-throughput discovery of functional nanomaterials

被引:25
|
作者
Yang, Yang [1 ]
Lin, Tian [1 ]
Weng, Xiao L. [2 ]
Darr, Jawwad A. [2 ]
Wang, Xue Z. [1 ]
机构
[1] Univ Leeds, Inst Particle Sci & Engn, Sch Proc Environm & Mat Engn, Leeds LS2 9JT, W Yorkshire, England
[2] UCL, Dept Chem, London WC1H 0AJ, England
基金
英国工程与自然科学研究理事会;
关键词
Data mining; QSAR; Design of experiments; Genetic algorithm; Nanoparticle; High-throughput; PROCESS OPERATIONAL DATA; CONTINUOUS HYDROTHERMAL SYNTHESIS; CEO2-ZRO2 MIXED OXIDES; SOLID-SOLUTIONS; DECISION TREES; ECOTOXICITY DATA; CERIA; NANOPARTICLES; CATALYSTS; COMBINATORIAL;
D O I
10.1016/j.compchemeng.2010.04.018
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Metal oxide nanoparticles are promising materials in applications for fuel cells, gas sensors and fine chemical catalysis. Their functionality depends excessively on composition, structure as well as synthesis and processing conditions. Continuous hydrothermal flow synthesis (CHFS) reactors are an effective technology to make nanoceramics. In order to increase sample throughput of CHFS, a manual high-throughput continuous hydrothermal (HiTCH) flow synthesis process capable of formulating scores of samples per day was developed. More recently, a fully automated nanoceramics synthesis platform called RAMSI (rapid automated synthesis instrument) based on the HiTCH synthesis technology was developed. When large numbers of nanoceramics are made and formulated into appropriate libraries, automated analytical instruments can be used to allow collection of a large amount of useful data. This paper describes the information flow management system of RAMSI (as well as CHFS) and the data mining system for supporting discovery, QSAR (quantitative structure-activity relationship) modeling and DoE (design of experiments). Case studies demonstrating the use of the high-throughput data mining system are presented. These include clustering of Raman spectra, interpretation of X-ray diffraction (XRD) measurements, and QSAR model building linking XRD data and photocatalytic properties. A genetic algorithm method for DoE is also presented that can guide the experiments to search optimal XRD patterns. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:671 / 678
页数:8
相关论文
共 50 条
  • [31] Inductive Data Mining: Automatic Generation of Decision Trees from Data for QSAR Modelling and Process Historical Data Analysis
    Ma, Chao Y.
    Buontempo, Frances V.
    Wang, Xue Z.
    18TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, 2008, 25 : 581 - 586
  • [32] High-throughput multi-multicast transfers in data center networks
    Raúl H. Palacios
    Antonio F. Díaz
    Mancia Anguita
    Julio Ortega
    Cristina Rodríguez-Quintana
    The Journal of Supercomputing, 2017, 73 : 152 - 163
  • [33] High-throughput multi-multicast transfers in data center networks
    Palacios, Raul H.
    Diaz, Antonio F.
    Anguita, Mancia
    Ortega, Julio
    Rodriguez-Quintana, Cristina
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (01) : 152 - 163
  • [34] msBiodat analysis tool, big data analysis for high-throughput experiments
    Munoz-Torres, Pau M.
    Rokc, Filip
    Beluzic, Robert
    Grbesa, Ivana
    Vugrek, Oliver
    BIODATA MINING, 2016, 9
  • [35] Recent Progress in High-Throughput Enzymatic DNA Synthesis for Data Storage
    Baek, David
    Joe, Sung-Yune
    Shin, Haewon
    Park, Chaewon
    Jo, Seokwoo
    Chun, Honggu
    BIOCHIP JOURNAL, 2024, 18 (3) : 357 - 372
  • [36] Hierarchical lightweight high-throughput blockchain for industrial Internet data security
    Xu X.
    Jin Y.
    Zeng Z.
    Yang S.
    Chen R.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (12): : 3258 - 3266
  • [37] Kinetic data acquisition in high-throughput Fischer-Tropsch experimentation
    Hazemann, Paul
    Decottignies, Dominique
    Maury, Sylvie
    Humbert, Severine
    Berliet, Adrien
    Daniel, Cecile
    Schuurman, Yves
    CATALYSIS SCIENCE & TECHNOLOGY, 2020, 10 (21) : 7331 - 7343
  • [38] msBiodat analysis tool, big data analysis for high-throughput experiments
    Pau M. Muñoz-Torres
    Filip Rokć
    Robert Belužic
    Ivana Grbeša
    Oliver Vugrek
    BioData Mining, 9
  • [39] Data mining to support simulation modeling of patient flow in hospitals
    Isken, MW
    Rajagopalan, B
    JOURNAL OF MEDICAL SYSTEMS, 2002, 26 (02) : 179 - 197
  • [40] Data Mining to Support Simulation Modeling of Patient Flow in Hospitals
    Mark W. Isken
    Balaji Rajagopalan
    Journal of Medical Systems, 2002, 26 : 179 - 197