Natural Language Processing-Driven Microscopy Ontology Development

被引:2
作者
Bayerlein, Bernd [1 ]
Schilling, Markus [1 ]
Curran, Maurice [2 ]
Campbell, Carelyn E. [3 ]
Dima, Alden A. [3 ]
Birkholz, Henk [4 ]
Lau, June W. [3 ]
机构
[1] Bundesanstalt Materialforschung und prufung BAM, Met High Temp Mat, D-12205 Berlin, Germany
[2] Univ Virginia, Dept Chem Engn, Charlottesville, VA USA
[3] Natl Inst Stand & Technol NIST, Gaithersburg, MD USA
[4] Leibniz Inst Mat Engn, IWT, Bremen, Germany
关键词
Microscopy ontology; Natural language processing; Semantic interoperability; Data integration; Ontology development acceleration; Data discovery enhancement; KNOWLEDGE;
D O I
10.1007/s40192-024-00378-y
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This manuscript describes the accelerated development of an ontology for microscopy in materials science and engineering, leveraging natural language processing (NLP) techniques. Drawing from a comprehensive corpus comprising over 14 k contributions to the Microscopy and Microanalysis conference series, we employed two neural network-based algorithms for NLP. The goal was to semiautomatically create the Microscopy Ontology (MO) that encapsulates and interconnects the terminology most frequently used by the community. The MO, characterized by its interlinked entities and relationships, is designed to enhance the quality of user query results within NexusLIMS. This enhancement is facilitated through the concurrent querying of related terms and the seamless integration of logical connections.
引用
收藏
页码:915 / 926
页数:12
相关论文
共 61 条
[51]   NexusLIMS: A Laboratory Information Management System for Shared-Use Electron Microscopy Facilities [J].
Taillon, Joshua A. ;
Bina, Thomas F. ;
Plante, Raymond L. ;
Newrock, Marcus W. ;
Greene, Gretchen R. ;
Lau, June W. .
MICROSCOPY AND MICROANALYSIS, 2021, 27 (03) :511-527
[52]   Visualizing Scientists' Cognitive Representation of Materials Data through the Application of Ontology [J].
Takahashi, Lauren ;
Takahashi, Keisuke .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2019, 10 (23) :7482-7491
[53]  
TIB Terminology Service, US
[54]   Unsupervised word embeddings capture latent knowledge from materials science literature [J].
Tshitoyan, Vahe ;
Dagdelen, John ;
Weston, Leigh ;
Dunn, Alexander ;
Rong, Ziqin ;
Kononova, Olga ;
Persson, Kristin A. ;
Ceder, Gerbrand ;
Jain, Anubhav .
NATURE, 2019, 571 (7763) :95-+
[55]   The Intersection Between Semantic Web and Materials Science [J].
Valdestilhas, Andre ;
Bayerlein, Bernd ;
Moreno Torres, Benjamin ;
Zia, Ghezal Ahmad Jan ;
Muth, Thilo .
ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (08)
[56]  
Van Rossum G., 1995, Python reference manual
[57]  
w3.org, TERSE RDF TRIPLE LAN
[58]  
w3.org, W3C SKOS SIMPLE KNOW
[59]   Named Entity Recognition and Normalization Applied to Large-Scale Information Extraction from the Materials Science Literature [J].
Weston, L. ;
Tshitoyan, V ;
Dagdelen, J. ;
Kononova, O. ;
Trewartha, A. ;
Persson, K. A. ;
Ceder, G. ;
Jain, A. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (09) :3692-3702
[60]   Comment: The FAIR Guiding Principles for scientific data management and stewardship [J].
Wilkinson, Mark D. ;
Dumontier, Michel ;
Aalbersberg, IJsbrand Jan ;
Appleton, Gabrielle ;
Axton, Myles ;
Baak, Arie ;
Blomberg, Niklas ;
Boiten, Jan-Willem ;
Santos, Luiz Bonino da Silva ;
Bourne, Philip E. ;
Bouwman, Jildau ;
Brookes, Anthony J. ;
Clark, Tim ;
Crosas, Merce ;
Dillo, Ingrid ;
Dumon, Olivier ;
Edmunds, Scott ;
Evelo, Chris T. ;
Finkers, Richard ;
Gonzalez-Beltran, Alejandra ;
Gray, Alasdair J. G. ;
Groth, Paul ;
Goble, Carole ;
Grethe, Jeffrey S. ;
Heringa, Jaap ;
't Hoen, Peter A. C. ;
Hooft, Rob ;
Kuhn, Tobias ;
Kok, Ruben ;
Kok, Joost ;
Lusher, Scott J. ;
Martone, Maryann E. ;
Mons, Albert ;
Packer, Abel L. ;
Persson, Bengt ;
Rocca-Serra, Philippe ;
Roos, Marco ;
van Schaik, Rene ;
Sansone, Susanna-Assunta ;
Schultes, Erik ;
Sengstag, Thierry ;
Slater, Ted ;
Strawn, George ;
Swertz, Morris A. ;
Thompson, Mark ;
van der Lei, Johan ;
van Mulligen, Erik ;
Velterop, Jan ;
Waagmeester, Andra ;
Wittenburg, Peter .
SCIENTIFIC DATA, 2016, 3