Desiderata for ontologies to be used in semantic annotation of biomedical documents

Cited by: 8
Authors
Bada, Michael [1]
Hunter, Lawrence [1]
Affiliation
[1] Univ Colorado Denver, Dept Pharmacol, Aurora, CO 80045 USA
Keywords
Ontologies; Annotation; Desiderata; Corpus; NLP; Terminologies; OBO; Markup; KNOWLEDGE; UNIFICATION; EVOLUTION; DATABASE; CORPUS; TOOLS
DOI
10.1016/j.jbi.2010.10.002
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
A wealth of knowledge valuable to the translational research scientist is contained within the vast biomedical literature, but this knowledge is typically in the form of natural language. Sophisticated natural-language-processing systems are needed to translate text into unambiguous formal representations grounded in high-quality consensus ontologies, and these systems in turn rely on gold-standard corpora of annotated documents for training and testing. To this end, we are constructing the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-text biomedical journal articles that are being manually annotated with the entire sets of terms from select vocabularies, predominantly from the Open Biomedical Ontologies (OBO) library. Our efforts in building this corpus have illuminated infelicities of these ontologies with respect to the semantic annotation of biomedical documents, and we propose desiderata whose implementation could substantially improve their utility in this task; these include the integration of overlapping terms across OBOs, the resolution of OBO-specific ambiguities, the integration of the BFO with the OBOs and the use of mid-level ontologies, the inclusion of noncanonical instances, and the expansion of relations and realizable entities. (C) 2010 Elsevier Inc. All rights reserved.
Pages: 94-101
Number of pages: 8
Related papers
  • [1] Semantic Similarity in Biomedical Ontologies
    Pesquita, Catia
    Faria, Daniel
    Falcao, Andre O.
    Lord, Phillip
    Couto, Francisco M.
    PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
  • [2] Semantic annotation: Mapping text to ontologies
    Laboratoire d'Informatique de Paris-Nord, CNRS, Université Paris 13, 99, Avenue J-B. Clément, F-93430 Villetaneuse, France
    Int. J. Metadata Semant. Ontol., 2007, 2 : 67 - 78
  • [3] Semantic Annotation in Historical Documents
    Pereira, Juliana Wolf
    Barros Goncalves, Marcelo Rocha
    Prado Santos, Marilde Terezinha
    2017 12TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2017,
  • [4] An annotation tool for semantic documents
    Eriksson, Henrik
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4519 : 759 - 768
  • [5] Automatic Annotation of Bioinformatics Workflows with Biomedical Ontologies
    Garcia-Jimenez, Beatriz
    Wilkinson, Mark D.
    LEVERAGING APPLICATIONS OF FORMAL METHODS, VERIFICATION AND VALIDATION: SPECIALIZED TECHNIQUES AND APPLICATIONS, PT II, 2014, 8803 : 464 - 478
  • [6] PROPOSAL FOR A SEMANTIC ANNOTATION OF GRAPHICAL DOCUMENTS
    Truck, I.
    Archambault, D.
    Leger, L.
    Fenouillet, F.
    Muratet, M.
    Moreau, C.
    Gabriel, G.
    Tromeur, A.
    DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 478 - 483
  • [7] Semantic Annotation and Classification of Mammography Images using Ontologies
    Pereira, Juliana Wolf
    Ribeiro, Marcela Xavier
    2021 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2021, : 378 - 383
  • [8] Semantic Annotation of Documents Based on Wikipedia Concepts
    Brank, Janez
    Leban, Gregor
    Grobelnik, Marko
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2018, 42 (01) : 23 - 32
  • [9] Collaborative semantic annotation of digitized old documents
    El Ouaazizi, Mohamed
    Chenfour, Noureddine
    2012 SECOND INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2012, : 253 - 258
  • [10] Semantic Annotation of Medical Documents in CDA Context
    Monti, Diego
    Morisio, Maurizio
    INFORMATION TECHNOLOGY IN BIO- AND MEDICAL INFORMATICS, 2016, 9832 : 163 - 172