Automatic extraction of reference gene from literature in plants based on texting mining

被引:3
|
作者
He Lin [1 ]
Shen Gengyu [2 ]
Li Fei [3 ]
Huang Shuiqing [1 ]
机构
[1] Nanjing Agr Univ, Dept Informat Management, Nanjing 210095, Jiangsu, Peoples R China
[2] Nanjing Agr Univ, Lib, Nanjing 210095, Jiangsu, Peoples R China
[3] Nanjing Agr Univ, Dept Entomol, Nanjing 210095, Jiangsu, Peoples R China
关键词
biological knowledge discovery; machine learning; NLP; reference gene; text mining; real-time quantitative polymerase chain reaction; bioinformatics; POLYMERASE-CHAIN-REACTION; BIOMEDICAL LITERATURE; EXPRESSION; SELECTION; NORMALIZATION; SYSTEM; TOOL;
D O I
10.1504/IJDMB.2015.070063
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Real-Time Quantitative Polymerase Chain Reaction (qRT-PCR) is widely used in biological research. It is a key to the availability of qRT-PCR experiment to select a stable reference gene. However, selecting an appropriate reference gene usually requires strict biological experiment for verification with high cost in the process of selection. Scientific literatures have accumulated a lot of achievements on the selection of reference gene. Therefore, mining reference genes under specific experiment environments from literatures can provide quite reliable reference genes for similar qRT-PCR experiments with the advantages of reliability, economic and efficiency. An auxiliary reference gene discovery method from literature is proposed in this paper which integrated machine learning, natural language processing and text mining approaches. The validity tests showed that this new method has a better precision and recall on the extraction of reference genes and their environments.
引用
收藏
页码:400 / 416
页数:17
相关论文
共 50 条
  • [1] TarMiner: automatic extraction of miRNA targets from literature
    Tsoupidi, Rodothea-Myrsini
    Kanellos, Ilias
    Vergoulis, Thanasis
    Vlachos, Ioannis S.
    Hatzigeorgiou, Artemis G.
    Dalamagas, Theodore
    PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [2] Text mining for precision medicine: automating disease-mutation relationship extraction from biomedical literature
    Singhal, Ayush
    Simmons, Michael
    Lu, Zhiyong
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (04) : 766 - 772
  • [3] eGIFT: Mining Gene Information from the Literature
    Tudor, Catalina O.
    Schmidt, Carl J.
    Vijay-Shanker, K.
    BMC BIOINFORMATICS, 2010, 11
  • [4] eGIFT: Mining Gene Information from the Literature
    Catalina O Tudor
    Carl J Schmidt
    K Vijay-Shanker
    BMC Bioinformatics, 11
  • [5] Automatic target validation based on neuroscientific literature mining for tractography
    Vasques, Xavier
    Richardet, Renaud
    Hill, Sean L.
    Slater, David
    Chappelier, Jean-Cedric
    Pralong, Etienne
    Bloch, Jocelyne
    Draganski, Bogdan
    Cif, Laura
    FRONTIERS IN NEUROANATOMY, 2015, 9
  • [6] Automatic consistency assurance for literature-based gene ontology annotation
    Chen, Jiyu
    Geard, Nicholas
    Zobel, Justin
    Verspoor, Karin
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [7] Deep Text Mining for Automatic Keyphrase Extraction from Text Documents
    Abulaish, Muhammad
    Jahiruddin
    Dey, Lipika
    JOURNAL OF INTELLIGENT SYSTEMS, 2011, 20 (04) : 327 - 351
  • [8] MeInfoText 2.0: gene methylation and cancer relation extraction from biomedical literature
    Fang, Yu-Ching
    Lai, Po-Ting
    Dai, Hong-Jie
    Hsu, Wen-Lian
    BMC BIOINFORMATICS, 2011, 12
  • [9] Sieve-based relation extraction of gene regulatory networks from biological literature
    Zitnik, Slavko
    Zitnik, Marinka
    Zupan, Blaz
    Bajec, Marko
    BMC BIOINFORMATICS, 2015, 16
  • [10] Mining gene-related information from biomedical literature
    Tudor, Catalina O.
    Vijay-Shanker, K.
    Schmidt, Carl J.
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 335 - 335