Scientific Literature Information Extraction Using Text Mining Techniques for Human Health Risk Assessment of Electromagnetic Fields

被引:2
作者
Lee, Sang-Woo [1 ]
Kwon, Jung-Hyok [2 ]
Lee, Ben [3 ]
Kim, Eui-Jik [1 ]
机构
[1] Hallym Univ, Sch Software, 1 Hallymdaehak Gil, Chunchon 24252, Gangwon Do, South Korea
[2] Hallym Univ, Smart Comp Lab, 1 Hallymdaehak Gil, Chunchon 24252, Gangwon Do, South Korea
[3] Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331 USA
关键词
EMF exposure; information extraction; text mining; scientific literature;
D O I
10.18494/SAM.2020.2572
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
This paper presents a scientific literature information extraction architecture using text mining techniques to assess the human health risk of electromagnetic fields (EMFs) generated by wireless sensor devices in Internet of Things (IoT). The proposed architecture uses three text mining techniques to extract three types of information-purpose statement, research category, and source of EMF exposure-from the scientific literature to help researchers assess the human health risk of EMFs. For the purpose statement, a representative sentence expressing the authors' intentions and purposes was extracted from the abstract text of the articles through processes of candidate sentence selection, topic lexicon creation, and weighting. For the research category, the articles were classified into three study types-epidemiological, animal experimental, and cell experimental-using a weighting process based on the predefined feature lexicon of each category. Finally, all words representing frequency bands included in the abstract text of the articles were extracted to identify the source of EMF exposure. The aforementioned text mining techniques were used to extract the information from 100 scientific articles and the performance of this architecture was proved through expert verification. The experimental results show that the proposed architecture can extract the desired information to assess the human health risk of EMFs from the scientific literature with high accuracy.
引用
收藏
页码:149 / 157
页数:9
相关论文
共 8 条
[1]  
Denventer E. V., 2011, BIOELECTROMAGNETICS, V32, P417, DOI [10.1002/bem.20660, DOI 10.1002/BEM.20660]
[2]   Review of Studies Concerning Electromagnetic Field (EMF) Exposure Assessment in Europe: Low Frequency Fields (50 Hz-100 kHz) [J].
Gajsek, Peter ;
Ravazzani, Paolo ;
Grellier, James ;
Samaras, Theodoros ;
Bakos, Jozsef ;
Thuroczy, Gyorgy .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2016, 13 (09)
[3]   WormBase 2017: molting into a new stage [J].
Lee, Raymond Y. N. ;
Howe, Kevin L. ;
Harris, Todd W. ;
Arnaboldi, Valerio ;
Cain, Scott ;
Chan, Juancarlos ;
Chen, Wen J. ;
Davis, Paul ;
Gao, Sibyl ;
Grove, Christian ;
Kishore, Ranjana ;
Muller, Hans-Michael ;
Nakamura, Cecilia ;
Nuin, Paulo ;
Paulini, Michael ;
Raciti, Daniela ;
Rodgers, Faye ;
Russell, Matt ;
Schindelman, Gary ;
Tuli, Mary Ann ;
Van Auken, Kimberly ;
Wang, Qinghua ;
Williams, Gary ;
Wright, Adam ;
Yook, Karen ;
Berriman, Matthew ;
Kersey, Paul ;
Schedl, Tim ;
Stein, Lincoln ;
Sternberg, Paul W. .
NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) :D869-D874
[4]  
Loper E., 2002, arXiv
[5]   Textpresso Central: a customizable platform for searching, text mining, viewing, and curating biomedical literature [J].
Muller, H. -M. ;
Van Auken, K. M. ;
Li, Y. ;
Sternberg, P. W. .
BMC BIOINFORMATICS, 2018, 19
[6]   Radiofrequency electromagnetic field exposure in everyday microenvironments in Europe: A systematic literature review [J].
Sagar, Sanjay ;
Dongus, Stefan ;
Schoeni, Anna ;
Roser, Katharina ;
Eeftens, Marloes ;
Struchen, Benjamin ;
Foerster, Milena ;
Meier, Noemi ;
Adem, Seid ;
Roosli, Martin .
JOURNAL OF EXPOSURE SCIENCE AND ENVIRONMENTAL EPIDEMIOLOGY, 2018, 28 (02) :147-160
[7]   BioReader: a text mining tool for performing classification of biomedical literature [J].
Simon, Christian ;
Davidsen, Kristian ;
Hansen, Christina ;
Seymour, Emily ;
Barnkob, Mike Bogetofte ;
Olsen, Lars Ronn .
BMC BIOINFORMATICS, 2019, 19 (Suppl 13)
[8]   PubTator: a web-based text mining tool for assisting biocuration [J].
Wei, Chih-Hsuan ;
Kao, Hung-Yu ;
Lu, Zhiyong .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W518-W522