Graphing Ecotoxicology: The MAGIC Graph for Linking Environmental Data on Chemicals

被引:5
作者
Bub, Sascha [1 ]
Wolfram, Jakob [1 ]
Stehle, Sebastian [1 ,2 ]
Petschick, Lara L. [1 ]
Schulz, Ralf [1 ]
机构
[1] Univ Koblenz Landau, Inst Environm Sci, D-76829 Landau, Germany
[2] Univ Koblenz Landau, Eusserthal Ecosyst Res Stn, D-76857 Eusserthal, Germany
关键词
ecotoxicology; graph database; environmental data; data harmonization; chemical use types; organic contaminants; synonyms; nomenclature; specificity;
D O I
10.3390/data4010034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Assessing the impact of chemicals on the environment and addressing subsequent issues are two central challenges to their safe use. Environmental data are continuously expanding, requiring flexible, scalable, and extendable data management solutions that can harmonize multiple data sources with potentially differing nomenclatures or levels of specificity. Here, we present the methodological steps taken to construct a rule-based labeled property graph database, the "Meta-analysis of the Global Impact of Chemicals" (MAGIC) graph, for potential environmental impact chemicals (PEIC) and its subsequent application harmonizing multiple large-scale databases. The resulting data encompass 16,739 unique PEICs attributed to their corresponding chemical class, stereo-chemical information, valid synonyms, use types, unique identifiers (e.g., Chemical Abstract Service registry number CAS RN), and others. These data provide researchers with additional chemical information for a large amount of PEICs and can also be publicly accessed using a web interface. Our analysis has shown that data harmonization can increase up to 98% when using the MAGIC graph approach compared to relational data systems for datasets with different nomenclatures. The graph database system and its data appear more suitable for large-scale analysis where traditional (i.e., relational) data systems are reaching conceptional limitations.
引用
收藏
页数:17
相关论文
共 24 条
[1]  
[Anonymous], 2013, P JOINT EDBT ICDT 20, DOI DOI 10.1145/2457317.2457351
[2]  
[Anonymous], 2012, Int J Comput Appl
[3]  
[Anonymous], 1978, J Chem Inf Comput Sci, P58, DOI [10.1021/ci60013a609, DOI 10.1021/CI60013A609]
[4]  
Baker N.T., 2016, US GEOLOGICAL SURVEY
[5]  
Batra S., 2012, International Journal of Soft Computing and Engineering (IJSCE), V2, P509
[6]  
Beck K., 2003, TEST DRIVEN DEV EXAM
[7]  
Constantinov C., 2015, ANN U CRAIOVA, V12, P38
[8]  
European Food Safety Authority, OP
[9]   InChI - the worldwide chemical structure identifier standard [J].
Heller, Stephen ;
McNaught, Alan ;
Stein, Stephen ;
Tchekhovskoi, Dmitrii ;
Pletnev, Igor .
JOURNAL OF CHEMINFORMATICS, 2013, 5
[10]  
Lakshman Avinash, 2010, Operating Systems Review, V44, P35, DOI 10.1145/1773912.1773922