Deep learning-based mineral exploration named entity recognition: A case study of granitic pegmatite-type lithium deposits

被引:0
|
作者
Tao, Jintao [1 ,2 ,3 ]
Zhang, Nannan [1 ,2 ,3 ]
Chang, Jinyu [1 ,2 ,3 ]
Chen, Li [1 ,2 ,3 ]
Zhang, Hao [1 ,2 ,3 ]
Liao, Shibin [1 ,2 ,3 ]
Li, Siyuan [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Xinjiang Inst Ecol & Geog, State Key Lab Desert & Oasis Ecol, Key Lab Ecol Secur & Sustainable Dev Arid Areas, Urumqi 830011, Peoples R China
[2] Xinjiang Key Lab Mineral Resources & Digital Geol, Urumqi 830011, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Named entity recognition; Geological text; Mineral exploration information extraction; Lithium deposit; Deep learning; GEOSCIENCE; EXTRACTION;
D O I
10.1016/j.oregeorev.2024.106367
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Geological text data play a crucial role as sources of geological information and knowledge for mineral exploration. Mineral exploration involves predicting and detecting mineral resources using geological, geochemical, geophysical, and remote sensing data. However, existing named entity recognition studies on mineral deposits have mainly focused on geological environments and mineral deposit models, which are insufficient for capturing the extensive knowledge essential for mineral exploration and supporting subsequent exploration efforts. This paper presents an efficient workflow for automatically extracting mineral exploration information from unstructured geological text data using a deep learning method. Initially, 21 entity types were identified based on a conceptual prospecting model of granitic pegmatite-type lithium deposits. A mineral exploration corpus was constructed from Chinese geological literature and reports, comprising 3,386 sentences and 13,167 entities. Subsequently, a Mineral Exploration Named Entity Recognition (MENER) model is proposed to extract mineral exploration information. This model integrates entity-type enhanced characters, words, and contextual features to enhance the performance. Bidirectional encoder representations from the transformer model were employed to obtain character embeddings of the input text. Mineral exploration entity types provide external knowledge, aiding the understanding of entity semantics within sentences through multi-head attention. Convolutional neural networks and bidirectional long short-term memory models have been employed to extract word and contextual features and capture additional structural information. Geological entity nomenclature and expressions follow certain default conventions and paradigms. A boundary prediction classifier was introduced to identify the head and tail characteristics of geological entities. A conditional random field was then utilized to classify the entities. The MENER model achieved an average F1-score of 79.69% on the constructed dataset. Finally, a geological document was selected as a case study to demonstrate the effectiveness of the proposed model. The workflow outlined in this study enables the rapid and robust extraction of specific information and knowledge mining from geological text data, with potential applications across various domains.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] A geological knowledge-constrained entity and relation extraction method for text: A case study of granitic pegmatite-type lithium deposits
    Tao, Jintao
    Zhang, Nannan
    Chang, Jinyu
    Chen, Li
    Zhang, Hao
    Liao, Shibin
    Li, Siyuan
    Jing, Jianpeng
    COMPUTERS & GEOSCIENCES, 2025, 200
  • [2] Progress in geological study of pegmatite-type lithium deposits in the world
    Chen Y.
    Xue L.
    Wang X.
    Zhao Z.
    Han J.
    Zhou K.
    Dizhi Xuebao/Acta Geologica Sinica, 2021, 95 (10): : 2971 - 2995
  • [3] Named entity recognition based on deep learning
    Ji Z.
    Kong D.
    Liu W.
    Dong W.
    Sang Y.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2022, 28 (06): : 1603 - 1615
  • [4] Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards
    Fan, Runyu
    Wang, Lizhe
    Yan, Jining
    Song, Weijing
    Zhu, Yingqian
    Chen, Xiaodao
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (01)
  • [5] Supporting Deep Learning-Based Named Entity Recognition Using Cloud Resource Management
    Hartmann, Benedict
    Tamla, Philippe
    Hemmje, Matthias
    HCI INTERNATIONAL 2023 LATE BREAKING PAPERS, HCII 2023, PT VI, 2023, 14059 : 84 - 100
  • [6] A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity
    Dasgupta, Soham
    Piplai, Aritran
    Kotal, Anantaa
    Joshi, Anupam
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2596 - 2604
  • [7] Military Named Entity Recognition Method Based on Deep Learning
    Wang, Xuefeng
    Yang, Ruopeng
    Lu, Yiwei
    Wu, Qingfeng
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 479 - 483
  • [8] Application of named entity recognition on tweets during earthquake disaster: a deep learning-based approach
    Eliguzel, Nazmiye
    Cetinkaya, Cihan
    Dereli, Turkay
    SOFT COMPUTING, 2022, 26 (01) : 395 - 421
  • [9] Application of named entity recognition on tweets during earthquake disaster: a deep learning-based approach
    Nazmiye Eligüzel
    Cihan Çetinkaya
    Türkay Dereli
    Soft Computing, 2022, 26 : 395 - 421
  • [10] A Comparative Study of Dictionary-based and Machine Learning-based Named Entity Recognition in Pashto
    Momand, Rafiullah
    Waseeb, Shakirullah
    Rai, Ahmad Masood Latif
    2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 96 - 101