Deep learning-based mineral exploration named entity recognition: A case study of granitic pegmatite-type lithium deposits

被引:0
|
作者
Tao, Jintao [1 ,2 ,3 ]
Zhang, Nannan [1 ,2 ,3 ]
Chang, Jinyu [1 ,2 ,3 ]
Chen, Li [1 ,2 ,3 ]
Zhang, Hao [1 ,2 ,3 ]
Liao, Shibin [1 ,2 ,3 ]
Li, Siyuan [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Xinjiang Inst Ecol & Geog, State Key Lab Desert & Oasis Ecol, Key Lab Ecol Secur & Sustainable Dev Arid Areas, Urumqi 830011, Peoples R China
[2] Xinjiang Key Lab Mineral Resources & Digital Geol, Urumqi 830011, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Named entity recognition; Geological text; Mineral exploration information extraction; Lithium deposit; Deep learning; GEOSCIENCE; EXTRACTION;
D O I
10.1016/j.oregeorev.2024.106367
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Geological text data play a crucial role as sources of geological information and knowledge for mineral exploration. Mineral exploration involves predicting and detecting mineral resources using geological, geochemical, geophysical, and remote sensing data. However, existing named entity recognition studies on mineral deposits have mainly focused on geological environments and mineral deposit models, which are insufficient for capturing the extensive knowledge essential for mineral exploration and supporting subsequent exploration efforts. This paper presents an efficient workflow for automatically extracting mineral exploration information from unstructured geological text data using a deep learning method. Initially, 21 entity types were identified based on a conceptual prospecting model of granitic pegmatite-type lithium deposits. A mineral exploration corpus was constructed from Chinese geological literature and reports, comprising 3,386 sentences and 13,167 entities. Subsequently, a Mineral Exploration Named Entity Recognition (MENER) model is proposed to extract mineral exploration information. This model integrates entity-type enhanced characters, words, and contextual features to enhance the performance. Bidirectional encoder representations from the transformer model were employed to obtain character embeddings of the input text. Mineral exploration entity types provide external knowledge, aiding the understanding of entity semantics within sentences through multi-head attention. Convolutional neural networks and bidirectional long short-term memory models have been employed to extract word and contextual features and capture additional structural information. Geological entity nomenclature and expressions follow certain default conventions and paradigms. A boundary prediction classifier was introduced to identify the head and tail characteristics of geological entities. A conditional random field was then utilized to classify the entities. The MENER model achieved an average F1-score of 79.69% on the constructed dataset. Finally, a geological document was selected as a case study to demonstrate the effectiveness of the proposed model. The workflow outlined in this study enables the rapid and robust extraction of specific information and knowledge mining from geological text data, with potential applications across various domains.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] A Deep Learning Based Approach for Biomedical Named Entity Recognition Using Multitasking Transfer Learning with BiLSTM, BERT and CRF
    Pooja H.
    Jagadeesh M.P.P.
    SN Computer Science, 5 (5)
  • [32] Research on named entity recognition in the field of CNC machine tool design based on deep learning Knowledge map of mechanical field
    Zhang, Shuai
    Guan, Yanzhi
    Gu, Zhongyu
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 257 - 262
  • [33] Deep learning-based weathering type recognition in historical stone monuments
    Hatir, Mehmet Ergun
    Barstugan, Mucahit
    Ince, Ismail
    JOURNAL OF CULTURAL HERITAGE, 2020, 45 : 193 - 203
  • [34] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Li, Luqi
    Zhao, Jie
    Hou, Li
    Zhai, Yunkai
    Shi, Jinming
    Cui, Fangfang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [35] An attention-based deep learning model for clinical named entity recognition of Chinese electronic medical records
    Luqi Li
    Jie Zhao
    Li Hou
    Yunkai Zhai
    Jinming Shi
    Fangfang Cui
    BMC Medical Informatics and Decision Making, 19
  • [36] Deep Learning-Based Video Coding: A Review and a Case Study
    Liu, Dong
    Li, Yue
    Lin, Jianping
    Li, Houqiang
    Wu, Feng
    ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [37] A Critical Study of Recent Deep Learning-Based Continuous Sign Language Recognition
    Hanan A. Taher
    Subhi R. M. Zeebaree
    The Review of Socionetwork Strategies, 2025, 19 (1) : 131 - 161
  • [38] An Effective Biomedical Named Entity Recognition by Handling Imbalanced Data Sets Using Deep Learning and Rule-Based Methods
    Archana S.M.
    Prakash J.
    Singh P.K.
    Ahmed W.
    SN Computer Science, 4 (5)
  • [39] A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry
    Carmelo Z Macri
    Sheng Chieh Teoh
    Stephen Bacchi
    Ian Tan
    Robert Casson
    Michelle T Sun
    Dinesh Selva
    WengOnn Chan
    Graefe's Archive for Clinical and Experimental Ophthalmology, 2023, 261 : 3335 - 3344
  • [40] A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry
    Macri, Carmelo Z.
    Teoh, Sheng Chieh
    Bacchi, Stephen
    Tan, Ian
    Casson, Robert
    Sun, Michelle T.
    Selva, Dinesh
    Chan, WengOnn
    GRAEFES ARCHIVE FOR CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2023, 261 (11) : 3335 - 3344