Deep learning-based mineral exploration named entity recognition: A case study of granitic pegmatite-type lithium deposits

被引:0
|
作者
Tao, Jintao [1 ,2 ,3 ]
Zhang, Nannan [1 ,2 ,3 ]
Chang, Jinyu [1 ,2 ,3 ]
Chen, Li [1 ,2 ,3 ]
Zhang, Hao [1 ,2 ,3 ]
Liao, Shibin [1 ,2 ,3 ]
Li, Siyuan [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Xinjiang Inst Ecol & Geog, State Key Lab Desert & Oasis Ecol, Key Lab Ecol Secur & Sustainable Dev Arid Areas, Urumqi 830011, Peoples R China
[2] Xinjiang Key Lab Mineral Resources & Digital Geol, Urumqi 830011, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
Named entity recognition; Geological text; Mineral exploration information extraction; Lithium deposit; Deep learning; GEOSCIENCE; EXTRACTION;
D O I
10.1016/j.oregeorev.2024.106367
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Geological text data play a crucial role as sources of geological information and knowledge for mineral exploration. Mineral exploration involves predicting and detecting mineral resources using geological, geochemical, geophysical, and remote sensing data. However, existing named entity recognition studies on mineral deposits have mainly focused on geological environments and mineral deposit models, which are insufficient for capturing the extensive knowledge essential for mineral exploration and supporting subsequent exploration efforts. This paper presents an efficient workflow for automatically extracting mineral exploration information from unstructured geological text data using a deep learning method. Initially, 21 entity types were identified based on a conceptual prospecting model of granitic pegmatite-type lithium deposits. A mineral exploration corpus was constructed from Chinese geological literature and reports, comprising 3,386 sentences and 13,167 entities. Subsequently, a Mineral Exploration Named Entity Recognition (MENER) model is proposed to extract mineral exploration information. This model integrates entity-type enhanced characters, words, and contextual features to enhance the performance. Bidirectional encoder representations from the transformer model were employed to obtain character embeddings of the input text. Mineral exploration entity types provide external knowledge, aiding the understanding of entity semantics within sentences through multi-head attention. Convolutional neural networks and bidirectional long short-term memory models have been employed to extract word and contextual features and capture additional structural information. Geological entity nomenclature and expressions follow certain default conventions and paradigms. A boundary prediction classifier was introduced to identify the head and tail characteristics of geological entities. A conditional random field was then utilized to classify the entities. The MENER model achieved an average F1-score of 79.69% on the constructed dataset. Finally, a geological document was selected as a case study to demonstrate the effectiveness of the proposed model. The workflow outlined in this study enables the rapid and robust extraction of specific information and knowledge mining from geological text data, with potential applications across various domains.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Language model based on deep learning network for biomedical named entity recognition
    Hou, Guan
    Jian, Yuhao
    Zhao, Qingqing
    Quan, Xiongwen
    Zhang, Han
    METHODS, 2024, 226 : 71 - 77
  • [22] A Method of Network Attack Named Entity Recognition based on Deep Active Learning
    Wang, Li
    Ma, Yunxiao
    Li, Mingyue
    Li, Hua
    Zhang, Peilong
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 376 - 387
  • [23] ML-CNN: a novel deep learning based disease named entity recognition architecture
    Zhao, Zhehuan
    Yang, Zhihao
    Luo, Ling
    Zhang, Yin
    Wang, Lei
    Lin, Hongfei
    Wang, Jian
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 794 - 794
  • [24] An imConvNet-based deep learning model for Chinese medical named entity recognition
    Zheng, Yuchen
    Han, Zhenggong
    Cai, Yimin
    Duan, Xubo
    Sun, Jiangling
    Yang, Wei
    Huang, Haisong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [25] An imConvNet-based deep learning model for Chinese medical named entity recognition
    Yuchen Zheng
    Zhenggong Han
    Yimin Cai
    Xubo Duan
    Jiangling Sun
    Wei Yang
    Haisong Huang
    BMC Medical Informatics and Decision Making, 22
  • [26] Deep learning-based modulation recognition with constellation diagram: A case study
    Leblebici, Merih
    Calhan, Ali
    Cicioglu, Murtaza
    PHYSICAL COMMUNICATION, 2024, 63
  • [27] Electronic Medical Record Recommendation System Based on Deep Embedding Learning with Named Entity Recognition
    Zheng, Yuqian
    Yan, Xu
    Cao, Xin
    Ai, Chunhui
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 298 - 309
  • [28] A Multiclass Classification Method Based on Deep Learning for Named Entity Recognition in Electronic Medical Records
    Dong, Xishuang
    Qian, Lijun
    Guan, Yi
    Huang, Lei
    Yu, Qiubin
    Yang, Jinfeng
    2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2016,
  • [29] Deep Learning-Based Text Entity Recognition Method for Distribution Network Operation and Maintenance
    Gao, Yongmin
    Kang, Bing
    Zhao, Tiancheng
    Xiao, Hui
    Li, Jiashuai
    Xu, Zhihao
    Ding, Guili
    Wang, Zongyao
    2022 9TH INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION, IFEEA, 2022, : 1096 - 1100
  • [30] Lexicon and attention-based named entity recognition for kiwifruit diseases and pests: A Deep learning approach
    Zhang, Lilin
    Nie, Xiaolin
    Zhang, Mingmei
    Gu, Mingyang
    Geissen, Violette
    Ritsema, Coen J.
    Niu, Dangdang
    Zhang, Hongming
    FRONTIERS IN PLANT SCIENCE, 2022, 13