Unsupervised and self-supervised deep learning approaches for biomedical text mining

Cited by: 42
Authors
Nadif, Mohamed [1]
Role, Francois [1]
Affiliations
[1] Univ Paris, CNRS, Ctr Borelli, F-75006 Paris, France
Keywords
unsupervised learning; self-supervised learning; deep learning; text mining
DOI
10.1093/bib/bbab016
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Biomedical scientific literature is growing at a very rapid pace, which makes it increasingly difficult for human experts to spot the most relevant results hidden in the papers. Automated information extraction tools based on text mining techniques are therefore needed to assist them in this task. In the last few years, techniques based on deep neural networks have contributed significantly to advancing the state of the art in this research area. Although the contribution of supervised methods to this progress is relatively well known, the contribution of other kinds of learning, namely unsupervised and self-supervised learning, is less so. Unsupervised learning does not incur the cost of creating labels, which is very useful in the exploratory stages of a biomedical study, where agile techniques are needed to rapidly explore many paths. In particular, clustering techniques applied to biomedical text mining make it possible to gather large sets of documents into more manageable groups. Deep learning techniques have made it possible to produce new clustering-friendly representations of the data. Self-supervised learning, on the other hand, is a kind of supervised learning in which the labels do not have to be created manually by humans but are automatically derived from relations found in the input texts. In combination with innovative network architectures (e.g. transformer-based architectures), self-supervised techniques have made it possible to design increasingly effective vector-based word representations (word embeddings). We show in this survey how word representations obtained in this way interact successfully with common supervised modules (e.g. classification networks), to whose performance they greatly contribute.
Pages: 1592-1602
Number of pages: 11