Text feature extraction based on deep learning: a review

Cited by: 165
Authors
Liang, Hong [1]
Sun, Xiao [1]
Sun, Yunlei [1]
Gao, Yuan [1]
Affiliations
[1] China Univ Petr East China, Coll Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China
Keywords
Deep learning; Feature extraction; Text characteristic; Natural language processing; Text mining; FEATURE-SELECTION; DIMENSION REDUCTION; NEURAL-NETWORK; CLASSIFICATION; RECOGNITION;
DOI
10.1186/s13638-017-0993-1
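Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code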
0808 ; 0809 ;
Abstract
Selecting text feature items is a basic and important task in text mining and information retrieval. Traditional feature extraction methods require handcrafted features. Hand-designing an effective feature is a lengthy process, whereas deep learning can acquire effective feature representations for new applications directly from training data. As a new feature extraction method, deep learning has made notable achievements in text mining. The major difference between deep learning and conventional methods is that deep learning learns features automatically from big data instead of adopting handcrafted features, which depend mainly on the designers' prior knowledge and can hardly take advantage of big data. Deep learning can automatically learn feature representations from big data, using models with millions of parameters. This paper first outlines the common methods used in text feature extraction, then expands on frequently used deep learning methods in text feature extraction and their applications, and finally forecasts the application of deep learning in feature extraction.
Pages: 12
Related papers
50 items in total
  • [21] Network intrusion detection method based on deep learning feature extraction
    Song Y.
    Hou B.
    Cai Z.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (02) : 115 - 120
  • [22] Feature Extraction for Side Scan Sonar Image Based on Deep Learning
    Tang, Yanghua
    Wang, Hongjian
    Xiao, Yao
    Gao, Wei
    Wang, Zhao
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8416 - 8421
  • [23] Deep Learning for Human Activity Recognition Based on Causality Feature Extraction
    Hwang, Yu Min
    Park, Sangjun
    Lee, Hyung Ok
    Ko, Seok-Kap
    Lee, Byung-Tak
    IEEE ACCESS, 2021, 9 : 112257 - 112275
  • [24] Impact of word embedding models on text analytics in deep learning environment: a review
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 10345 - 10425
  • [25] Deep Learning Algorithms Based Text Classifier
    Venkataraman, Arthi
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 220 - 224
  • [26] Improved Deep Belief Network to Feature Extraction in Chinese Text Classification
    Gao, Jingmin
    Yi, Junkai
    Jia, Wenhao
    Zhao, Xianghui
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 283 - 287
  • [27] Deep Transfer Learning-Based Feature Extraction: An Approach to Improve Nonintrusive Load Monitoring
    Cavalca, Diego L.
    Fernandes, Ricardo A. S.
    IEEE ACCESS, 2021, 9 : 139328 - 139335
  • [28] Identification of Shipborne VHF Radio Based on Deep Learning with Feature Extraction
    Chen, Liang
    Liu, Jiayu
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (05)
  • [29] Flow feature extraction models based on deep learning
    Zhan Qing-Liang
    Ge Yao-Jun
    Bai Chun-Jin
    ACTA PHYSICA SINICA, 2022, 71 (07)
  • [30] Hyperspectral Data Feature Extraction Using Deep Learning Hybrid Model
    Jiang, Xinhua
    Xue, Heru
    Zhang, Lina
    Gao, Xiaojing
    Zhou, Yanqing
    Bai, Jie
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (04) : 3529 - 3543