Text feature extraction based on deep learning: a review

被引:166
作者
Liang, Hong [1 ]
Sun, Xiao [1 ]
Sun, Yunlei [1 ]
Gao, Yuan [1 ]
机构
[1] China Univ Petr East China, Coll Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China
关键词
Deep learning; Feature extraction; Text characteristic; Natural language processing; Text mining; FEATURE-SELECTION; DIMENSION REDUCTION; NEURAL-NETWORK; CLASSIFICATION; RECOGNITION;
D O I
10.1186/s13638-017-0993-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquire new effective feature representation from training data. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data, instead of adopting handcrafted features, which mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data. Deep learning can automatically learn feature representation from big data, including millions of parameters. This thesis outlines the common methods used in text feature extraction first, and then expands frequently used deep learning methods in text feature extraction and its applications, and forecasts the application of deep learning in feature extraction.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Comprehensive Review of Feature Extraction Techniques for sEMG Signal Classification: From Handcrafted Features to Deep Learning Approaches
    Sid'El Moctar, Sidi Mohamed
    Rida, Imad
    Boudaoud, Sofiane
    [J]. IRBM, 2024, 45 (06)
  • [32] Active Deep Feature Extraction for Hyperspectral Image Classification Based on Adversarial Learning
    Wang, Xue
    Tan, Kun
    Pan, Cen
    Ding, Jianwei
    Liu, Zhaoxian
    Han, Bo
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] Deep Learning Based Cost Constraint Algorithm for Intrusion Detection Feature Extraction
    Liu, Yun
    Zheng, Wenfeng
    Zhang, Yi
    [J]. 2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 520 - 526
  • [34] Ensemble classification for intrusion detection via feature extraction based on deep Learning
    Yousefnezhad, Maryam
    Hamidzadeh, Javad
    Aliannejadi, Mohammad
    [J]. SOFT COMPUTING, 2021, 25 (20) : 12667 - 12683
  • [35] A Deep Learning-Based Feature Extraction Framework for System Security Assessment
    Sun, Mingyang
    Konstantelos, Ioannis
    Strbac, Goran
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (05) : 5007 - 5020
  • [36] Polarimetric SAR Feature Extraction With Neighborhood Preservation-Based Deep Learning
    Liu, Hongying
    Yang, Shuyuan
    Gou, Shuiping
    Zhu, Dexiang
    Wang, Rongfang
    Jiao, Licheng
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (04) : 1456 - 1466
  • [37] A Novel and Efficient Feature Extraction Method for Deep Learning Based Continuous Estimation
    Ma, Chenfei
    Guo, Weiyu
    Zhang, Hang
    Samuel, Oluwarotimi Williams
    Ji, Xiaopeng
    Xu, Lisheng
    Li, Guanglin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04): : 7341 - 7348
  • [38] Music Feature Recognition and Classification Using a Deep Learning Algorithm
    Xu, Lihong
    Zhang, Shenghuan
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2023, 22 (03)
  • [39] Interference Signal Feature Extraction and Pattern Classification Algorithm Based on Deep Learning
    Qin, Jiangyi
    Zhang, Fei
    Wang, Kai
    Zuo, Yuan
    Deng, Chenxi
    [J]. ELECTRONICS, 2022, 11 (14)
  • [40] PSDRNN: An Efficient and Effective HAR Scheme Based on Feature Extraction and Deep Learning
    Li, Xiao
    Wang, Yufeng
    Zhang, Bo
    Ma, Jianhua
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (10) : 6703 - 6713