Text feature extraction based on deep learning: a review

被引:176
作者
Liang, Hong [1 ]
Sun, Xiao [1 ]
Sun, Yunlei [1 ]
Gao, Yuan [1 ]
机构
[1] China Univ Petr East China, Coll Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China
关键词
Deep learning; Feature extraction; Text characteristic; Natural language processing; Text mining; FEATURE-SELECTION; DIMENSION REDUCTION; NEURAL-NETWORK; CLASSIFICATION; RECOGNITION;
D O I
10.1186/s13638-017-0993-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquire new effective feature representation from training data. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data, instead of adopting handcrafted features, which mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data. Deep learning can automatically learn feature representation from big data, including millions of parameters. This thesis outlines the common methods used in text feature extraction first, and then expands frequently used deep learning methods in text feature extraction and its applications, and forecasts the application of deep learning in feature extraction.
引用
收藏
页数:12
相关论文
共 50 条
[41]   Research on Feature Extraction and Multimodal Fusion of Video Caption Based on Deep Learning [J].
Chen, Hongjun ;
Li, Hengyi ;
Wu, Xueqin .
2020 THE 4TH INTERNATIONAL CONFERENCE ON MANAGEMENT ENGINEERING, SOFTWARE ENGINEERING AND SERVICE SCIENCES (ICMSS 2020), 2020, :73-76
[42]   Ensemble classification for intrusion detection via feature extraction based on deep Learning [J].
Maryam Yousefnezhad ;
Javad Hamidzadeh ;
Mohammad Aliannejadi .
Soft Computing, 2021, 25 :12667-12683
[43]   Visualization of Driving Behavior Based on Hidden Feature Extraction by Using Deep Learning [J].
Liu, HaiLong ;
Taniguchi, Tadahiro ;
Tanaka, Yusuke ;
Takenaka, Kazuhito ;
Bando, Takashi .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (09) :2477-2489
[44]   Deep learning approaches to scene text detection: a comprehensive review [J].
Khan, Tauseef ;
Sarkar, Ram ;
Mollah, Ayatullah Faruk .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) :3239-3298
[45]   Healthcare data analysis by feature extraction and classification using deep learning with cloud based cyber security [J].
Qamar, Shamimul .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
[46]   Unified feature extraction framework based on contrastive learning [J].
Zhang, Hongjie ;
Qiang, Wenwen ;
Zhang, Jinxin ;
Chen, Yingyi ;
Jing, Ling .
KNOWLEDGE-BASED SYSTEMS, 2022, 258
[47]   Feature Extraction of Dialogue Text Based on Big Data and Machine Learning [J].
Liu X. ;
Zhang H. ;
Cheng Y. .
International Journal of Web-Based Learning and Teaching Technologies, 2024, 19 (01)
[48]   Text Classification of Mixed Model Based on Deep Learning [J].
Lee, Sang-Hwa .
TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2023, 17 (03) :367-374
[49]   Image and Text Correlation Judgement Based on Deep Learning [J].
Liu, Yinyang ;
Xu, Xiaobin ;
Li, Feixiang .
PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, :844-847
[50]   Feature Extraction of Hyperspectral Images Based on Deep Boltzmann Machine [J].
Yang, Jiangong ;
Guo, Yanhui ;
Wang, Xili .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (06) :1077-1081