Large⁃scale semantic text overlapping region retrieval based on deep learning

被引:0
作者
Dong L.-L. [1 ]
Yang D. [1 ]
Zhang X. [1 ]
机构
[1] School of Information and Control Engineering, Xi'an University of Architecture and Technology, Xi'an
来源
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition) | 2021年 / 51卷 / 05期
关键词
Deep confidence network; Deep learning; Feature learning; Overlapping region retrieval; Semantic text;
D O I
10.13229/j.cnki.jdxbgxb20200370
中图分类号
学科分类号
摘要
As a hot topic in natural language processing, overlapping region recognition needs to be further explored and studied. Aiming at the problem of poor accuracy and recall in traditional text overlapping region retrieval methods, a large-scale semantic text overlapping region retrieval method based on deep learning is proposed. Combined with sparse automatic encoder and depth confidence network, a hybrid model is constructed. According to the hybrid model, a text classifier is designed and constructed. The main components of the classifier are text preprocessing, feature learning and classification retrieval. In this paper, a series of preprocessing, such as de-noising, word segmentation and stop word removal, are carried out. Finally, softmax regression is used to realize text classification, and the learned text features are used as the input of the classifier to get the classification and retrieval results of the overlapping regions. The experimental results show that the accuracy and recall of the method are both high, showing reliability and robustness. © 2021, Jilin University Press. All right reserved.
引用
收藏
页码:1817 / 1822
页数:5
相关论文
共 10 条
[1]  
ZhangQian -qian, TianXue -dong, YangFang, Et al., Integration retrieval model based on transformation of mathematical text and expression, Computer Engineering, 45, 3, pp. 175-181, (2019)
[2]  
CheXiang -jiu, WangLi, GuoXiao -xin, Improved boundary detection based on multi-scale cues fusion, Journal of Jilin University(Engineering and Technology Edition), 48, 5, pp. 1621-1628, (2018)
[3]  
LinZe -qi, ZouYan -zhen, ZhaoJun -feng, Et al., Software text semantic search approach based on code structure knowledge, Journal of Software, 30, 12, pp. 3714-3729, (2019)
[4]  
WangGui -fang, YangMei -ni, Et al., Construction of precise search queries based on word embedding, Modern Information, 38, 11, pp. 55-58, (2018)
[5]  
SunXiao -gang, JiangYao -gang, Et al., Live detection algorithm based on semantic segmentation, Journal of Jilin University(Engineering and Technology Edition), 50, 3, pp. 281-287, (2020)
[6]  
WuXi, YuNeng -hai, ZhangWei -ming, Et al., An improved multi-keyword fuzzy search scheme based on BloomFilter over encrypted text, Control and Decision, 34, 1, pp. 97-104, (2019)
[7]  
LiZhi -yi, HuangZi -feng, XuXiao -mian, A review of the cross-modal retrieval model and feature extraction based on representation learning, Journal of the China Society for Scientific and Technical Information, 37, 4, pp. 86-99, (2018)
[8]  
HanLei, Design of animation material automatic retrieval system based on text driven, Modern Electronics Technique, 41, 24, pp. 177-179, (2018)
[9]  
LiuFeng, LiHong -hui, Et al., Overlapping community detection algorithm by label propagation using PageRank and node clustering coefficients, Journal of National University of Defense Technology, 41, 1, pp. 186-193, (2019)
[10]  
JiaHua -ding, XiongYu -ning, Approximate neighbors selection method for mobile user based on services similarity, Computer Engineering, 44, 5, pp. 168-173, (2018)