Deep Learning Based Semantic Similarity Detection Using Text Data

被引:14
|
作者
Mansoor, Muhammad [1 ]
Rehman, Zahoor Ur [1 ]
Shaheen, Muhammad [2 ]
Khan, Muhammad Attique [3 ]
Habib, Mohamed [4 ,5 ]
机构
[1] COMSATS Univ Islamabad, Comp Sci Dept, Attock Campus, Islamabad, Pakistan
[2] Fdn Univ Islamabad, Fac Engn & IT, Islamabad, Pakistan
[3] HITEC Univ Taxila, Dept Comp Sci, Taxila, Pakistan
[4] Saudi Elect Univ, Coll Comp & Informat, Riyadh, Saudi Arabia
[5] Port Said Univ, Fac Engn, Port Fuad City, Egypt
来源
INFORMATION TECHNOLOGY AND CONTROL | 2020年 / 49卷 / 04期
关键词
Deep Learning; Semantics; Similarity; Quora; question duplication; LSTM and CNN; CONTRAST ENHANCEMENT; NEURAL-NETWORK; RECOGNITION; SELECTION; MODEL;
D O I
10.5755/j01.itc.49.4.27118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Similarity detection in the text is the main task for a number of Natural Language Processing (NLP) applications. As textual data are comparatively large in quantity and in volume than the numeric data, measuring textual similarity is one of the important problems. Most of the similarity detection algorithms are based upon word to word matching, sentence/paragraph matching, and matching of the whole document. In this research, a novel approach is proposed using deep learning models, combining Long Short-Term Memory Network (LSTM) with Convolutional Neural Network (CNN) for measuring semantics similarity between two questions. The proposed model takes sentence pairs as input to measure the similarity between them. The model is tested on publicly available Quora's dataset. In comparison to the existing techniques gave 87.50 % accuracy which is better than the previous approaches.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 50 条
  • [31] Sherlock: A Deep Learning Approach to Semantic Data Type Detection
    Hulsebos, Madelon
    Hu, Kevin
    Bakker, Michiel
    Zgraggen, Emanuel
    Satyanarayan, Arvind
    Kraska, Tim
    Demiralp, Cagatay
    Hidalgo, Cesar
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1500 - 1508
  • [32] SEMANTIC EDGE DETECTION BASED ON DEEP METRIC LEARNING
    Cai, Shulian
    Huang, Jiabin
    Ding, Xinghao
    Zeng, Delu
    2017 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2017), 2017, : 707 - 712
  • [33] Semantic Representation Based on Deep Learning for Spam Detection
    Saidani, Nadjate
    Adi, Kamel
    Allili, Mohand Said
    FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2019, 2020, 12056 : 72 - 81
  • [34] Semantic detection for tabular data in text
    Alrashed, SA
    Gray, WA
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS: I, 2003, : 209 - 214
  • [35] Abstractive text summarization based on deep learning and semantic content generalization
    Kouris, Panagiotis
    Alexandridis, Georgios
    Stafylopatis, Andreas
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5082 - 5092
  • [36] MHDT: A Deep-Learning-Based Text Detection Algorithm for Unstructured Data in Banking
    Ma, Shenglan
    Yang, Lingling
    Wang, Hao
    Xiao, Hong
    Dai, Hong-Ning
    Cheng, Shuhan
    Wang, Tongsen
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 295 - 300
  • [37] Text Detection for Dust Image Based on Deep Learning
    Liu, Hao
    Li, Ce
    Jia, Shengze
    Zhang, Dong
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 754 - 759
  • [38] Deep Learning Based Scene Text Detection: A Survey
    Jiang W.
    Zhang C.-S.
    Yin X.-C.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (05): : 1152 - 1161
  • [39] Change Detection Using Deep Learning Based Semantic Segmentation for Nuclear Activity Detection and Monitoring
    Song, Ahram
    Lee, Changhui
    Lee, Jinmin
    Han, Youkyung
    KOREAN JOURNAL OF REMOTE SENSING, 2022, 38 (06) : 991 - 1005
  • [40] Ticket Text Detection and Recognition Based on Deep Learning
    Chen, Xiuxin
    Lv, Zhijing
    Zhu, Dongdong
    Yu, Chongchong
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3922 - 3926