Exploring Deep Learning in Semantic Question Matching

被引:0
|
作者
Dhakal, Ashwin [1 ]
Poudel, Arpan [1 ]
Pandey, Sagar [1 ]
Gaire, Sagar [1 ]
Baral, Hari Prasad [1 ]
机构
[1] Tribhuvan Univ, Inst Engn, Dept Elect & Comp Engn, Paschimanchal Campus,Lamachour 16, Pokhara, Nepal
来源
PROCEEDINGS ON 2018 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS) | 2018年
关键词
Semantic matching; Question duplication; natural language processing; deep learning; Google News Vector;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Question duplication is the major problem encountered by Q&A forums like Quora, Stack-overflow, Reddit, etc. Answers get fragmented across different versions of the same question due to the redundancy of questions in these forums. Eventually, this results in lack of a sensible search, answer fatigue, segregation of information and the paucity of response to the questioners. The duplicate questions can be detected using Machine Learning and Natural Language Processing. Dataset of more than 400,000 questions pairs provided by Quora are pre-processed through tokenization, lemmatization and removal of stop words. This pre-processed dataset is used for the feature extraction. Artificial Neural Network is then designed and the features hence extracted, are fit into the model. This neural network gives accuracy of 86.09%. In a nutshell, this research predicts the semantic coincidence between the question pairs extracting highly dominant features and hence, determine the probability of question being duplicate.
引用
收藏
页码:86 / 91
页数:6
相关论文
共 50 条
  • [21] A deep neural architecture for sentence semantic matching
    Zhang, Xu
    Lu, Wenpeng
    Li, Fangfang
    Zhang, Ruoyu
    Cheng, Jinyong
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 21 (04) : 574 - 582
  • [22] FAQ question Answering method based on semantic similarity matching
    Ji Tenghao
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 93 - 100
  • [23] Question answering using sentence parsing and semantic network matching
    Hartrumpf, S
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 512 - 521
  • [24] gMatch: Knowledge base question answering via semantic matching
    Jiao, Jie
    Wang, Shujun
    Zhang, Xiaowang
    Wang, Longbiao
    Feng, Zhiyong
    Wang, Junhu
    KNOWLEDGE-BASED SYSTEMS, 2021, 228
  • [25] Forum Duplicate Question Detection by Domain Adaptive Semantic Matching
    Xu, Zhuojia
    Yuan, Hua
    IEEE ACCESS, 2020, 8 : 56029 - 56038
  • [26] An Semantic Similarity Matching Method for Chinese Medical Question Text
    Wang, Liru
    Zhang, Tongxuan
    Tian, Jiewen
    Lin, Hongfei
    HEALTH INFORMATION PROCESSING, CHIP 2022, 2023, 1772 : 82 - 94
  • [27] Semantic Structure in Deep Learning
    Pavlick, Ellie
    ANNUAL REVIEW OF LINGUISTICS, 2022, 8 : 447 - 471
  • [28] Semantic Adversarial Deep Learning
    Seshia, Sanjit A.
    Jha, Somesh
    Dreossi, Tommaso
    IEEE DESIGN & TEST, 2020, 37 (02) : 8 - 18
  • [29] Semantic Adversarial Deep Learning
    Dreossi, Tommaso
    Jha, Somesh
    Seshia, Sanjit A.
    COMPUTER AIDED VERIFICATION (CAV 2018), PT I, 2018, 10981 : 3 - 26
  • [30] Learning question classifiers: The role of semantic information
    Department of Computer Science, University of Illinois, Urbana-Champaign, IL 61801, United States
    Nat Lang Eng, 2006, 3 (229-249):