Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring

被引:9
|
作者
Han, Sifei [1 ]
Shi, Lingyun [1 ]
Richie, Russell [1 ]
Tsui, Fuchiang R. Rich [1 ,2 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, 2716 South St, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, 3400 Spruce St,Suite 680 Dulles, Philadelphia, PA USA
基金
美国国家科学基金会;
关键词
Attention neural network; Deep learning; Machine learning; Natural language processing; Information retrieval; Text similarity;
D O I
10.1016/j.ins.2022.10.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically measuring document similarity is imperative in natural language process-ing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural net-works, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnet-work of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are con-catenated to form a single document representation. Using the Quora Question Pairs data -set, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions.(c) 2022 Published by Elsevier Inc.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [31] Robust Arabic Text Categorization by Combining Convolutional and Recurrent Neural Networks
    Ameur, Mohamed Seghir Hadj
    Belkebir, Riadh
    Guessoum, Ahmed
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [32] ATTENTION IN RECURRENT NEURAL NETWORKS FOR RANSOMWARE DETECTION
    Agrawal, Rakshit
    Stokes, Jack W.
    Selvaraj, Karthik
    Marinescu, Mady
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3222 - 3226
  • [33] A Hybrid Model for Soybean Yield Prediction Integrating Convolutional Neural Networks, Recurrent Neural Networks, and Graph Convolutional Networks
    Ingole, Vikram S.
    Kshirsagar, Ujwala A.
    Singh, Vikash
    Yadav, Manish Varun
    Krishna, Bipin
    Kumar, Roshan
    COMPUTATION, 2025, 13 (01)
  • [34] A robust similarity based deep siamese convolutional neural network for gait recognition across views
    George, Merlin Linda
    Govindarajan, Themozhi
    Rajasekaran, Kavitha Angamuthu
    Bandi, Sudheer Reddy
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (03) : 1290 - 1319
  • [35] Quantifying Student Attention using Convolutional Neural Networks
    Coaja, Andreea
    Rusu, Catalin, V
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 293 - 299
  • [36] Spatial Channel Attention for Deep Convolutional Neural Networks
    Liu, Tonglai
    Luo, Ronghai
    Xu, Longqin
    Feng, Dachun
    Cao, Liang
    Liu, Shuangyin
    Guo, Jianjun
    MATHEMATICS, 2022, 10 (10)
  • [37] Siamese Neural Networks on the Trail of Similarity in Bugs in 5G Mobile Network Base Stations
    Zarebski, Sebastian
    Kuzmich, Aleksander
    Sitko, Sebastian
    Rusek, Krzysztof
    Cholda, Piotr
    ELECTRONICS, 2022, 11 (22)
  • [38] Convolutional Recurrent Neural Networks for Hyperspectral Data Classification
    Wu, Hao
    Prasad, Saurabh
    REMOTE SENSING, 2017, 9 (03)
  • [39] Convolutional Neural Networks for Page Segmentation of Historical Document Images
    Chen, Kai
    Seuret, Mathias
    Henneberet, Jean
    Ingold, Rolf
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 965 - 970
  • [40] Pruning convolutional neural networks via filter similarity analysis
    Lili Geng
    Baoning Niu
    Machine Learning, 2022, 111 : 3161 - 3180