Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring

被引:9
|
作者
Han, Sifei [1 ]
Shi, Lingyun [1 ]
Richie, Russell [1 ]
Tsui, Fuchiang R. Rich [1 ,2 ]
机构
[1] Childrens Hosp Philadelphia, Dept Biomed & Hlth Informat, Tsui Lab, 2716 South St, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, 3400 Spruce St,Suite 680 Dulles, Philadelphia, PA USA
基金
美国国家科学基金会;
关键词
Attention neural network; Deep learning; Machine learning; Natural language processing; Information retrieval; Text similarity;
D O I
10.1016/j.ins.2022.10.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically measuring document similarity is imperative in natural language process-ing, with applications ranging from recommendation to duplicate document detection. State-of-the-art approach in document similarity commonly involves deep neural net-works, yet there is little study on how different architectures may be combined. Thus, we introduce the Siamese Attention-augmented Recurrent Convolutional Neural Network (S-ARCNN) that combines multiple neural network architectures. In each subnet-work of S-ARCNN, a document passes through a bidirectional Long Short-Term Memory (bi-LSTM) layer, which sends representations to local and global document modules. A local document module uses convolution, pooling, and attention layers, whereas a global document module uses last states of the bi-LSTM. Both local and global features are con-catenated to form a single document representation. Using the Quora Question Pairs data -set, we evaluated S-ARCNN, Siamese convolutional neural networks (S-CNNs), Siamese LSTM, and two BERT models. While S-CNNs (82.02% F1) outperformed S-ARCNN (79.83% F1) overall, S-ARCNN slightly outperformed S-CNN on duplicate question pairs with more than 50 words (39.96% vs. 39.42% accuracy). With the potential advantage of S-ARCNN for processing longer documents, S-ARCNN may help researchers identify collaborators with similar research interests, help editors find potential reviewers, or match resumes with job descriptions.(c) 2022 Published by Elsevier Inc.
引用
收藏
页码:90 / 102
页数:13
相关论文
共 50 条
  • [1] CSI-Fingerprinting Indoor Localization via Attention-Augmented Residual Convolutional Neural Network
    Zhang, Bowen
    Sifaou, Houssem
    Li, Geoffrey Ye
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (08) : 5583 - 5597
  • [2] Enhancing Plant Disease Detection Using Attention-Augmented Residual Networks and Faster Region-Convolutional Networks
    Sathya, K.
    Balakrishnan, Arunkumar
    Baskaran, P.
    Ramamoorthy, Arun Kumar
    IEEE ACCESS, 2025, 13 : 48625 - 48642
  • [3] Siamese Convolutional Neural Networks to Quantify Crack Pattern Similarity in Masonry Facades
    Rozsas, Arpad
    Slobbe, Arthur
    Huizinga, Wyke
    Kruithof, Maarten
    Pillai, Krishna Ajithkumar
    Kleijn, Kelvin
    Giardina, Giorgia
    INTERNATIONAL JOURNAL OF ARCHITECTURAL HERITAGE, 2023, 17 (01) : 147 - 169
  • [4] An Attention-Augmented Convolutional Neural Network With Focal Loss for Mixed-Type Wafer Defect Classification
    Batool, Uzma
    Shapiai, Mohd Ibrahim
    Mostafa, Salama A.
    Ibrahim, Mohd Zamri
    IEEE ACCESS, 2023, 11 : 108891 - 108905
  • [5] Attention-Augmented Convolutional Autoencoder for Radar-Based Human Activity Recognition
    Campbell, Christopher
    Ahmad, Fauzia
    2020 IEEE INTERNATIONAL RADAR CONFERENCE (RADAR), 2020, : 990 - 995
  • [6] Recognizing text lines in handwritten archival document images using octave convolutional and attention recurrent neural networks
    Olfa Mechi
    Maroua Mehri
    Rolf Ingold
    Najoua Essoukri Ben Amara
    Multimedia Tools and Applications, 2025, 84 (17) : 18095 - 18122
  • [7] A New Siamese Heterogeneous Convolutional Neural Networks Based on Attention Mechanism and Feature Pyramid
    Lu, Zhenyu
    Bian, Yuelou
    Yang, Tingya
    Ge, Quanbo
    Wang, Yuanliang
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (01) : 13 - 24
  • [8] Enhancing resolution uniformity of spectral line confocal 3D sensors through integration of vision transformer with perpendicular attention-augmented parallel convolutional neural networks
    Wang, Shuai
    Zheng, Zechen
    Diao, Kuan
    Liu, Xiaojun
    MEASUREMENT, 2025, 249
  • [9] Automated fundus ultrasound image classification based on siamese convolutional neural networks with multi-attention
    Jiachen Tan
    Yongquan Dong
    Junchi Li
    BMC Medical Imaging, 23
  • [10] Automatic playlist generation using Convolutional Neural Networks and Recurrent Neural Networks
    Irene, Rosilde Tatiana
    Borrelli, Clara
    Zanoni, Massimiliano
    Buccoli, Michele
    Sarti, Augusto
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,