Hierarchical Multiview Top-k Pooling with Deep-Q-Networks

被引:0
作者
Li Z.-P. [1 ,2 ]
Su H.-L. [3 ]
Wu Y.-. [3 ]
Zhang Q.-H. [1 ]
Yuan C.-A. [4 ]
Gribova V. [5 ]
Filaretov V.F. [5 ]
Huang D.-S. [1 ]
机构
[1] Eastern Institute of Technology, Zhejiang, Ningbo
[2] University of Science and Technology of China, School of Life Sciences, Anhui, Hefei
[3] Tongji University, Institute of Machine Learning and Systems Biology, School of Electronics and Information Engineering, Shanghai
[4] Guangxi Academy of Sciences, Institute of Big Data and Intelligent Computing Research Center, Nanning
[5] Far Eastern Branch of the Russian Academy of Sciences, Institute of Automation and Control Processes, Vladivostok
来源
IEEE Transactions on Artificial Intelligence | 2024年 / 5卷 / 06期
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Deep-Q-network; graph neural networks; graph pooling; multiview information;
D O I
10.1109/TAI.2023.3334261
中图分类号
学科分类号
摘要
Graph neural networks (GNNs) are extensions of deep neural networks to graph-structured data. It has already attracted widespread attention for various tasks such as node classification and link prediction. Existing research focuses more on graph convolutional neural networks (GCNs). However, it is usually overlooked that graph pooling can obtain graph representations by summarizing and down-sampling node information. Meanwhile, existing graph pooling methods mainly use top-k for node selection, but most of them consider only single view information when scoring nodes, and the k values in top-k are usually selected empirically. This work proposes the hierarchical multiview top-k pooling (HMTPool) with deep-Q-networks, which scores nodes taking into account multiview information (considering graph structure and features) and does not rely on the empirical adaptive selection of the best k value. HMTPool is a two-stage process. It first uses a variant GCN and multilayer perceptron to score the nodes from structural and feature perspectives, respectively and then performs fusion operations on the multiview information scores of the nodes. In addition, to select the optimal pooling ratio of top-k, we propose a deep-Q-network-based top-k (DTop-k) node selection method, which can adaptively select the best pooling ratio without prior knowledge. Experimental results on six TUDatasets and two Benchmarking GNNs datasets demonstrate the effectiveness of our proposed approach. © 2020 IEEE.
引用
收藏
页码:2985 / 2996
页数:11
相关论文
共 69 条
[1]  
Abdel-Hamid O., Mohamed A.R., Jiang H., Deng L., Penn G., Yu D., Convolutional neural networks for speech recognition, IEEE/ ACM Trans. Audio, Speech, Language Process., 22, 10, pp. 1533-1545, (2014)
[2]  
Xiong W., Wu L., Alleva F., Droppo J., Huang X., Stolcke A., The Microsoft 2017 conversational speech recognition system, Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), pp. 5934-5938, (2018)
[3]  
Shi B., Bai X., Yao C., An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition, IEEE Trans. Pattern Anal. Mach. Intell., 39, 11, pp. 2298-2304, (2017)
[4]  
Wang W., Et al., PAN: Towards efficient and accurate end-to-end spotting of arbitrarily-shaped text, IEEE Trans. Pattern Anal. Mach. Intell., 44, 9, pp. 5349-5367, (2021)
[5]  
Bochkovskiy A., Wang C.-Y., Liao H., Yolov4: Optimal speed and accuracy of object detection, (2020)
[6]  
He K., Gkioxari G., Dollar P., Girshick R., Mask R-CNN, Proc. IEEE Int. Conf. Comput. Vision (ICCV), pp. 2980-2988, (2017)
[7]  
Ronneberger O., Fischer P., Brox T., U-Net: Convolutional networks for biomedical image segmentation, Med. Image Comput. Comput.-Assisted Intervention-MICCAI 2015: 18th Int. Conf., pp. 234-241, (2015)
[8]  
Wu Y., Et al., Discriminative local representation learning for crossmodality visible-thermal person re-identification, IEEE Trans. Biom., Behav., Ident. Sci., 5, 1, pp. 1-14, (2023)
[9]  
Fan W., Et al., Graph neural networks for social recommendation, Proc. World Wide Web Conf. (WWW), pp. 417-426, (2019)
[10]  
Wu L., Sun P., Hong R., Fu Y., Wang X., Wang M., SocialGCN: An efficient graph convolutional network based model for social recommendation, (2018)