Enhanced Semantic-Aware Multi-Keyword Ranked Search Scheme Over Encrypted Cloud Data

被引:11
作者
Dai, Xuelong [1 ]
Dai, Hua [1 ]
Rong, Chunming [2 ]
Yang, Geng [1 ]
Xiao, Fu [1 ]
Xiao, Bin [3 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Comp Sci & Technol, Nanjing 210049, Peoples R China
[2] Univ Stavanger, Ctr IP Based Serv Innovat, N-4021 Stavanger, Norway
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
关键词
Semantic-aware search; searchable encryption; cloud computing; multi-keyword ranked search; EFFICIENT;
D O I
10.1109/TCC.2020.3047921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional searchable encryption schemes based on the Term Frequency-Inverse Document Frequency (TF-IDF) model adopt the presence of keywords to measure the relevance of documents to queries, which ignores the latent semantic meanings that are concealed in the context. Latent Dirichlet Allocation (LDA) topic model can be utilized for modeling the semantics among texts to achieve semantic-aware multi-keyword search. However, the LDA topic model treats queries and documents from the perspective of topics, and the keywords information is ignored. In this article, we propose a privacy-preserving searchable encryption scheme based on the LDA topic model and the query likelihood model. We extract the feature keywords from the document using the LDA-based Information Gain (IG) and Topic Frequency-Inverse Topic Frequency (TF-ITF) model. With feature keyword extraction and the query likelihood model, our scheme can achieve a more accurate semantic-aware keyword search. A special index tree is used to enhance search efficiency. The secure inner product operation is utilized to implement the privacy-preserving ranked search. The experiments on real-world datasets demonstrate the effectiveness of our scheme.
引用
收藏
页码:2595 / 2612
页数:18
相关论文
共 38 条
[1]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[2]  
Buckland L. P., 2004, P 13 TEXT RETRIEVAL, V500
[3]  
Bulut M, 2014, Arxiv, DOI arXiv:1401.7741
[4]   Privacy-Preserving Multi-Keyword Ranked Search over Encrypted Cloud Data [J].
Cao, Ning ;
Wang, Cong ;
Li, Ming ;
Ren, Kui ;
Lou, Wenjing .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (01) :222-233
[5]  
Cao N, 2011, IEEE INFOCOM SER, P829, DOI 10.1109/INFCOM.2011.5935306
[6]   An Efficient Privacy-Preserving Ranked Keyword Search Method [J].
Chen, Chi ;
Zhu, Xiaojie ;
Shen, Peisong ;
Hu, Jiankun ;
Guo, Song ;
Tari, Zahir ;
Zomaya, Albert Y. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (04) :951-963
[7]  
Dai X., 2019, J NETW COMPUT APPL, V147, P1
[8]  
Davis C., 1962, Numer. Math., V4, P343
[9]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10]   Semantic-Aware Searching Over Encrypted Data for Cloud Computing [J].
Fu, Zhangjie ;
Xia, Lili ;
Sun, Xingming ;
Liu, Alex X. ;
Xie, Guowu .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (09) :2359-2371