Reversible Natural Language Watermarking Using Synonym Substitution and Arithmetic Coding

被引:104
作者
Xiang, Lingyun [1 ,2 ]
Li, Yan [2 ]
Hao, Wei [3 ]
Yang, Peng [4 ]
Shen, Xiaobo [5 ]
机构
[1] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
[3] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Hunan, Peoples R China
[4] CNCERT CC, Hunan Branch, Changsha 410004, Hunan, Peoples R China
[5] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2018年 / 55卷 / 03期
基金
中国国家自然科学基金;
关键词
Arithmetic coding; synonym substitution; lossless compression; reversible watermarking; INTEGER WAVELET TRANSFORM; IMAGE WATERMARKING; PREDICTION;
D O I
10.3970/cmc.2018.03510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For protecting the copyright of a text and recovering its original content harmlessly, this paper proposes a novel reversible natural language watermarking method that combines arithmetic coding and synonym substitution operations. By analyzing relative frequencies of synonymous words, synonyms employed for carrying payload are quantized into an unbalanced and redundant binary sequence. The quantized binary sequence is compressed by adaptive binary arithmetic coding losslessly to provide a spare for accommodating additional data. Then, the compressed data appended with the watermark are embedded into the cover text via synonym substitutions in an invertible manner, On the receiver side, the watermark and compressed data can be extracted by decoding the values of synonyms in the watermarked text, as a result of which the original context can be perfectly recovered by decompressing the extracted compressed data and substituting the replaced synonyms with their original synonyms. Experimental results demonstrate that the proposed method can extract the watermark successfully and achieve a lossless recovery of the original text. Additionally, it achieves a high embedding capacity.
引用
收藏
页码:541 / 559
页数:19
相关论文
共 28 条
[11]  
Jiang Chuanxian, 2010, Journal of Computer Aided Design & Computer Graphics, V22, P879, DOI 10.3724/SP.J.1089.2010.10788
[12]   Reversible data embedding into images using wavelet techniques and sorting [J].
Kamstra, L ;
Heijmans, HJAM .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (12) :2082-2090
[13]   Exploring the learning capabilities of convolutional neural networks for robust image watermarking [J].
Kandi, Haribabu ;
Mishra, Deepak ;
Gorthi, Subrahmanyam R. K. Sai .
COMPUTERS & SECURITY, 2017, 65 :247-268
[14]   A recent survey of reversible watermarking techniques [J].
Khan, Asifullah ;
Siddiqa, Ayesha ;
Munib, Summuyya ;
Malik, Sana Ambreen .
INFORMATION SCIENCES, 2014, 279 :251-272
[15]  
Kumar R., 2017, IEEE INT C COMP COMM, P1090
[16]   Reversible image watermarking based on integer-to-integer wavelet transform [J].
Lee, Sunil ;
Yoo, Chang D. ;
Kalker, Ton .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2007, 2 (03) :321-330
[17]  
[林新建 Lin Xinjian], 2015, [中文信息学报, Journal of Chinese Information Processing], V29, P151
[18]   Natural language watermarking via morphosyntactic alterations [J].
Meral, Hasan Mesut ;
Sankur, Buelent ;
Oezsoy, A. Sumru ;
Guengoer, Tunga ;
Sevinc, Emre .
COMPUTER SPEECH AND LANGUAGE, 2009, 23 (01) :107-125
[19]  
Topkara U., 2006, P 8 WORKSHOP MULTIME, P164, DOI [DOI 10.1145/1161366.1161397, 10.1145/1161366.1161397]
[20]  
Witten I. H., 2002, P IEEE, V82, P857