Reversible Natural Language Watermarking Using Synonym Substitution and Arithmetic Coding

被引:106
作者
Xiang, Lingyun [1 ,2 ]
Li, Yan [2 ]
Hao, Wei [3 ]
Yang, Peng [4 ]
Shen, Xiaobo [5 ]
机构
[1] Changsha Univ Sci & Technol, Hunan Prov Key Lab Intelligent Proc Big Data Tran, Changsha 410114, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Hunan, Peoples R China
[3] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Hunan, Peoples R China
[4] CNCERT CC, Hunan Branch, Changsha 410004, Hunan, Peoples R China
[5] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2018年 / 55卷 / 03期
基金
中国国家自然科学基金;
关键词
Arithmetic coding; synonym substitution; lossless compression; reversible watermarking; INTEGER WAVELET TRANSFORM; IMAGE WATERMARKING; PREDICTION;
D O I
10.3970/cmc.2018.03510
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For protecting the copyright of a text and recovering its original content harmlessly, this paper proposes a novel reversible natural language watermarking method that combines arithmetic coding and synonym substitution operations. By analyzing relative frequencies of synonymous words, synonyms employed for carrying payload are quantized into an unbalanced and redundant binary sequence. The quantized binary sequence is compressed by adaptive binary arithmetic coding losslessly to provide a spare for accommodating additional data. Then, the compressed data appended with the watermark are embedded into the cover text via synonym substitutions in an invertible manner, On the receiver side, the watermark and compressed data can be extracted by decoding the values of synonyms in the watermarked text, as a result of which the original context can be perfectly recovered by decompressing the extracted compressed data and substituting the replaced synonyms with their original synonyms. Experimental results demonstrate that the proposed method can extract the watermark successfully and achieve a lossless recovery of the original text. Additionally, it achieves a high embedding capacity.
引用
收藏
页码:541 / 559
页数:19
相关论文
共 28 条
[1]  
Bolshakov IA, 2004, LECT NOTES COMPUT SC, V3200, P180
[2]   Lossless image compression based on optimal prediction, adaptive lifting, and conditional arithmetic coding [J].
Boulgouris, NV ;
Tzovaras, D ;
Strintzis, MC .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (01) :1-14
[3]  
Luyen CT, 2016, 2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), P108, DOI 10.1109/RIVF.2016.7800278
[4]   Lossless generalized-LSB data embedding [J].
Celik, MU ;
Sharma, G ;
Tekalp, AM ;
Saber, E .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2005, 14 (02) :253-266
[5]  
Chiang Y. L., 2003, 2 INT WORKSH DIG WAT, P129
[6]   Reversible Watermarking Based on Invariant Image Classification and Dynamic Histogram Shifting [J].
Coatrieux, Gouenou ;
Pan, Wei ;
Cuppens-Boulahia, Nora ;
Cuppens, Frederic ;
Roux, Christian .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2013, 8 (01) :111-120
[7]  
Fei WB, 2013, ADV INTEL SYS RES, V30, P401
[8]  
Howard PG, 1987, P IEEE, V30, P857
[9]   Adaptive Text Steganography by Exploring Statistical and Linguistical Distortion [J].
Hu Huanhuan ;
Zuo Xin ;
Zhang Weiming ;
Yu Nenghai .
2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, :145-150
[10]   Difference Expansion Based Reversible Data Hiding Using Two Embedding Directions [J].
Hu, Yongjian ;
Lee, Heung-Kyu ;
Chen, Kaiying ;
Li, Jianwei .
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (08) :1500-1512