Abstract
Text watermarking protects the intellectual property of text works by embedding copyright identification information (a watermark) in the text. This paper proposes a text watermarking method that embeds the watermark by encoding the subject-verb pairs of the sentences in a text. The watermark message is first converted into a hexadecimal Unicode code string. Using the Language Technology Platform (LTP) of Harbin Institute of Technology, the sentences in the text are processed to obtain their subject-verb pairs, and each pair is encoded with one segment of the Unicode string, which embeds the watermark. To extract the watermark, the subject-verb pairs are obtained from the text under examination and are compared and decoded against the codebook generated during embedding. The Unicode segment corresponding to each pair is retrieved, the segments are concatenated in the correct order, and the result is converted back into characters to recover the embedded watermark message. The proposed algorithm is reported to be well concealed and to resist common watermark attacks.
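The embedding and extraction flow described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the paper's implementation: the function names, the four-digit segment length, the restriction to Basic Multilingual Plane characters, and the codebook layout (an ordered list of pair-and-segment entries) are all assumptions, and the subject-verb pairs are assumed to have already been extracted by a parser such as LTP.

```python
from typing import List, Tuple

Pair = Tuple[str, str]                # (subject, verb); assumed parser output
Codebook = List[Tuple[Pair, str]]     # hypothetical layout: ordered (pair, hex segment)


def to_hex_segments(message: str, seg_len: int = 4) -> List[str]:
    """Convert the watermark to a hex Unicode string and cut it into segments."""
    if any(ord(ch) > 0xFFFF for ch in message):
        raise ValueError("this sketch assumes characters in the Basic Multilingual Plane")
    hex_string = "".join(f"{ord(ch):04x}" for ch in message)
    return [hex_string[i:i + seg_len] for i in range(0, len(hex_string), seg_len)]


def build_codebook(pairs: List[Pair], message: str, seg_len: int = 4) -> Codebook:
    """Assign one hex segment to each subject-verb pair, in text order."""
    segments = to_hex_segments(message, seg_len)
    if len(pairs) < len(segments):
        raise ValueError("the text has too few subject-verb pairs for this watermark")
    return list(zip(pairs, segments))


def extract_watermark(suspect_pairs: List[Pair], codebook: Codebook) -> str:
    """Look up each codebook pair in the suspect text and rebuild the message."""
    present = set(suspect_pairs)
    segments = []
    for pair, segment in codebook:
        if pair not in present:
            raise LookupError(f"subject-verb pair {pair} not found in the suspect text")
        segments.append(segment)
    hex_string = "".join(segments)
    # Each character was written as exactly four hex digits.
    return "".join(chr(int(hex_string[i:i + 4], 16)) for i in range(0, len(hex_string), 4))


if __name__ == "__main__":
    pairs = [("算法", "具有"), ("水印", "保护"), ("本文", "提出")]
    codebook = build_codebook(pairs, "版权")
    print(extract_watermark(pairs, codebook))
```

In this sketch the text itself is left unchanged and the watermark is recoverable only together with the codebook, which mirrors the abstract's description of decoding against a codebook generated at embedding time.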
Source
Computer Science (《计算机科学》)
CSCD
Peking University Core Journals (北大核心)
2015, Issue B11, pp. 374-377 (4 pages)
Funding
National Natural Science Foundation of China (61402244)
Nantong Applied Research Program (BK2011026)
Keywords
Digital watermarking, Text watermarking, Subject-verb encoding, LTP