Abstract
Compound documents account for a large share of the traffic in modern communication services. Because of demodulation bit errors and congestion-induced packet loss during file transfer, a received compound document often contains so many errors that it cannot be opened, and the content it carries cannot be retrieved. To address this problem, the paper studies Word 2007 documents as a representative OpenXML compound document format. Exploiting the robustness inherent in the document itself, it proposes a repair method for OpenXML compound documents based on the recombination of key components: the compound document is reconstructed from certain key XML files and relationship files, so as to recover as much as possible of the information carried by the damaged document.
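The idea of rebuilding a document from a few key parts can be illustrated with a minimal sketch. It is not the authors' algorithm, which the abstract does not detail. It assumes that the main part `word/document.xml` has already been recovered intact from the damaged file, and it wraps that part in a freshly written content-types file and root relationship file. The function name `rebuild_docx` is hypothetical.

```python
import io
import zipfile

# Minimal content-types part: declares the media type of the main document part.
CONTENT_TYPES = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    '<Types xmlns="http://schemas.openxmlformats.org/package/2006/content-types">'
    '<Default Extension="rels" '
    'ContentType="application/vnd.openxmlformats-package.relationships+xml"/>'
    '<Default Extension="xml" ContentType="application/xml"/>'
    '<Override PartName="/word/document.xml" '
    'ContentType="application/vnd.openxmlformats-officedocument.'
    'wordprocessingml.document.main+xml"/>'
    '</Types>'
)

# Minimal root relationship part: points the package at the main document part.
ROOT_RELS = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    '<Relationships '
    'xmlns="http://schemas.openxmlformats.org/package/2006/relationships">'
    '<Relationship Id="rId1" '
    'Type="http://schemas.openxmlformats.org/officeDocument/2006/'
    'relationships/officeDocument" '
    'Target="word/document.xml"/>'
    '</Relationships>'
)


def rebuild_docx(document_xml: bytes) -> bytes:
    """Wrap a recovered word/document.xml in a new minimal OpenXML package.

    Assumption: document_xml is well-formed WordprocessingML. Styles, images,
    and other secondary parts are deliberately left out of the new package.
    """
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("[Content_Types].xml", CONTENT_TYPES)
        zf.writestr("_rels/.rels", ROOT_RELS)
        zf.writestr("word/document.xml", document_xml)
    return buf.getvalue()
```

Writing the returned bytes to a `.docx` file gives a package that contains only the three parts above. If the recovered `word/document.xml` still refers to images or other parts through relationship IDs, those references would need to be removed or a matching `word/_rels/document.xml.rels` supplied, which this sketch does not attempt.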
Authors
杨东煜
王晓梅
郑遥
YANG Dongyu; WANG Xiaomei; ZHENG Yao (Information Engineering University, Zhengzhou 450001, China)
Source
《信息工程大学学报》
2018, No. 5, pp. 580-585 (6 pages)
Journal of Information Engineering University
Funding
Project funded by the Southwest Institute of Electronics and Telecommunication Technology (2014024)