期刊文献+

基于本体的办公文档处理研究

Research on office document processing based on ontology
下载PDF
导出
摘要 目前的办公文档通常都是基于XML格式的,其树型存储结构中包括逻辑内容、格式描述、页面版式描述以及编辑元素描述,它们之间既相互分离又相互融合,给文档的处理带来复杂性。论文分析了办公文档的结构特征,提出了在两种典型应用处理场景中基于本体的文档操作方法。本体的引入可以使办公文档的处理能够根据不同的应用环境,通过机器推理机制实现文档处理的智能化,同时有利于实现文档处理的互操作;在处理过程中节点的定位相对于XPath更高效,并能够满足在特定应用中,文档的处理不破坏文档的基本结构需求。本文以中文办公软件格式标准UOF为基础建立基于本体的文档结构模型,并利用SWRL推理规则,实现办公文档的智能化处理。 Currently,office document formats are usually based on XML.It includes some logic content nodes,format style nodes,page layout describing nodes and some editing element nodes in its tree storage structure.It raises some issues for processing.Paper analyses the characteristics of document structure,and two methods of document processing under different typical application scenarios which based on ontology are presented.As ontology technology is introduced into,office document processing can be reasoned by machine according to various environments and to be executed automatically,at the same time it brings some benefits for interoperability,and in the procedure of processing,positioning the nodes will get more efficiency than XPath without destroying the document structure in special applications.In the end,the paper shows how to build office document ontology model based on UOF format and describes simple SWRL rules for intelligent processing for office document.
出处 《北京信息科技大学学报(自然科学版)》 2010年第S2期97-102,142,共7页 Journal of Beijing Information Science and Technology University
基金 北京市教委科技发展重点项目暨北京市自然科学基金(KZ200810772017) 北京市属市管高等学校人才强教计划资助项目(PHR201007131)
关键词 办公文档 本体 智能操作 机器理解 UOF office document ontology intelligent operation machine understanding UOF
  • 相关文献

参考文献11

  • 1李宁,牟永敏,董慧,方春燕.文档格式中“内容”与“表现”的分离与融合[J].电子学报,2007,35(2):375-378. 被引量:10
  • 2李为冲.XML到OWL文档生成方法研究[D]中国石油大学,中国石油大学2008.
  • 3GB/T 20916-2007.中文办公软件文档格式规范[S],2007.
  • 4ISO/IEC JTC1.Informationtechnology——Open Document Format for OfficeApplications (OpenDocument)v1.0[S/OL]. ISO/IEC 26300:2006 . 2010
  • 5ISO/IEC JTC1.Informationtechnology-Document description and processinglanguage——Office Open XML file formats[S/OL]. ISO/IEC 29500:2008 . 2010
  • 6Sergej Melnik.Briging the gap between RDF andXML[R/OL]. http:∥infolab.stanford.edu/~melnik/rdf/fusion.html#rdf:RDF99 . 2010
  • 7Wikipedia.WYSIWYG. http:∥en.wikipedia.org/wiki/WYSIWYG . 2010
  • 8LaksV.S.Lakshmanan,Fereidoon Sadri.XML In-teroperability. http:∥citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.4.1263&rep=rep1&type=pdf . 2010
  • 9Hannes Bohring,S ren Auer.Mapping XML toOWL Ontologies[C/OL]. http:∥www.zdnetasia.com/whitepaper/mapping-xml-to-owl-ontolo-gies_wp-392920.htm . 2010
  • 10Michel Klein,Dieter Fensel,Frank van Harmel-en,et al.The relation between ontologies andXML schemata. ht-tp:∥www.ida.liu.se/ext/epa/cis/2001/004/paper.pdf . 2010

二级参考文献14

  • 1Deach S.What is XSL-FO and when should I use it? [J] .The Seybold Report, 2002,2(17) : 1 - 8.
  • 2Alschuler L.ABCD.SGML,A User's Guide to Structured Information[M]. Boston: International Thomson Computer Press,1995.31 - 32.
  • 3Goldfarb C,Prescod.Paul.XML Handbook (5th Edition) [M].New Jersey:Prentice Hall FIR,2003.350- 394.
  • 4OASIS. Open Document Format for Office Applications (Open-Document) v 1. 0 [ S/OL ]. http://www. oasis-open, org/committees/download, php/12572/OpertDoctment-v1. 0-os. pdf,2006-04-15.
  • 5Microsoft. Microsoft Office Open XML Formats Overview [ R/OL]. http://www. micrisift. com/office/preview/developers/fileoverview, mspx, 2006-04-15.
  • 6GB/T 19667.1-2005,基于XML的电子公文格式规范第1部分:总则[S].
  • 7GB/T 19667.2-2005,基于XML的电子公文格式规范第2部分:公文体[S].
  • 8Vlist E.XML Schema[M] .Sebastopol:O'Reilly,2002.104- 105.
  • 9Lenz E,McRae M,St. Laurent S. Office 2003 XML[M] .Sebastopol:O'Reilly,2004.144- 147.
  • 10W3C. XML Path Language (XPath) Version 1.0[S/OL]. http.//www. w3. org/TR/xpath, 2006-04-15.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部