期刊文献+

EDCMS:A Content Management System for Engineering Documents

EDCMS:A Content Management System for Engineering Documents
下载PDF
导出
摘要 Engineers often need to look for the right pieces of information by sifting through long engineering documents, It is a very tiring and time-consuming job. To address this issue, researchers are increasingly devoting their attention to new ways to help information users, including engineers, to access and retrieve document content. The research reported in this paper explores how to use the key technologies of document decomposition (study of document structure), document mark-up (with EXtensible Mark- up Language (XML), HyperText Mark-up Language (HTML), and Scalable Vector Graphics (SVG)), and a facetted classification mechanism. Document content extraction is implemented via computer programming (with Java). An Engineering Document Content Management System (EDCMS) developed in this research demonstrates that as information providers we can make document content in a more accessible manner for information users including engineers.The main features of the EDCMS system are: 1) EDCMS is a system that enables users, especially engineers, to access and retrieve information at content rather than document level. In other words, it provides the right pieces of information that answer specific questions so that engineers don't need to waste time sifting through the whole document to obtain the required piece of information. 2) Users can use the EDCMS via both the data and metadata of a document to access engineering document content. 3) Users can use the EDCMS to access and retrieve content objects, i.e. text, images and graphics (including engineering drawings) via multiple views and at different granularities based on decomposition schemes. Experiments with the EDCMS have been conducted on semi-structured documents, a textbook of CADCAM, and a set of project posters in the Engineering Design domain. Experimental results show that the system provides information users with a powerful solution to access document content. Engineers often need to look for the right pieces of information by sifting through long engineering documents, It is a very tiring and time-consuming job. To address this issue, researchers are increasingly devoting their attention to new ways to help information users, including engineers, to access and retrieve document content. The research reported in this paper explores how to use the key technologies of document decomposition (study of document structure), document mark-up (with EXtensible Mark- up Language (XML), HyperText Mark-up Language (HTML), and Scalable Vector Graphics (SVG)), and a facetted classification mechanism. Document content extraction is implemented via computer programming (with Java). An Engineering Document Content Management System (EDCMS) developed in this research demonstrates that as information providers we can make document content in a more accessible manner for information users including engineers.The main features of the EDCMS system are: 1) EDCMS is a system that enables users, especially engineers, to access and retrieve information at content rather than document level. In other words, it provides the right pieces of information that answer specific questions so that engineers don't need to waste time sifting through the whole document to obtain the required piece of information. 2) Users can use the EDCMS via both the data and metadata of a document to access engineering document content. 3) Users can use the EDCMS to access and retrieve content objects, i.e. text, images and graphics (including engineering drawings) via multiple views and at different granularities based on decomposition schemes. Experiments with the EDCMS have been conducted on semi-structured documents, a textbook of CADCAM, and a set of project posters in the Engineering Design domain. Experimental results show that the system provides information users with a powerful solution to access document content.
出处 《International Journal of Automation and computing》 EI 2007年第1期56-70,共15页 国际自动化与计算杂志(英文版)
基金 This work was supported by the UK Engineering and Physical Sciences Research Council(EPSRC)(No.GR/R67507/01).
关键词 Document content management engineering design decomposition schemes document mark-up facetted classification. Document content management, engineering design, decomposition schemes, document mark-up, facetted classification.
  • 相关文献

参考文献12

  • 1Suhit Gupta,Gail E. Kaiser,Peter Grimm,Michael F. Chiang,Justin Starren.Automating Content Extraction of HTML Documents[J].World Wide Web.2005(2)
  • 2D. A. Lizorkin,K. Yu. Lisovsky.Implementation of the XML linking language XLink by functional methods[J].Programming and Computer Software.2005(1)
  • 3J.Rowley,J.Farrow.Organising Knowledge:an Introduction to Information Retrieval[]..2000
  • 4T.Quatrani.Visual Modelling with Rational Rose 2000 and UML[]..2000
  • 5.XML DTD,W3C[]..2006
  • 6A.C.Foskett.The Subject Approach to Information[]..1996
  • 7Dublin Core. http://dublincore.org/ . 2006
  • 8C.A.McMahon,A.Lowe,S.J.Culley,M.Corderoy,R.Crossland,T.Shah,D.Stewart.Waypoint:an Integrated Search and Retrieval System for Engineering Documents[].Journal of Computing and Information Science in Engineering.2004
  • 9L.H.Chen,,W.L.Chue.Using Web Structure and Summarisation Techniques for Web Content Mining[].Information Processing Letters.2005
  • 10Sitecore Content Manager. http://www.sitecore.net . 2006

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部