摘要
网络科技资源具有地理分布广、异构、不规范、海量的特点,为了高效地查找和使用网络科技资源,提出了元数据技术来实现网络科技资源的统一组织和表示。使用基于正则表达式的元数据信息提取模型,有效地从数量巨大的Web页面中提取元数据信息。并基于轻量级目录访问协议LDAP(Lightweight Directory Access Protocol)的目录服务机制实现了科技资源元数据的存储和访问,测试表明其具有良好的可行性和有效性。
In order to search and utitize net based science and technology resources,which are broadly distributed,hybrid, irregular,massive,with high efficiency.the technology of metadata,which is utilized to organize and represent the resources in a unified way,is presented.Using metadata information extraction model based on regular expressions, metadata can be extracted from a huge number of Web pages.Directory service mechanism based on LDAP (Lightweight Directory Access Protocol) is used for science and technology metadata storage and accessing,and this method performs feasibly and effectively.
出处
《计算机工程与应用》
CSCD
北大核心
2009年第25期141-144,共4页
Computer Engineering and Applications
基金
国家科技基础条件平台建设项目(No.2005DKA63904)
关键词
网络科技资源
元数据
元数据提取
轻量级目录访问协议(LDAP)
net based science and technological resources
metadata
metadata extraction
Light weight Directory Access Protocol (LDAP)