摘要
UCL(UniformContentLocator)是作者、编者和读者进行语义沟通的工具,是进行信息快速选择、智能代理和信息主动服务的基础。该文针对网络信息检索中的自动标引问题,提出了一种基于UCL的网页自动标引技术。研究了从HTML编写的网页映射到XML文档的过程,并从中提取符合用户兴趣模型的UCL字段,从而达到网页自动标引的目的。实验验证了理论方案的正确性和有效性。
UCL(Uniform Content Locator)builds a bridge between author,editer and reader for better understanding,which is a key technique for data processing such as receiving quickly,information filtering,service intelligently and actively in many domains.Aiming at the problem of automatic indexing in the information searches,an indexing method for web pages based on UCL is put forward in this paper.In order to achieve the computer automatic indexing,we study the mapping process from HTML to XML ,and extract the UCL information of fitting for the interesting profile of client users.The experiment result shows that the creative technology and the new designs are correctness and efficiency.
出处
《计算机工程与应用》
CSCD
北大核心
2004年第17期148-151,共4页
Computer Engineering and Applications
基金
国家自然科学基金资助项目(编号:60272014)
国家863高技术研究发展计划项目(编号:2002AA121063)资助