摘要
目前互联网上已有的本体难以满足Web服务语义查询的需要,而手工建立本体不仅困难而且成本很高,因此有必要建立一种从已有Web服务描述中进行本体学习的方法,辅助领域专家建立高质量的领域本体。针对上述问题,提出了一种针对Web服务描述的本体学习方法。该方法利用一种基于层次Dirichlet过程(hierarchical Dirichlet process,HDP)的主题模型自动学习本体层次结构和每一层中所包含的主题数目。每一层次的主题采用"代表单词"表示,"代表单词"由算法计算得出。基于参数组合模式的规则定义语义丰富规则,并被应用在自底向上的本体语义丰富算法中。实验表明,该方法在语义内容上要比单独使用h HDP(hierarchies of hierarchical Dirichlet process)方法更加丰富,在语义层次上要好于使用关联规则挖掘方法形成的本体。
At present, ontologies on the Internet are hard to satisfy the needs of semantic searching on Web services. Manually building ontologies referring to specific application is difficult and costly, so that it is necessary to estab-lish a method of ontology automatically learning from Web service descriptions to facilitate domain experts generating high quality ontologies. In view of these problems, this paper proposes an ontology learning method from Web ser-vice descriptions. This method automatically learns ontology hierarchical structures and topics in each level by using topic model based on HDP (hierarchical Dirichlet process). Topics in each level are represented by“representation word”whose calculation is defined. Rules according to parameters composition pattern which define semantic enriching rules are utilized in the bottom-up ontology sematic enriching algorithm. Experiments show that the proposed method is richer in semantics than hHDP (hierarchies of hierarchical Dirichlet process) and better in semantic hierarchies than the method using association rule mining.
出处
《计算机科学与探索》
CSCD
北大核心
2015年第5期575-585,共11页
Journal of Frontiers of Computer Science and Technology
基金
国家自然科学基金No.61373037
国家重点基础研究发展计划(973计划)No.2014CB340404
国家科技支撑计划No.2012BAH07B01
山东省自然科学基金No.2013YD01040~~