摘要
该文针对现有的中文POI(points of interest)匹配中存在的不足,提出了基于角色标注的中文POI匹配的方法,提高了POI匹配的准确性和效率。其基本思想是:根据在POI匹配中的作用,在对POI分词的基础上用HMM(Hide Markov Model隐马模型)对POI的切分单位进行角色标注,切分单位的角色不同,其在匹配过程中的地位也不同,在精确匹配失败后,再根据角色信息进行模糊匹配,从而提高了中文POI匹配成功率。
This paper analyzes the shortcoming of existed Chinese POI macthing methods.We present a role tagging-based way,which can improve the rate of POI macthing success.That is:tokens after segmentation are tagged using HMM with diffirent roles according to theirs functions in the generation of chinese POI s.Tokens with diffirent roles have diffirent degree of importance in matching.After exact matching failure.Fuzzy matching is excuted according to tokens' roles.
出处
《电脑知识与技术》
2011年第7X期5144-5146,5164,共4页
Computer Knowledge and Technology
关键词
分词
角色标注
HMM
POI匹配
segmentation
Chinese POI matching
role tagging
HMM