一种基于义素的网页信息项语义匹配方法研究

A Sememe Based Semantic Matching of Web Information Item

下载PDF

导出

摘要本文提出了一种改进的基于语义的义素相似度,并从理论上分析参数β值的影响效果。在这个基础上,提出一种基于义素的词相似度,从语义上去匹配新名词和旧名词。在基于义素的词相似度基础上,提出一种网页信息项的语义匹配方法,来识别网页信息项的类别。实验结果表明,基于义素相似度的网页信息项语义匹配方法具有较好的匹配效果。 Because some new words will be created in the development of human knowledge and the expression forms of words may be various, the word matching method is needed to research in order to adapt the changes from the point of the semantic features of words. This paper proposes an improved sememe similarity, which is based on se- mantic features of words. To analyze the effect of the coefficient β, this paper deducts three theorems in theory and then educes that the value of the coefficient β must be set in a range, which is to say that the coefficient β cannot be set much little value or much large value. Based on the sememe similarity, a word similarity is put forward to match new words with old words from the view of the semantic features. To adapt the Web page information, which in- cludes the human knowledge and has various expression forms, a novel semantic matching method is proposed to iden- tify the class of the Web information items. The semantic matching method is based on both the word similarity and the sememe similarity. As the experiment results show, the sememe based semantic matching method gains higher ac- curacy to identify the class of the Web information items.

作者卢正鼎张茂元

机构地区华中科技大学计算机科学与技术学院

出处《计算机科学》 CSCD 北大核心 2005年第4期49-51,54,共4页 Computer Science

关键词网页信息匹配方法语义相似度影响效果分析参数匹配效果基础名词 Sememe Similarity Semantic Matching

分类号 TP393.092 [自动化与计算机技术—计算机应用技术] TN919.81 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献6

1Da L G, Facon J, Borges D L. Visual speech recognition: a solution from feature extraction to words classification. In: Proc. of Symposium on Computer Graphics and Image Processing, XVI Brazilian, 2003. 399～405
2Nakashima T. Classification of characteristic words of electronic newspaper based on the directed relation. In: 2001 IEEE Pacific Rim Conf. on Communications, Computers and signal Processing, 2001(2) :591～594
3Vladimir A O. Ontology based semantic similarity comparison of documents. In: 14th Intl. Workshop on Database and Expert Systems Applications (DEXA'03), 2003. 735～738
4程莉,卢正鼎,文坤梅,李娟.基于语义的模糊匹配探索与应用[J].华中科技大学学报（自然科学版）,2003,31(2):23-25. 被引量：12
5Rodriguez M A, Egenhofer M J. Determining semantic similarity among entity classes from different ontologies, Knowledge and Data Engineering. IEEE Transa. on, 2003, 15(2): 442～456
6张茂元,卢正鼎.基于特征选取及模糊学习的网页分类方法研究[J].小型微型计算机系统,2004,25(7):1397-1400. 被引量：4

二级参考文献8

1[1]Gan K W, Wong P W. Annotating information structures in Chinese texts using HowNet. Hong Kong: Second Chinese Language Processing Workshop, 2000. 85～92
2[1]Salton G. Automatic text processing[M].Massachusetts:Addison-wesley,1989.
3[3]Joliffe I T. Principal component analysis[M].New York:Springer-Verlag,1986.
4[4]Setiono R, Liu H. Neural network feature selector[J].IEEE Trans,Neural Networks,1997,8, 654-662.
5[5]Kudo M, Sklansky J,Comparison of algorithms that select features pattern classifiers[J].Pattern Recognit,2000,33(1): 25-41.
6[6]Basak J,De R K, Pal S K. Unsupervised feature selection using a neuro-fuzzy approach[J].Pattern Recognit Lett,1998,19(11):997-1006.
7范焱,郑诚,王清毅,蔡庆生,刘洁.用Naive Bayes方法协调分类Web网页[J].软件学报,2001,12(9):1386-1392. 被引量：53
8李素建.基于语义计算的语句相关度研究[J].计算机工程与应用,2002,38(7):75-76. 被引量：83

共引文献13

1张瑾,刘亚清,于纯妍.汉语词义排歧的另一种方法[J].小型微型计算机系统,2006,27(4):724-726. 被引量：1
2刘亚清,张瑾,于纯妍.基于义原同现频率的汉语词义排歧系统[J].计算机技术与发展,2006,16(5):184-185. 被引量：1
3张茂元,邹春燕,卢正鼎.一种基于语义匹配的Web信息提取方法研究[J].计算机工程与应用,2006,42(23):141-143. 被引量：3
4刘亚清,于纯妍,张瑾.改进的基于义原同现频率的汉语词义排歧方法[J].计算机工程与科学,2006,28(12):136-138.
5文坤梅,卢正鼎,叶卫国.Web-MIND:基于特定主题的Web信息挖掘系统[J].计算机工程与科学,2007,29(6):71-73.
6李佳林.在线考试系统中主观题自动阅卷的设计[J].中国教育技术装备,2008(24):113-114. 被引量：6
7伞晓丽.基于B/S模式的网上考试系统的设计与实现[J].福建电脑,2009,25(1):120-121. 被引量：2
8李倩.基于SVM的网络文本分类[J].电子技术（上海）,2014,0(10):8-11. 被引量：2
9王小林,王东,杨思春,邰伟鹏,郑啸.基于《知网》的词语语义相似度算法[J].计算机工程,2014,40(12):177-181. 被引量：16
10江国荐,顾乃杰,张旭,任开新.基于SAE-LBP的网页分类研究[J].小型微型计算机系统,2016,37(4):738-742. 被引量：4

1乔亚男,刘跃虎,齐勇.查询词相似度加权的邻近性检索方法[J].模式识别与人工智能,2013,26(2):189-194. 被引量：2
2袁里驰.几种基于统计的词聚类方法比较[J].中南大学学报（自然科学版）,2016,47(9):3079-3084. 被引量：1
3于治会.信号与单位[J].电子产品可靠性与环境试验,1989(5):64-67.
4张茂元,邹春燕,卢正鼎.一种基于语义匹配的Web信息提取方法研究[J].计算机工程与应用,2006,42(23):141-143. 被引量：3
5鲁普平.从义素看语文字典辞书中的通假——以“宛”“怨”“汙”为例[J].励耘语言学刊,2016(2):260-266.
6谌颃.社会化标签语义相似度的协同过滤算法[J].华侨大学学报（自然科学版）,2016,37(1):84-87.
7王静.基于网络日志的用户查询推荐[J].河南科技,2016,35(7):50-51. 被引量：1
8鞠艳清.浅析义素分析法在现代汉语各领域中的应用[J].文教资料,2014(30):174-176.
9李亚明.汉语双音并列词语的传承方式——从《连文释义》和《现代汉语词典》的比较看[J].励耘语言学刊,2006(2):54-78.
10袁里驰,钟义信.基于相似度的词聚类算法[J].微电子学与计算机,2005,22(8):93-95. 被引量：4

计算机科学

2005年第4期

浏览历史

内容加载中请稍等...

一种基于义素的网页信息项语义匹配方法研究

参考文献6

二级参考文献8

共引文献13

相关作者

相关机构

相关主题

浏览历史