
Study of an Object Recognition Model Based on Context Similarity
Abstract Object recognition is an important problem in data integration. To address object integration in the academic domain, the authors propose an object recognition model based on context. The model uses the context of an author name, consisting of four dimensions of information (co-authors, international conferences, publication time, and paper title), to identify the author. It computes the similarity of two appearances along each dimension, then trains a perceptron on a small number of expert-labeled examples to learn a suitable weight for each dimension and the corresponding threshold. The resulting model is then used for prediction. Experimental results show that the model has high usability.
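Authors Gao Ying (高迎), Cheng Taoyuan (程涛远)
Source Computer Engineering and Applications (《计算机工程与应用》), CSCD, Peking University Core Journal, 2008, No. 23, pp. 139-142, 150 (5 pages)
Funding National Natural Science Foundation of China (No. 60703007)
Keywords data integration; object recognition; context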
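The pipeline described in the abstract (per-dimension similarity of two appearances, then a perceptron that learns one weight per dimension plus a threshold) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `Appearance` record layout, the Jaccard and year-distance similarity measures, the `year_span` parameter, and the learning rate are all assumptions chosen for the example, since the record does not specify them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Appearance:
    """One occurrence of an author name with its context (hypothetical layout)."""
    coauthors: frozenset
    venues: frozenset
    year: int
    title: str


def jaccard(a, b):
    """Set overlap in [0, 1]; defined as 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_vector(x, y, year_span=10):
    """Four per-dimension similarities. The measures are assumptions,
    not the ones used in the paper."""
    title_x = frozenset(x.title.lower().split())
    title_y = frozenset(y.title.lower().split())
    year_sim = max(0.0, 1.0 - abs(x.year - y.year) / year_span)
    return [
        jaccard(x.coauthors, y.coauthors),
        jaccard(x.venues, y.venues),
        year_sim,
        jaccard(title_x, title_y),
    ]


def train_perceptron(samples, epochs=100, lr=0.1):
    """Classic perceptron rule on (vector, label) pairs, label 1 = same author.
    Returns the learned weights and the decision threshold."""
    weights = [0.0] * len(samples[0][0])
    threshold = 0.0
    for _ in range(epochs):
        errors = 0
        for vec, label in samples:
            score = sum(w * v for w, v in zip(weights, vec))
            predicted = 1 if score > threshold else 0
            delta = label - predicted
            if delta:
                errors += 1
                weights = [w + lr * delta * v for w, v in zip(weights, vec)]
                threshold -= lr * delta
        if errors == 0:  # every labeled example is classified correctly
            break
    return weights, threshold


def same_author(x, y, weights, threshold):
    """Predict whether two appearances refer to the same author."""
    score = sum(w * v for w, v in zip(weights, similarity_vector(x, y)))
    return score > threshold
```

In use, an expert would label a handful of appearance pairs, each pair would be turned into a vector with `similarity_vector`, and `train_perceptron` would be run on the result. The perceptron rule only converges when the labeled pairs are linearly separable, so a real dataset may need a cap on epochs (as here) or a different learner.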

References (8)

  • 1 On B W, Lee D W, Kang J W, et al. Comparative study of name disambiguation problem using a scalable blocking-based framework[C]//JCDL, 2005.
  • 2 Ananthakrishna R, Chaudhuri S, Ganti V. Eliminating fuzzy duplicates in data warehouses[C]//VLDB, 2002.
  • 3 Bilenko M, Mooney R, Cohen W, et al. Adaptive name-matching in information integration[J]. IEEE Intelligent Systems, 2003, 18(5): 16-23.
  • 4 Borkar V R, Deshmukh K, Sarawagi S. Automatic segmentation of text into structured records[C]//ACM SIGMOD, Santa Barbara, CA, 2001.
  • 5 Cohen W, Ravikumar P, Fienberg S. Comparison of string distance metrics for name-matching tasks[C]//IIWeb Workshop Held in Conjunction with IJCAI, 2003.
  • 6 Lee M, Hsu W, Kothari V. Cleaning the spurious links in data[C]//IEEE Intelligent Systems, 2004.
  • 7 Xi W X, Fox E A, Fan W G, et al. SimFusion: measuring similarity using unified relationship matrix[C]//Proc of SIGIR, 2005.
  • 8 Cheng T Y, Wang S. A Novel Approach to Clustering Merchandise Records[J]. Journal of Computer Science & Technology, 2007, 22(2): 228-231. Cited by: 3

