期刊文献+

基于关联规则挖掘的汉语语义搭配规则获取方法 被引量:5

Automatic Acquisition of Chinese Semantic Collocation Rules Based on Association Rule Mining Technique
下载PDF
导出
摘要 针对自然语言处理系统在短语分析时的词汇排歧和结构排歧需要,本文提出了一种基于语料库的汉语短语语义搭配规则自动获取方法.该方法以《知网》为语义知识资源,在标注了句法语义信息的汉语短语熟语料库基础上,先采用数据挖掘中元规则制导的交叉层关联规则挖掘方法,自动发现汉语短语的语义搭配规律,再根据统计结果自动优选后生成语义搭配规则库.实验结果表明该方法是切实可行的.运用该方法自动获取的语义搭配规则具有较好的排歧效果. The semantic collocations play important roles in parsing Chinese phrases. It is useful for both semantic disambiguation and structural disambiguation. In this paper,a corpus-based method was proposed to automatically acquire semantic collocation rules from a Chinese phrase corpus,which was annotated with semantic knowledge according to HowNet. Moreover,a metarule-guided algorithm for mining cross-level association rules was developed to acquire semantic collocation rules from the corpus. And an optimized algorithm was developed to filter these rules. The experiment results showed the effectiveness of the proposed method. Disambiguation performance of the automatically acquired rules was quiet well.
出处 《厦门大学学报(自然科学版)》 CAS CSCD 北大核心 2007年第3期331-336,共6页 Journal of Xiamen University:Natural Science
基金 国家自然科学基金(60373080)资助
关键词 语义规则 语料库 关联规则 知网 semantic rules corpus association rules HowNet
  • 相关文献

参考文献9

  • 1董振东,董强....知网[EB/OL]. http://www.keenage.com/zhiwang/c_zhiwang.html,,(2000-10-25)[2006-10-07]..
  • 2俞士汶.现代汉语短语结构知识库规格说明书.汉语语言与计算学报,2003,13(2):215-226.
  • 3董振东,董强.关于知网-中文信息结构库[EB/OL].(2000-10-25)[2006-10-07] http://www.keenage.com/html/c_index.html.
  • 4Han J,Kamber M.Data mining:concepts and techniques[M].San Francisco:Morgan Kaufmann Publishers,2001.
  • 5Han J,Fu Y.Discovery of multiple-level association rules from large databases[C]//Proceedings of 21th International Conference on Very Large Data Bases.Zurich:Morgan Kaufmann Publishers,1995:420-431.
  • 6欧阳为民,蔡庆生.大型数据库中多层关联规则的元模式制导发现[J].软件学报,1997,8(12):920-927. 被引量:7
  • 7Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large databases[C]//Proceedings of the 1993 ACM-SIGMOD International Conference on Management of Data.Washington:ACM Press,1993:207-216.
  • 8Aggarwal C C,Sun Z,Yu P S.Online generation of profile association rules[C]//Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining.Florida:AAAI Press,1998:129-133.
  • 9Aggarwal C C,Yu P S.A new approach to online generation of association rules[J].IEEE Transactions on Knowledge and Data Engineering,2001,13(4):527-540.

二级参考文献6

  • 1Han J,Proc 1996 Int’l Conf on Data Mining and Knowledge Discovery,1996年
  • 2Han J,Proc 2th VLDB Conf Zurich,1995年
  • 3Shen W,Advances in Knowledge Discovery and Data Mining,1995年
  • 4Han J,AAAI’94 Workshop on Knowledge Discovery in Databases,1994年
  • 5Han J,IEEE Trans Knowl Data Eng,1993年,5期,29页
  • 6欧阳为民,蔡庆生.在数据库中自动发现广义序贯模式[J].软件学报,1997,8(11):864-870. 被引量:12

共引文献10

同被引文献70

  • 1谌志群,张国煊.文本挖掘与中文文本挖掘模型研究[J].情报科学,2007,25(7):1046-1051. 被引量:50
  • 2Zhang Yan-qing,Shteynberg M,Prasad S K,et al.Granular fuzzy Web intelligence techniques for profitable data mining[C]∥The 12th IEEE International Conference on Fuzzy Systems (FUZZ 03),May 2003:1462-1464.
  • 3Wang Lipo,Fu Xiuju.Data Mining with Computational Intelligence[M].Berlin,Heidelberg:Springer-Verlag,2005.
  • 4Song D,Bruza P D,Huang Z,et al.Classifying document titles based on information inference[C]//Proceedings of the 14th International Symposium on Methodologies for Intelligent Systems.Japan.Berlin,Heidelberg:Springer,2003:297-306.
  • 5Zelikovitz Sarah.Transductive LSI for short text classification problems[C]// Proeeedings of the 17th International FLAIRS Conference.Miami:AAAI Press,2004.
  • 6Selvi P,Gopalan N P.Sentence similarity computation based on WordNet and corpus statistics[C]//International Conference on Computational Intelligence and Multimedia Applications,13-15 Dec.2007,Sivakasi,Tamil Nadu.Washington,DC:IEEE Computer Society,2007,1:9-14.
  • 7Sarnovsky M,Paralic M.Text mining workflows construction with support of ontologies[C]//Proceedings of the 6th International Symposium on Applied Machine Intelligence and Informatics,SAMI'08,January 21-22,2008,Herlany,Slovakia.Hungary:Budapest Polytechnic,2008:173-177.
  • 8陈骏.基于语义网的文本信息分类技术研究[D].南京:南京理工大学,2007.
  • 9Jung Jason J,Jo Geun-Sik.Semantic analysis for data preparation of Web usage mining[C]//Proceedings of the 17th International Conference on Innovations in Aplied Atificial Itelligence,Ottawa,Canada.Berlin,Heidelberg:Springer,2004:1249-1258.
  • 10Therrien C W.Decision,Estimation,and Classification:An Introduction to Pattern Recognition and Related Topics[M].New York:John Wiley & Sons,Inc.1989.

引证文献5

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部