无指导学习语义优选被引量：1

UNSUPERVISED LEARNING OF SEMANTIC SELECTIONAL PREFERENCES

下载PDF

导出

摘要给出基于LSC模型的EM方法进行汉语语义优选的学习。具体步骤是首先随机为参数模型赋予初值;然后迭代运行EM算法,直到收敛;最后计算动词和名词的语义关联度,以此衡量其搭配的可能性。大量实验结果表明LSC模型能够较好地体现动、名词的搭配模式,且算法迭代收敛速度快。该方法无需语法标注的语料库,适合应用于汉语。 An Expectation-Maximisation（EM） algorithm based on latent semantic clustering（LSC） model is introduced for learning Chinese semantic selectional preferences.The specific procedure is as follows： First,the model parameters are designated their initial values randomly;secondly,EM algorithm is executed iteratively until convergence achieved;finally,the semantic association between verbs and nouns is calculated to measure their collocation possibility.Lots of experiment results show that LSC model is able to provide proper collocation patterns of verbs and nouns and the iterative convergence speed of the algorithm is fast as well.The method is suitable for Chinese as it does not need syntax-annotated corpora.

作者李东明张丽娟赵伟石晶

机构地区吉林农业大学信息技术学院长春工业大学计算机科学与工程学院

出处《计算机应用与软件》 CSCD 北大核心 2012年第1期155-158,216,共5页 Computer Applications and Software

基金吉林省科技发展计划项目青年基金(20100155) 吉林省科研发展计划科技支撑重点项目(20100214)

关键词语义优选潜在语义聚类无指导学习 Semantic selectional preferences Latent semantic clustering Unsupervised learning

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献21

1Diana Mccartby,Falmer East Sussex,Srirar Venkatapathy, et al.Joshi,Detecting compositionality of verb-ohjeet combinations using seleetonal preferences[C]//Proceedings of the Joint Conference on Empirical Methods in Natural Language Proces,ssing and Computational Natural Language Learning,2007.
2Diana McCarthy,John Carroll.Disambiguating Nouns,Verbs,and Adjectives Using Autonmtically Acquired Selectional Preferences[J].Computational Linguistics,December 2003,29(4):639-654.
3Wagner W,Schmid H,Schulte im Walde S.Verb Sense Disambiguation using a Predicate-Argument-Clustering Model[C]//Proceedings of the CogSci Workshop on Distributional Semantics beyond Concrete Concepts,Amsterdam,The Netherlands:23-28.
4Schulte im Walde S,Hying C,Scheible C,et al.Schmid.Combining EM Training and the MDL Principle for an Automatic Verb Classifiestion incorporating Selectional Preferences[C]//Proceedings of the 46thAnnual Meeting of the Association for Computational Linguistics,Columbus,OH.2008:496-504.
5Lin Sun,Anna Korhonen.Improving verb clustering with automatically acquired aelectional preferences[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing,2009,2.
6Zanzotto F M,Pennacchiotti M,Pazienza M T.Discovering Asymmetric Entailment Relations between Verbs using Selectional Preferences[C]//COLING/ACL-06,Sydney,Australia.2006:849-856.
7Zachary J Mason.Corret:a computational,corpus-based conventional metaphor extraction system[J].Comput.Linguist,2004,30 (1):23-44.
8Zapirain B,Agirre E,Marquez L,et al.Improving Semantic Role Classification with Selectional Preferences[C]//Proceedings of Human Language Technologies:The Annual Conference of the North American Cbapter of the Association for Computational Linguistics,2010.
9Andrew Clebume Young.The Effect of Selectional Preferences on Semantic Role Labeling[D].Undergraduate Honors Thesis,The University of Texas at Austin,2009.
10Erk,Katrin,Sebastian Padó,et al.A flexible,corpus-driven model of regular and inverse seleetional preferences[OL].Computational Linguistics,2010-10-14.http://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00017.

二级参考文献9

1董振东,董强....知网[EB/OL]. http://www.keenage.com/zhiwang/c_zhiwang.html,,(2000-10-25)[2006-10-07]..
2俞士汶.现代汉语短语结构知识库规格说明书.汉语语言与计算学报,2003,13(2):215-226.
3董振东,董强.关于知网-中文信息结构库[EB/OL].(2000-10-25)[2006-10-07] http://www.keenage.com/html/c_index.html.
4Han J,Kamber M.Data mining:concepts and techniques[M].San Francisco:Morgan Kaufmann Publishers,2001.
5Han J,Fu Y.Discovery of multiple-level association rules from large databases[C]//Proceedings of 21th International Conference on Very Large Data Bases.Zurich:Morgan Kaufmann Publishers,1995:420-431.
6Agrawal R,Imielinski T,Swami A.Mining association rules between sets of items in large databases[C]//Proceedings of the 1993 ACM-SIGMOD International Conference on Management of Data.Washington:ACM Press,1993:207-216.
7Aggarwal C C,Sun Z,Yu P S.Online generation of profile association rules[C]//Proceedings of the 14th International Conference on Knowledge Discovery and Data Mining.Florida:AAAI Press,1998:129-133.
8Aggarwal C C,Yu P S.A new approach to online generation of association rules[J].IEEE Transactions on Knowledge and Data Engineering,2001,13(4):527-540.
9欧阳为民,蔡庆生.大型数据库中多层关联规则的元模式制导发现[J].软件学报,1997,8(12):920-927. 被引量：7

共引文献14

1吴云芳,段慧明,俞士汶.动词对宾语的语义选择限制[J].语言文字应用,2005(2):121-128. 被引量：18
2周日安.“XY中国”的语义、功能与成因[J].语言文字应用,2006(3):76-82. 被引量：5
3吴纪梅.动词“坐”带处所宾语能力的历时发展考察[J].广西社会科学,2008(4):178-181. 被引量：1
4徐采霞,鲁素霞.“V就 V在 P”格式中V的语义特征及其认知解释[J].南昌大学学报（人文社会科学版）,2008,39(5):127-130. 被引量：2
5谢晓明,王宇波.管控动宾超常搭配的若干句法因素[J].语文研究,2009(2):29-33. 被引量：5
6张玉峰,胡凤,董坚峰.泛在知识环境中数据挖掘技术进展分析[J].情报学报,2010,29(2):202-207. 被引量：9
7王宇波.“句管控”下的短语组配原则[J].汉语学报,2010(4):79-87. 被引量：3
8张玉峰,何超.基于领域本体的语义文本挖掘研究[J].情报学报,2011,30(8):832-839. 被引量：16
9李东明,张丽娟,赵伟,石晶.基于MDL和LSC的语义优选方法[J].计算机工程,2011,37(17):15-18.
10胡云晚,于晓燕.“很+名”结构作状语和补语的对称与不对称[J].语言研究,2012,32(1):55-60. 被引量：3

同被引文献3

1蔡基刚.重视大学英语翻译教学提高学生英语应用能力[J].中国翻译,2003,24(1):65-68. 被引量：297
2白解红.语境与意义[J].外语与外语教学,2000(4):21-24. 被引量：85
3陈一民.歧义结构意义优选的理论分析[J].山西师大学报（社会科学版）,2005,32(4):109-112. 被引量：2

引证文献1

1胡红磊.让机器取代人工翻译——语义优选[J].文化创新比较研究,2017,1(4):96-97.

1李东明,张丽娟,赵伟,石晶.基于MDL和LSC的语义优选方法[J].计算机工程,2011,37(17):15-18.
2刘晓亮,李家滨.基于数据挖掘的网络入侵检测系统研究[J].计算机应用与软件,2009,26(4):253-256. 被引量：8
3李世奇,赵铁军,陈晨,刘鹏远.基于ART网络的无指导中文共指消解方法[J].高技术通讯,2009,19(9):926-932.
4朱佳贤.无指导学习环境下基于属性相关性分析和聚类算法的属性选择问题研究[J].管理学报,2005,2(S2):162-165. 被引量：2
5石晶,李万龙.汉语语义分析方法研究[J].计算机应用研究,2010,27(2):529-531. 被引量：4
6李旭,刘国华,张东明.一种改进的汉语全文无指导词义消歧方法[J].自动化学报,2010,36(1):184-187. 被引量：6
7陈凯,朱钰.机器学习及其相关算法综述[J].统计与信息论坛,2007,22(5):105-112. 被引量：82
8韩自豪.有指导的数据挖掘在心脏病风险评价中的应用[J].商情,2014(21):169-169.
9宋东奇,宋余庆,刘哲,凌青华.新型适用于基因表达数据的模型聚类方法[J].计算机与应用化学,2015,32(1):71-74.
10蒋卓人,陈燕,王永清.基于数据元语义树的概念语义相关度算法研究[J].大连海事大学学报,2012,38(4):59-63. 被引量：1

计算机应用与软件

2012年第1期

浏览历史

内容加载中请稍等...

无指导学习语义优选被引量：1

参考文献21

二级参考文献9

共引文献14

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

无指导学习语义优选 被引量：1

参考文献21

二级参考文献9

共引文献14

同被引文献3

引证文献1

相关作者

相关机构

相关主题

浏览历史

无指导学习语义优选被引量：1