期刊文献+

基于词元语义特征的汉语框架排歧研究 被引量:7

Chinese Frame Disambiguation Based on the Semantic Feature of Lexical Units
下载PDF
导出
摘要 框架排歧指的是在一个给定的句子中,判断句中目标词激起的语义场景与该目标词可能激起的哪个框架一致,则将该框架分配给当前的目标词。框架排歧最重要的一个步骤就是特征选择,目前常用的方法是人工特征选择方法,但是这种方法不能有效地利用每个目标词的语义特征,而且大量实验表明,不同的目标词取得最好的结果时所用的特征模板是不同的。因此,该文为每个目标词设置一个特征模板,并提出了特征模板的自动选择算法,首先从语料中抽取特征构成特征集,然后利用打分机制,把特征集中得分最高的特征逐个加入到特征模板中,直到相邻两次的得分不再增加。该文借助汉语框架网语义资源,利用最大熵模型建模,使用自动特征选择算法选出特征模板,并进行5-fold交叉验证,平均精确率可达到84.46%。 Frame disambiguation aims to assign appropriate frame for the target words, according to the consistency between semantic scene and the candidate frame evoked by the target words. The key step of frame disambiguation is the feature selection, which is currently a manual process. However, this manual method doesn't effectively use the semantic feature of each target word. In addition, it is proved that the feature templates are different when the target words achieve best results. Hence, this paper proposes an automatic feature template algorithm to set a fea- ture template for each target word. First, feature sets are composed of features from the corpus; Then the feature a- chieved the highest score is added to the feature template until the adjacent two score no longer increases. The paper applies a maximum entropy model to Chinese FrameNet corpus, examining the automatic feature selection algorithm by 5-fold cross validation, and achieves an average precision of 84.46~.
出处 《中文信息学报》 CSCD 北大核心 2013年第4期44-51,共8页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目(60970053) 国家语委"十二五"科研规划资助项目(YB125-19) 山西省国际科技合作资助项目(2010081044) 山西省自然科学基金资助项目(20090110594)
关键词 框架排歧 汉语框架网语义资源 自动特征选择 词元语义特征 Chinese frame disambiguation Chinese FrameNet automatic feature selection semantic feature of lexi-cal units
  • 相关文献

参考文献22

二级参考文献160

共引文献228

同被引文献42

  • 1王德春,张辉.认知语言学研究现状[J].外语研究,2001,18(3):1-10. 被引量:83
  • 2刘开瑛,由丽萍.汉语框架语义知识库构建工程[C].中国中文信息学会成立二十五周年学术会议论文集,2006,11:64-71.
  • 3郝晓燕,刘伟,李茹,刘开瑛.汉语框架语义知识库及软件描述体系[J].中文信息学报,2007,21(5):96-100. 被引量:51
  • 4Fillmore C.Frame semantics[J] .Linguistics in the morning calm,1982:111-137.
  • 5刘开瑛.汉语框架语义网(CFN)构建现状[C] .第四届全国学生计算语言学研讨会会议论文集.2008:1-7.
  • 6Collin F Baker,Charles J Fillmore,John B Lowe.The Berkeley FrameNet project[C] //Proceedings of COL-ING/ACL.Montreal,Canada.1998:86-90.
  • 7Baker C,Ellsworth M,Erk K.SemEval'07 task 19:frame semantic structure extraction[C] //Proceedings of the 4th International Workshop on Semantic Evaluations.Association for Computational Linguistics,2007:99-104.
  • 8Bejan C A,Hathaway C.UTD-SRL:a pipeline architecture for extracting frame semantic structures[C] //Proceedings of the 4th International Workshop on Semantic Evaluations.Association for Computational Linguistics,2007:460-463.
  • 9Johansson R,Nugues P.LTH:semantic structure extraction using nonprojective dependency trees[C] //Proceedings of the 4th International Workshop on Semantic Evaluations.Association for Computational Linguistics,2007:227-230.
  • 10Dipanjan Das,Desai Chen,André F.T.Martins,etc.Frame-semantic parsing[J] .Computational Linguistics,2013,40(1):9-56.

引证文献7

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部