Approach for Multiword Expression Identification in Natural Language Processing
Authors: Deepak Sharma, Prakash R. Devale, Akhil K. Khare. Computer Technology and Application, 2011, No. 8, pp. 663-666 (4 pages).
In this paper, the authors present an approach for extracting multiword expressions (MWEs) from monolingual corpora. The approach both generates and validates multiword candidates. It provides a list of candidates that are extracted and filtered according to a number of criteria and a set of standard statistical association measures. Candidate generation is based on surface forms, while validation consists of a series of criteria for removing noise using language-independent association measures. To generate corpus counts, it provides a corpus indexation facility. The approach also allows easy integration with a machine learning tool for the creation and application of supervised multiword extraction models when annotated data is available. The authors demonstrate the approach in a standard configuration, extracting MWEs from a general-purpose English corpus.
Keywords: multiword candidates; association measures; surface forms; monolingual corpora.
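The generate-then-validate pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that candidates are adjacent word pairs (bigrams) taken from whitespace-tokenized surface forms, that the noise filter is a simple frequency threshold, and that the association measure is pointwise mutual information (PMI); the abstract names none of these specifics. The function name `extract_bigram_candidates` and the parameter `min_count` are hypothetical.

```python
import math
from collections import Counter


def extract_bigram_candidates(sentences, min_count=2):
    """Generate bigram candidates from surface forms, filter, and rank by PMI.

    Assumptions (not from the paper): whitespace tokenization, bigram
    candidates only, a frequency threshold as the noise filter, and PMI
    as the association measure.
    """
    unigrams = Counter()
    bigrams = Counter()

    # Corpus counts: a tiny in-memory stand-in for a corpus index.
    for sentence in sentences:
        tokens = sentence.lower().split()
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))

    n_tokens = sum(unigrams.values())
    n_bigrams = sum(bigrams.values())

    scored = []
    for (w1, w2), count in bigrams.items():
        # Validation step 1: drop rare candidates as noise.
        if count < min_count:
            continue
        # Validation step 2: score with a language-independent measure.
        p_xy = count / n_bigrams
        p_x = unigrams[w1] / n_tokens
        p_y = unigrams[w2] / n_tokens
        pmi = math.log2(p_xy / (p_x * p_y))
        scored.append(((w1, w2), count, pmi))

    # Highest association first.
    scored.sort(key=lambda item: item[2], reverse=True)
    return scored


if __name__ == "__main__":
    corpus = [
        "new york is a big city",
        "she moved to new york",
        "a big dog",
        "the city is new",
    ]
    for pair, count, pmi in extract_bigram_candidates(corpus):
        print(pair, count, round(pmi, 3))
```

A frequency threshold combined with PMI is only one possible configuration. PMI is known to favor rare pairs, which is why the threshold is applied before scoring. Other measures, such as the t-score or the Dice coefficient, could be swapped in at the scoring step.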