Abstract
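This paper introduces VXY, one of a series of HowNet-based tools for disambiguating Chinese structures. VXY adopts a distinctive disambiguation technique that applies a "targeted elimination" strategy to specific linguistic difficulties. It resolves structural ambiguity of the type "V + N + 的 + N". VXY is a self-contained system that can be tested and verified on the spot and put to real practical use, rather than a mere showcase of some methodology or an illustrative "game". The paper briefly describes the components of VXY and explains the principles of its meaning computation. It also discusses how to use HowNet more effectively for structural and semantic disambiguation, and how to open up language technologies that differ from the "trilogy" of current language information processing (corpus annotation, ready-made computation, and exam-oriented evaluation).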
The paper introduces a HowNet-based disambiguator named VXY. The disambiguator effectively tackles ambiguity in syntactic structures such as "削 (V) 苹果 (X) 的 皮 (Y)", which occur with high frequency in Chinese. The ambiguity lies in which word V governs in the structure: X or Y. VXY is not merely a demonstration of a stereotypical methodology or algorithm, but a practical tool for any structure composed of any of the 98,000 unique entries in the HowNet Chinese vocabulary. Hence, the paper presents a paradigm completely different from state-of-the-art human language technology.
Source
《中文信息学报》
CSCD
Peking University Core Journals
2010, No. 1, pp. 60-64 (5 pages)
Journal of Chinese Information Processing
Keywords
计算机应用
中文信息处理
语义
排歧工具
强支配
中文句法结构
知网
computer application
Chinese information processing
semantics
disambiguator
strong government
Chinese syntactic structure
HowNet