摘要
英语中的多音词分成两类,一是因词性不同而读音不同,一是因词义不同而读音不同。前者只需经词性标注,根据其词性标记就可判别其正确的读音。而后者则复杂得多,论文采用了一种基于WordNet语义信息的多音词消歧算法,该算法将多音词的语义信息与上下文中词的语义信息进行匹配,根据匹配结果来判别多音词的读音。
English homograph has two types,one is polyphonic because of different part of speech,another is polyphonic because of different senses.The disambiguation of the former is easy to be handled after part-of-speech tagging,while the disambiguation of the latter is more difficult.In this paper,a homograph disambiguation algorithm is proposed using WordNet.In this algorithm,the authors extract semantic words from taxonomy of homograph of its senses and context words,and then compare the two semantic sets.The pronunciation with the maximum score is selected.
出处
《计算机工程与应用》
CSCD
北大核心
2008年第26期138-140,共3页
Computer Engineering and Applications
关键词
多音词消歧
词义消歧
语音合成
homograph disambiguation
word sense disambiguation
speech synthesis