Chinese spoken language understanding in SHTQS

Chinese spoken language understanding in SHTQS

下载PDF

导出

摘要 Spoken dialogue systems are an active research field with wide applications. But the differences in the Chinese spoken dialogue system are not as distinct as that of English. In Chinese spoken dialogues, there are many language phenomena. Firstly, most utterances are ill-formed. Secondly, ellipsis, anaphora and negation are also widely used in Chinese spoken dialogue. Determining how to extract semantic information from incomplete sentences and resolve negation, anaphora and ellipsis is crucial. SHTQS (Shanghai Transportation Query System) is an intelligent telephone-based spoken dialogue system providing information about the best route between any two sites in Shanghai. After a brief description of the system, the natural language processing is emphasized. Speech recognition sentences unavoidably contain errors. In language sequence processing procedures, these errors can be easily passed to the later parts and take on a ripple effect. To detect and recover these from errors as early as possible, language-processing strategies are specially considered. For errors resulting from divided words in speech recognition, segmentation and POS Tagging approaches that can rectify these errors are designed. Since most of the inquiry utterances are ill-formed and negation, anaphora and ellipsis are common language phenomena, the language understanding must be adequately adaptive. So, a partial syntactic parsing scheme is adopted and a chart algorithm is used. The parser is based on unification grammar. The semantic frame that extracts from the best arc set of the chart is used to represent the meaning of sentences. The negation, anaphora and ellipsis are also analyzed and corresponding processing approaches are presented. The accuracy of the language processing part is 88.39% and the testing result shows that the language processing strategies are rational and effective. Spoken dialogue systems are an active research field with wide applications. But the differences in the Chinese spoken dialogue system are not as distinct as that of English. In Chinese spoken dialogues, there are many language phenomena. Firstly, most utterances are ill-formed. Secondly, ellipsis, anaphora and negation are also widely used in Chinese spoken dialogue. Determining how to extract semantic information from incomplete sentences and resolve negation, anaphora and ellipsis is crucial. SHTQS (Shanghai Transportation Query System) is an intelligent telephone-based spoken dialogue system providing information about the best route between any two sites in Shanghai. After a brief description of the system, the natural language processing is emphasized. Speech recognition sentences unavoidably contain errors. In language sequence processing procedures, these errors can be easily passed to the later parts and take on a ripple effect. To detect and recover these from errors as early as possible, language-processing strategies are specially considered. For errors resulting from divided words in speech recognition, segmentation and POS Tagging approaches that can rectify these errors are designed. Since most of the inquiry utterances are ill-formed and negation, anaphora and ellipsis are common language phenomena, the language understanding must be adequately adaptive. So, a partial syntactic parsing scheme is adopted and a chart algorithm is used. The parser is based on unification grammar. The semantic frame that extracts from the best arc set of the chart is used to represent the meaning of sentences. The negation, anaphora and ellipsis are also analyzed and corresponding processing approaches are presented. The accuracy of the language processing part is 88.39% and the testing result shows that the language processing strategies are rational and effective.

作者毛家菊郭荣陆汝占

机构地区 Dept. of Computer Science and Engineering

出处《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2005年第2期225-230,共6页 哈尔滨工业大学学报（英文版）

基金 SponsoredbyShanghaiMunicipalScienceandTechnologyCommittee(SMSTC) (GrantNo. 025115038).

关键词 spoken dialogue system natural language understanding syntactic parsing 汉语口语对话系统语言理解语句分析

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献7

1KONOY.Agenericframeworkforspokendialoguesystemsanditsapplicationtoacarnavigationtask[].IEEE/IEEJJSAIInternationalConferenceonIntelligentTransportationSystems.1999
2HUANGYinfeu.Languageunderstandingcomponentinchinesedialoguesytem[].ProceedingsofICSLP’’.2000
3MAOJiaju.FormalInterpretationofChinese"that"inSituation[].JournalofShanghaiJiaotongUniversity.2003
4LJILJANA S.Interactive dialogue telephone service[].IEEE/IEEJ JSAI th Mediterranean Electrotechnical Conference.2000
5Seneff S.TINA: A natural language system for spoken language applications[].Computational Linguistics.1992
6ZUE V.JUPITER: A telephone-based conversational interface for weather information[].IEEE Transactions on Speech and Audio Processing.2000
7MAO Jiaju.Analyzing V +Adj in situation semantics[].Lecture Notes in Computer Science.2003

1罗鑫.基于.NET技术在Excel报表分析中的应用[J].电脑编程技巧与维护,2014(16):56-58. 被引量：1
2朱文娟,张庆.SQL数据库安全研究及语句分析[J].信息与电脑（理论版）,2009(11):114-114. 被引量：2
3袁里驰.Improved hidden Markov model for speech recognition and POS tagging[J].Journal of Central South University,2012,19(2):511-516. 被引量：4
4唐金文.C语言中的自增和自减语句分析[J].曲靖师范学院学报,2002,21(3):96-97. 被引量：1
5冯茜芦,潘金贵.一种基于句子的信息检索模型研究[J].计算机应用与软件,2010,27(3):162-164.
6王秀艳,康建平.从语句处理谈Oracle数据库的性能优化[J].邯郸职业技术学院学报,2007,20(3):65-68.
7林冠西,吴怀宇,陈洋.基于椭圆建模和NLP算法的移动机器人路径规划研究[J].科学技术与工程,2014,22(23):81-86. 被引量：1
8李欣然,赵山林.C语言程序设计中break语句分析[J].计算机时代,2013(12):48-49.
9LIU Yan-chun.Anaphora＇s Creating Irony by the Aid of Context[J].Sino-US English Teaching,2015,12(9):647-653.
10花的神明.网络收藏发布高手——MetaProducts Inquiry[J].电脑迷,2005,0(10):69-69.

Journal of Harbin Institute of Technology(New Series)

2005年第2期

浏览历史

内容加载中请稍等...

Chinese spoken language understanding in SHTQS

参考文献7

相关作者

相关机构

相关主题

浏览历史