期刊文献+

面向信息处理的少数民族语料库构建分析 被引量:2

Analysis of the construction of minority corpus oriented to information processing
下载PDF
导出
摘要 语料库是一切自然语言处理的基础,尤其是在机器翻译、语音识别等应用的大趋势下,构建高质量、大规模、标准化的语料库尤为重要。民族语料库构建工作自20世纪八九十年代起,到目前已取得众多成果。文章主要对我国民族语料库的建设现状及相关研究进行介绍与评价,重点分析蒙语、维语、藏语语料库研究工作,并在此基础上,针对民族语料库构建存在的问题提几点建议,以期为其他少数民族构建民族语料库提供借鉴与参考。 The corpus is the basis of natural language processing, especially in the trend of applications such as machine translation and speech recognition. It is important to build high quality, massive, standardized corpus. Since the 1980 s and 1990 s, the construction of the national corpus has achieved many achievements. This paper analysis the research status of the national corpus, focusing on the Mongolian, Uyghur and Tibetan corpus. And then, this paper puts forward some suggestions for the problems existing in the construction of national corpus, so as to provide reference for other ethnic minorities to build national corpus.
作者 费德莲 袁凌云 权朝臣 Fei Delian;Yuan Lingyun;Quan Chaochen(Yunnan Normal University,Kunming 650500,China)
机构地区 云南师范大学
出处 《无线互联科技》 2019年第19期77-79,共3页 Wireless Internet Technology
关键词 少数民族语 语料库构建 蒙语 维语 藏语 minority nationality language corpus construction Mongolian Uyghur Tibetan
  • 相关文献

参考文献4

二级参考文献21

  • 1玉素甫.艾白都拉,阿布都热依木.沙力.现代维语语料库的词类标注研究[J].民族语文,2005(4):63-66. 被引量:7
  • 2Greene, Barbara B, Rubin Geral M. Automated Grammatical Tagging of English, Brown University, 1971.
  • 3Kucera H, Francis W Nelson. Frequency Analysis of English Usage: Lexicon and Grammar, Houghton-Mifflin Company, Boston, 1982.
  • 4MarshaU Jan. Choice of Grammatical Word-Class Without Global Syntactic Analysis[J]. Computers in the Hmnanities, 1983, 17: 139-150.
  • 5Shannon C. The Mathematical Theory of Communication[J]. Bell Sustem Technical Journal, 1948, 27: 398-403.
  • 6刘开瑛,郑家恒,赵军.语料库词类自动标注方法算法研究[M].机器翻译研究进展,1992,378-386.
  • 7哈米提·铁木尔.现代维吾尔语语法.形态学[M].北京:民族出版社,1987.
  • 8Chafe Wallace L. Meaning and Structure of Language[M]. Chicago, The University of Chicago Press, 1970, 97.
  • 9哈米提·铁木尔.现代维吾尔语语法·形态学[M].北京:民族出版社,1987.
  • 10刘开瑛 郑家恒 赵军.语料库词类自动标注方法算法研究.机器翻译研究进展,1992,:378-386.

共引文献29

同被引文献24

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部