Design and Implementation of a Multi-Script Conversion Mechanism for Uyghur on the Internet (cited by: 1)

Research and implementation of converting mechanism of multiple characters Uyghur on the Internet
Abstract: In recent years, as Internet technology has developed and spread across Xinjiang, online channels such as WeChat, QQ, forums, and microblogs have gradually become the main means of everyday communication for people in the region. For historical and geographical reasons, Uyghur on online platforms exhibits a "one language, multiple scripts" character: the traditional Uyghur script (called the Old-Uyghur Alphabet in the published English abstract), the Latin Uyghur alphabet, and the Cyrillic Uyghur alphabet coexist. Because these scripts lack a well-founded correspondence standard and effective tools for converting between them, many problems arise in practice. These inconvenience Uyghur Internet users in their daily use of the Internet and hinder communication among the countries and residents along the "Belt and Road". This paper first examines the historical relationship among the traditional, Latin, and Cyrillic Uyghur scripts, the conversion rules, and the problems in the current correspondence standards for the three alphabets. In particular, it discusses deficiencies in the standard between traditional and Latin Uyghur and offers guidance for improving it. On that basis, it proposes Unicode character-encoding conversion algorithms among the three alphabets, with the aim of easing online written communication between Uyghurs in China and abroad and of enabling information retrieval in Latin and Cyrillic Uyghur on a Uyghur search engine. Experiments show that the proposed LUTC and CUTC conversion algorithms markedly improve character-encoding conversion efficiency, and that retrieval results for Latin and Cyrillic Uyghur are consistent with those for traditional Uyghur.
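As a rough illustration of what table-driven conversion between two of these scripts can look like, the Python sketch below maps Cyrillic Uyghur letters to Latin Uyghur and back. It is not the paper's LUTC or CUTC algorithm, which the abstract does not specify. The letter table is a small illustrative subset chosen for this example, and the names `CYR_TO_LAT`, `cyrillic_to_latin`, and `latin_to_cyrillic` are hypothetical. The reverse direction uses greedy longest-match so that Latin digraphs such as `gh` are tried before single letters.

```python
# Illustrative subset of a Cyrillic-Uyghur -> Latin-Uyghur letter table.
# Assumption: chosen for demonstration only; this is not the paper's table.
CYR_TO_LAT = {
    "а": "a", "ә": "e", "б": "b", "г": "g", "ғ": "gh", "д": "d",
    "й": "y", "к": "k", "қ": "q", "л": "l", "м": "m", "н": "n",
    "ң": "ng", "п": "p", "р": "r", "с": "s", "т": "t", "у": "u",
    "ч": "ch", "ш": "sh",
}

# Reverse table for the Latin -> Cyrillic direction.
LAT_TO_CYR = {lat: cyr for cyr, lat in CYR_TO_LAT.items()}
MAX_KEY = max(len(key) for key in LAT_TO_CYR)


def cyrillic_to_latin(text):
    """Map each Cyrillic letter to its Latin form; pass other characters through."""
    out = []
    for ch in text:
        mapped = CYR_TO_LAT.get(ch.lower())
        if mapped is None:
            out.append(ch)  # punctuation, digits, unmapped letters
        elif ch.isupper():
            out.append(mapped.capitalize())
        else:
            out.append(mapped)
    return "".join(out)


def latin_to_cyrillic(text):
    """Greedy longest-match conversion so digraphs win over single letters."""
    out = []
    i = 0
    while i < len(text):
        for size in range(MAX_KEY, 0, -1):
            chunk = text[i:i + size].lower()
            if len(chunk) == size and chunk in LAT_TO_CYR:
                cyr = LAT_TO_CYR[chunk]
                out.append(cyr.upper() if text[i].isupper() else cyr)
                i += size
                break
        else:
            out.append(text[i])  # no table entry: copy unchanged
            i += 1
    return "".join(out)


print(cyrillic_to_latin("мәктәп"))  # mektep
print(latin_to_cyrillic("Uyghur"))  # Уйғур
```

Greedy matching always reads `ng` as one letter, even where the word actually contains n followed by g. This is the kind of ambiguity a real correspondence standard has to resolve, so round-trip conversion with this sketch is not guaranteed in general.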
Authors: Yibulayin·WUSIMAN; ZHANG Shaowu; YU Kai (School of Computer Science and Engineering, Xinjiang University of Finance and Economics, Urumqi 830012, China; Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, Liaoning 116000, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2018, No. 19, pp. 114-121 (8 pages)
Funding: National Natural Science Foundation of China (No. 71561025); Xinjiang University of Finance and Economics Fund (No. 2014XYB006)
Keywords: one language, multiple characters; network communication; converting of multiple characters; Latin-Uyghur; Cyrillic-Uyghur