摘要
当前国内对于文本可视化的研究还停留在初级阶段,存在着许多方法处理文本语料库。随着科学技术的不断发展,网络变得越来越普及,人们可以从网络上获得大量的文本资料信息,本文主要针对如何获取序列化、规范化的汉语的语料库提出了一种新的框架。
The current domestic for text visualization research still stays in the primary stage, there are many ways to deal with text corpus. With the continuous development of science and technology, network has become more and more popular. We can get a lot of text information from the Internet, this paper focusedon how to obtain the serialization and standardization of the corpus of Chinese to propose a new framework.
作者
孙温稳
Sun Wenwen(Information Science & Technology College9Zhengzhou Normal University,Zhengzhou Henan 450044)
出处
《河南科技》
2016年第11期19-20,共2页
Henan Science and Technology