摘要
为了进行关键词的文本查重和文本检索,设计出基于matlab的文本处理系统。首先,研究文本处理系统的相关原理及技术;其次,设计系统的总体框架,细化功能;最后,采用matlab语言来设计系统,利用多个TXT文本构建语料数据库,设计出基于matlab的文本处理系统应用程序。测试表明:该系统能有效地实现文本查重和文本检索。
In order to better study the keyword search and text retrieval,design and implementation of text processing system based on MATLAB.Firstly,the research of the relevant principle and technology of text processing system;Secondly,the design of the overall framework of the system,detailed function;Finally,using MATLAB language to design the system,the use of multiple TXT text to build corpus database,design of text processing system applications based on matlab.The test results show that the system can effectively implement text check and text retrieval.
作者
费扬
杜庆治
FEI Yang;DU Qing-zhi(Faculty of information engineering and automation, Kunming University of Science and Technology, Kunming Yunnan 650504,China)
出处
《软件》
2017年第8期226-229,共4页
Software
基金
云南省科技厅资助项目(2014RA051)
关键词
MATLAB
文本处理
文本查重
文本检索
TF-IDF
MATLAB
Text processing
Text check
Text Retrieval
Term frequency-inverse document frequency