隐含语义索引技术在供求信息分类中的应用

Implementation of supply and demand information classification based on latent semantic indexing

下载PDF

导出

摘要介绍了一种信息抽取和自动分类的新应用,分析了传统分类方法的不足,介绍了一种基于隐含语义索引技术的文本分类改进方案。该技术是一新型的检索模型,它通过奇异值分解,或增强或消减词在文档中的语义影响力,使得文档之间的语义关系更为明晰,从而能容易地剔除掉那些语义关联弱的噪声数据,提高特征值提取精度和最后的分类准确度。 This paper presents a new implementation of information retrieval and automatic classification.In order to overcome the shortage of traditional methods,an improved classification based on latentsemantic indexing is introduced.LSI is a new retrieval model based on Singular Value Decomposition （SVD）.Using the algorithm,every term will be either strengthened or weakened. When the latent semantic becomes clearer,it is easy to cut off most of the noisy data at the very beginning.So the accuracy of classification will be improved.

作者朱学昊王儒敬

机构地区中国科学院合肥智能机械研究所

出处《计算机工程与应用》 CSCD 北大核心 2007年第14期192-194,共3页 Computer Engineering and Applications

基金国家高技术研究发展计划(863)(No.2003AA118070)~~

关键词隐含语义索引奇异值分解文本分类信息抽取 latent semantic indexing singular Implementation of supply and demand information classification based on latent semantic indexing value decomposition text classification information retrieval

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1戚涌,徐永红,刘凤玉.基于潜在语义标引的WEB文档自动分类[J].计算机工程与应用,2004,40(22):28-31. 被引量：9
2周文,龚礼明,蒋岚.隐含语义检索及中文样本分析实例[J].计算机应用,2004,24(S1):273-276. 被引量：5

二级参考文献12

1Chang C H,Hsu C C.Customizable multi-engine search tool with clustering[J].Computer Network and ISDN Systems,1997;29(8-13):1217～1224
2Mehran S,Salim Y,Michelle Q.Baldonado SONIA:A service for organizing network information autonomously[C].In:Proc of the 3rd ACM Conf on Digital Library,NY: ACM press, 1998: 200～209
3Salton G,McGill M.Introduction to Modem Information Filtering [D]. Massachusetts Inst Of Technology, 1994
4Deerwester S,Dumais S,Fumas G et al. Indexing by Latent Semantic Analysis[J].Joumal of the American Society of Information Science,1990:391～407
5Shih-Hung Wu,Pey-Ching Yang,Von-Wun Soo. An Assessment of Character-based Chinese News Filtering Using Latent Semantic Indexing[J].Computational Linguistics Society of R O C, 1998;3 (2):61～78
6Landauer T,Psotka J.Simulating text understanding for educational applications with Latent Semantic Analysis:Introduction to LSA[J].Interactive Learning Environments,2000;8(1 ): 1～14
7Zhong Jin,Jing-Yu Yang,Zhen-Min Tang et al.A theorem on the uncorrelated optimal discriminant vectors[J].Pattem Recognition,2001;34: 2041～2047
8Landauer TK,Dumais ST.Latent Semantic Analysis and The Measurement of Knowledge[].st Educational Testing Service Conference on Applications of Natural Language Processing in Assessment and Education.1994
9Nielsen J,Phillips VL,Dumais ST.Retrieving Imperfectly Recognized Handwritten Notes[].Behaviour and Information Technology.1994
10Berry M W,Dumais S T,Letsche T A.Computational methods for intelligent information. http://www.cs.utk.edu/berry/sc95/sc95.html . 1996

共引文献10

1张瑜,李景,孟宪学,苏晓路.网络标注的主要方法概述[J].图书情报工作,2008,52(1):20-22. 被引量：9
2周璨,刘琦婧,彭靖佳,韦俞军.基于聚类模型的论文分类检索系统的设计与实现[J].福建电脑,2008,24(6):17-18.
3张玉峰,蔡皎洁.基于数据挖掘的Web文本语义分析与标注研究[J].情报理论与实践,2010,33(2):85-88. 被引量：7
4陈立华.基于潜在语义分析的影响自然语言检索查准率指标因素的评述[J].现代情报,2010,30(3):26-28. 被引量：2
5张瑜.网络标注的主要方法[J].湖北第二师范学院学报,2010,27(2):114-116.
6王瑛.基于VSM的潜在语义索引[J].陕西科技大学学报（自然科学版）,2010,28(5):151-154. 被引量：1
7肖艳华,王青蓝,毕业莉,万发仁.隐含语义索引在吉林省农业知识问答系统中的应用[J].湖北农业科学,2011,50(13):2740-2742.
8胡泽文.基于WordNet和SUMO本体集成的自动语义检索及可视化模型[J].国家图书馆学刊,2012,21(2):23-32. 被引量：4
9郑伟青.基于本体集成的自动语义检索及可视化模型[J].情报科学,2013,31(5):77-83. 被引量：3
10蔡嘉诚.基于RANSAC潜在语义分析的专家库检索[J].电脑知识与技术（过刊）,2014,20(2X):1141-1143.

1张玉连,张敏,张波.一种扩展的向量空间模型-隐含语义索引模型研究[J].燕山大学学报,2006,30(1):87-90.
2王天江,叶卫国,卢正鼎,李永平.LSI和kNN相结合的文本分类模型研究[J].华中科技大学学报（自然科学版）,2004,32(4):59-60. 被引量：3
3王栋,吴军华.基于LSI和词典的文本语义相似度算法[J].煤炭技术,2010,29(12):217-218. 被引量：1
4魏保子,王儒敬.隐含语义索引在农业技术问答系统中的应用[J].微电子学与计算机,2008,25(7):48-51. 被引量：1
5梁栋,杨杰,卢进军,常宇畴.基于非负矩阵分解的隐含语义图像检索[J].上海交通大学学报,2006,40(5):787-790. 被引量：7
6徐建锁,王正欧.基于LSI和自组织神经网络的高效文本聚类方法[J].天津大学学报（自然科学与工程技术版）,2004,37(11):1026-1030. 被引量：7
7曹华梁,朱星,俞勇.适用于P2P的系统查询扩展优化方法[J].上海交通大学学报,2005,39(10):1706-1710. 被引量：5
8王春红.基于语义的中文信息检索技术分析与研究[J].现代计算机,2008,14(10):54-56.
9周水庚,关佶红,胡运发.隐含语义索引及其在中文文本处理中的应用研究[J].小型微型计算机系统,2001,22(2):239-243. 被引量：41
10肖艳华,王青蓝,毕业莉,万发仁.隐含语义索引在吉林省农业知识问答系统中的应用[J].湖北农业科学,2011,50(13):2740-2742.

计算机工程与应用

2007年第14期

浏览历史

内容加载中请稍等...

隐含语义索引技术在供求信息分类中的应用

参考文献2

二级参考文献12

共引文献10

相关作者

相关机构

相关主题

浏览历史