期刊文献+

基于级连神经网络和SVD的文本分类新模型 被引量:1

New text categorization model based on cascade neural network and SVD
下载PDF
导出
摘要 提出了一个基于级连神经网络(Cascade-Correlation Neural Network,CCNN)和SVD(Singular Value Decomposition)的文本分类新模型。该神经网络用级连相关算法来训练网络。大部分的文本分类系统用向量空间模型(Vector Space Model,VSM)来表现文档,然而这种方法需要很高的维度,并且考虑不到文本特征词间的语义隐含信息,因此分类效果不是太理想。引入SVD来学习和表现文本特征词,在降低特征维度的基础上,将文本特征的隐含信息表现出来。实验证明,在加快训练速度的基础上,提高了分类的精度。 A new text categorization model based on cascade neural network and Singular Value Decomposition(SVD) is proposed.The neural network is trained by the cascade-correlation algorithm.Most classic classification systems represent the contents of documents with a set of index terms,it has been known as Vector Space Model(VSM).However, this method needs a high dimensional space to represent the documents, and it does not take into account the semantic relationship between terms,which can lead to poor classification performance.In this paper, SVD is used to learn and represent relations among very large numbers of words and very large numbers of natural text passages in which they occurred.It can not only greatly reduce the dimensional but also discover the important associative relationships between terms.The experiments show that it also helps to accelerate the training speed and improves the classification accuracy.
作者 王燕霞 邓伟
出处 《计算机工程与应用》 CSCD 北大核心 2010年第26期102-104,共3页 Computer Engineering and Applications
基金 国家自然科学基金No.60572074~~
关键词 奇异值分解 神经网络 文本分类 BP算法 级联相关算法 Singular Value Decomposition(SVD) neural network text categorization BP algorithm cascade-correlation algorithm
  • 相关文献

参考文献3

  • 1Park S C.Neural network for text classification based on singular value decomposition[C]//Seventh International Conference on Copmuter and Information Technology, 2007:47-52.
  • 2Yu B.Latent semantic analysis for text categorization using neural network[J].Knowl Based Syst,2008.
  • 3Tan S.A novel refinement approach for text categorization[C]// ACM CIKM 2005,2005.

同被引文献8

  • 1Krn Jingu.Toward faster nonnegative matrix factorization: a new lgorithrn and comparisons[C]//Eighth IEEE International Confer- nce on Data Mining.Pisa, Italy: IEEE, 2008 : 353-362.
  • 2Paul S,Du Shuyan,Parra L.Recovery of constituent spectra us- ing non-negative matrix factorization[C]//Proc of SPIE' 03,2003 : 321-331.
  • 3Koren Y, Carmel L.Robust linear dimensionality reduction[J].IEEE Transactions on Visualization and Computer Graphics,2004, 10 (4) :459-470.
  • 4Deerwester S,Dumais S T, Landauer T K.Indexing by latent se- mantic analysis[J].Joumal of the Society for Information Science, 1990,41(6) :391-407.
  • 5Weston J,Mukhelee S, Chapelle O.Feature selection for SVM[J]. Advances in Neural Information Processing Systems, 2001,21 (8) : 160-162.
  • 6贺一帆,江铭虎.网络文本分类中基于信息瓶颈的特征提取[J].清华大学学报(自然科学版),2010,50(1):45-48. 被引量:6
  • 7刘建,孙鹏,倪宏.面向分类的网页主题特征提取[J].计算机应用研究,2010,27(9):3399-3402. 被引量:2
  • 8黄钢石,张亚非,陆建江,徐宝文.一种受限非负矩阵分解方法[J].东南大学学报(自然科学版),2004,34(2):189-193. 被引量:10

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部