摘要
提出了一种利用核函数改进向量空间的新模型:核向量空间模型.该模型利用Mercer核,把输入空间的样本映射到高维特征空间,在高维特征空间中按向量空间模型操作,然后用核向量空间模型实现专利分类.理论分析及在专利分类中的实验表明,所提出的模型比经典向量空间模型有更高的正确分类率.
A novel model, namely, kernel vector space model, is established by using the kernel function to improve the vector space. In this model, the Mercer kernel is used to map the data in the original space to a high-dimenslonal feature space in which data can be identified as in the vector space. The proposed model is then applied to the to the patent categorization, with the theoretical and experimental results indicating its greater correctness than the traditional vector space model.
出处
《华南理工大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2005年第8期58-61,共4页
Journal of South China University of Technology(Natural Science Edition)
基金
国家自然科学基金资助项目(60003019)
华南理工大学高水平大学建设项目(159-D65010)
关键词
文本分类
向量空间模型
核函数
text categorization
vector space model
kernel function