Following the expanding of VSM and LSI, a text classification based on Concept Space is proposed in thispaper. Information gaining is applied to acquire concepts based on large training set. Concept Space is built by ...Following the expanding of VSM and LSI, a text classification based on Concept Space is proposed in thispaper. Information gaining is applied to acquire concepts based on large training set. Concept Space is built by acquir-ing latent semantic indexing data, building a latent semantic space by LSI, and then adding the class-basis vector. Thecalculating method of the word-similarity, the text-similarity, the similarity of the text vector and the class-basis vec-tor in Concept Space are presented. Experiment results show the Concept Space method is superior to Vector SpaceModel. This paper also discusses the future work the problem of concept space learning.展开更多
文摘Following the expanding of VSM and LSI, a text classification based on Concept Space is proposed in thispaper. Information gaining is applied to acquire concepts based on large training set. Concept Space is built by acquir-ing latent semantic indexing data, building a latent semantic space by LSI, and then adding the class-basis vector. Thecalculating method of the word-similarity, the text-similarity, the similarity of the text vector and the class-basis vec-tor in Concept Space are presented. Experiment results show the Concept Space method is superior to Vector SpaceModel. This paper also discusses the future work the problem of concept space learning.