内嵌空间排序支持向量机及其在文本检索中的应用被引量：1

Applications of Embedded Space Ranking SVM to Document Retrieval

下载PDF

导出

摘要针对文本检索中的特征提取和分类问题,提出一种基于内嵌空间支持向量机的特征选择和排序学习方法.与多分类特征选择问题中常用的组合方法不同,本文提出的方法能将一个有序分类问题转化为一个两分类问题,从整体上选择最有效的特征.同时与已有的Ranking SVM相比,该方法在转换过程中学习样本的数量只有线性级的增长,从而大大提高了检索的速度.在人工数据集和标准的文本分类数据集上的实验结果表明,本文所提出的方法能较好地解决文本检索中的特征选择和排序问题. For feature extraction and classification in text retrieval,a feature selection and sorting learning method based on embedded space support vector machine is proposed.Unlike combination methods commonly used in multi-classification feature selection,the proposed method can transform an ordered classification into a two-classification problem,then choose the most effective feature from the whole.At the same time,comparing with the existing Ranking SVM,the learning samples number of the proposed method just has a linear level increasing during the conversion process,and the retrieval speed is greatly improved.The experimental results on both artificial and standard data sets show that the proposed method can better solve the feature selection and sorting problem in text retrieval.

作者周绮凤杨小青洪文财邵桂芳

机构地区厦门大学信息科学与技术学院

出处《信息与控制》 CSCD 北大核心 2010年第5期629-634,共6页 Information and Control

基金福建省自然科学基金资助项目(2009J05153)

关键词排序学习支持向量机文本检索特征选择 learning to rank support vector machine text retrieval feature selection

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献13

1Robertson S, Walker S, Hancock-Beaulieu M, et al. Okapi at TREC-3[C]//Proceedings of Text REtrieval Conference. Gaithersburg, MD, USA: National Institute of Standards and Technology Special Publication, 1994: 109-126.
2Ponte J M, Croft W B. A language modeling approach to information retrieval[C]//Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NJ, USA: ACM, 1998: 275 -281.
3Cooper W S, Gey F C, Dabney D E Probabilistic retrieval based on staged logistic regression[C]//Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NJ, USA: ACM, 1992: 198-210.
4Nallapati R. Discriminative models for information retrieval[C]//Proceedings of the 27th annual international ACM SIGIR conference on Research and development in Information Retrieval. New York, NJ, USA: ACM, 2004: 64-71.
5Herbrich R, Graepel T, Obermayer K. Large margin rank boundaries for ordinal regiession[M]//Smola A, Bartlett P, Scholkopf B, et al. Advances in Large Margin Classifiers. Cambridge,MA, USA: MIT Press, 2000: 115-132.
6张学工.统计学习理论的本质[M].北京:清华大学出版社,1999.
7Kramer S, Widmer G, Pfahringer B, et al. Prediction of ordinal classes using regression trees[J]. Fundamenta Informaticae, 2001, 47(1/2): 1-13.
8Cao Y B, Xu J, Liu T Y, et al. Adapting ranking SVM to document retrieval[C]//Proceedings of the 29th Annual ACM SIGIR Conference. New York, NJ, USA: ACM, 2006: 186-193.
9Rajaram S, Garg A, Zhou X S, et al. Classification approach towards ranking and sorting problems[M]// Lecture Notes in Computer Science (vol.2837). Berlin, Germany: Springer, 2003: 301-312.
10Guyon I, Weston J, Barnhill S, et al. Gene selection for cancer classification using support vector machines[J]. Machine Learning, 2002, 46(1/2/3): 389-422.

共引文献2

1黄金杰,常英丽.基于支持向量机和正交设计的特征选择方法[J].计算机工程与应用,2008,44(17):135-137. 被引量：1
2解洪胜,孙龙梅,王连国.利用SVM和Matlab开发图像内容检索系统的方法[J].云南民族大学学报（自然科学版）,2010,19(3):207-210. 被引量：2

同被引文献12

1孙笑明,崔文田,林军.一种网络展现文献检索结果的理论模型[J].情报学报,2011,30(2):146-154. 被引量：4
2瞿亮,杨贯中,李琦.基于本体的专业文献检索[J].计算技术与自动化,2007,26(1):84-86. 被引量：2
3魏小娟,杨婧,李翠平,陈红.Skyline查询处理[J].软件学报,2008,19(6):1386-1400. 被引量：34
4蒋一峰,王华,张玉红,黄少林.基于Lucene的语义检索系统的设计和实现[J].计算机工程与设计,2008,29(20):5336-5337. 被引量：7
5孙圣力,戴东波,黄震华,张齐勋,周立新.概率数据流上Skyline查询处理算法[J].电子学报,2009,37(2):285-293. 被引量：17
6向剑平,郑皎凌.Skyline计算在多维排序问题上的分析[J].太原师范学院学报（自然科学版）,2009,8(2):82-84. 被引量：2
7王晓伟,黄九鸣,贾焰.分布式不确定数据上的概率Skyline计算[J].计算机科学与探索,2010,4(10):951-960. 被引量：8
8黄子晴,刘东苏.Skyline查询处理在文献检索排序中的应用[J].情报理论与实践,2011,34(10):104-108. 被引量：2
9杨立龙,董一鸿,何贤芒.分布式环境下的Skyline代表点查询[J].计算机应用研究,2015,32(1):102-107. 被引量：1
10杨林青,李湛,牟雁超,樊里略,李红燕,王腾蛟,雷凯.面向大规模数据集的并行化Top-k Skyline查询算法[J].计算机科学与探索,2015,9(8):897-905. 被引量：7

引证文献1

1王春梅.一种基于关联度的Skyline多目标优化文献检索方法设计与测试[J].实验室研究与探索,2016,35(9):126-129.

1草无缺.一眼看清MSN空间更新时间[J].计算机应用文摘,2006,22(19):108-108.
2MSN Space更新我先知[J].网友世界,2006(15):26-26.
3陈明.轻松拥有自己的站内搜索引擎[J].电脑爱好者,2004(23):72-72. 被引量：1
4张家超,孔媛媛.利用隐空间支持向量机设计IDS的检测算法[J].计算机应用与软件,2008,25(10):87-89.
5王玲,薄列峰,刘芳,焦李成.稀疏隐空间支持向量机[J].西安电子科技大学学报,2006,33(6):896-901. 被引量：8
6管廷兰,丁华,王秀坤.基于改进粒子群的隐空间支持向量机[J].计算机工程与应用,2006,42(32):69-71.
7樊卫国,张启衡.扩展目标边缘数据空间排序算法[J].光电工程,1997,24(3):9-15.
8张晓琴,王婕婷,钱宇华.一种基于加权kappa增益的有序分类的选择性集成算法[J].山西大学学报（自然科学版）,2015,38(1):58-66. 被引量：1
9丁伟民.排序学习中的Ranking SVM算法研究[J].科技视界,2013(30):84-84. 被引量：2
10王鑫,王熙照,陈建凯,翟俊海.有序决策树的比较研究[J].计算机科学与探索,2013,7(11):1018-1025. 被引量：5

信息与控制

2010年第5期

浏览历史

内容加载中请稍等...

内嵌空间排序支持向量机及其在文本检索中的应用被引量：1

参考文献13

共引文献2

同被引文献12

引证文献1

相关作者

相关机构

相关主题

浏览历史

内嵌空间排序支持向量机及其在文本检索中的应用 被引量：1

参考文献13

共引文献2

同被引文献12

引证文献1

相关作者

相关机构

相关主题

浏览历史

内嵌空间排序支持向量机及其在文本检索中的应用被引量：1