Protection-type transfer learning clustering algorithm with cosine distance metric (余弦距离下保护型迁移学习聚类算法)

Cited by: 1

Abstract: Previous studies of transfer learning and traditional machine learning have mostly focused on the soundness of the formulas while overlooking the problem as a whole. As a result, the algorithms often fail to classify thoroughly when applied to concrete text classification problems. By examining the entire text classification process, this paper uses cosine distance in the k-means algorithm, which markedly improves the experimental results. It also proposes a protection-type iteration scheme, abandons the traditional word feature space in favor of a hidden space as the feature vector space, and imposes normalization constraints. Taking the CCI algorithm as an example, these ideas yield an improved algorithm, named PCCI, which significantly raises the classification accuracy of transfer learning while reducing computational complexity. Tests on the 20-NewsGroups and Reuters-21578 datasets, with comparisons against other existing transfer learning algorithms, demonstrate the superiority of the improved algorithm.

Authors: Zhang Yankai (张焱凯), Bao Fang (包芳), Wang Shitong (王士同)

Affiliations: School of Digital Media, Jiangnan University (江南大学数字媒体学院); Jiangyin Polytechnic College (江阴职业技术学院)

Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2015, Issue 23, pp. 131-138, 225 (9 pages)

Keywords: transfer learning; Euclidean distance; cosine distance; protection-type; normalization constraints; over dimension

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
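
The abstract's central idea, running k-means with cosine distance on normalized feature vectors, can be illustrated with a small sketch. This is not the paper's PCCI algorithm, and it omits the protection-type iteration and the hidden-space features, whose details are not given in this record. It is a plain k-means variant under stated assumptions: the function names (`normalize`, `cosine_distance`, `cosine_kmeans`), the toy vectors, and the choice to re-normalize centroids after every update are all illustrative.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit L2 length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def cosine_distance(a, b):
    """Cosine distance 1 - cos(a, b); assumes both vectors have unit length."""
    return 1.0 - sum(x * y for x, y in zip(a, b))


def cosine_kmeans(points, k, iterations=20, seed=0):
    """Cluster points with k-means under cosine distance (illustrative only)."""
    rng = random.Random(seed)
    data = [normalize(p) for p in points]
    centroids = [list(c) for c in rng.sample(data, k)]
    labels = [0] * len(data)
    for _ in range(iterations):
        # Assignment step: pick the nearest centroid by cosine distance.
        labels = [
            min(range(k), key=lambda j: cosine_distance(p, centroids[j]))
            for p in data
        ]
        # Update step: average the members, then re-normalize to unit length.
        for j in range(k):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                mean = [sum(col) / len(members) for col in zip(*members)]
                centroids[j] = normalize(mean)
    return labels, centroids


# Hypothetical term-weight vectors: two topics, documents of different lengths.
docs = [
    [5.0, 1.0, 0.0],  # long document, topic A
    [1.0, 0.3, 0.0],  # short document, topic A
    [0.0, 1.0, 4.0],  # long document, topic B
    [0.0, 0.3, 1.0],  # short document, topic B
]
labels, centroids = cosine_kmeans(docs, k=2)
print(labels)
```

Because every vector is scaled to unit length first, a long and a short document on the same topic point in nearly the same direction and end up in the same cluster. Under Euclidean distance on the raw vectors, the difference in document length would dominate.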



