期刊文献+

一种基于多视图协同学习的App分类方法

App classification based on multi-view collaborative learning
下载PDF
导出
摘要 App分类是App检索、App推荐和用户偏好挖掘等智能服务的前提。现有App下载平台已提供了App的类别信息,但这些信息存在无法被第三方使用、无法满足定制的类别体系等问题,因此需要设计一种不依赖App下载平台的App分类方法,而要实现这一目标的挑战包括能利用的描述信息太少(通常仅有App的名称)和有标注样本的数量太少(特别是需要定制类别体系的时候)。针对这些挑战,提出了一种基于多视图协同学习的App分类方法:首先,基于互联网知识对App名称进行信息扩充;然后,从词频和主题两个视图描述App分类问题,并抽取词频分布特征和主题分布特征;最后,采用协同学习方法充分利用无标注样本来训练和融合基于这两个视图的分类器。基于大量真实App数据的实验表明:相比现有方法,提出的方法在分类准确度方面有一定优势。 App classification is the requirement of many intelligent services,e.g.,App query,App recommendation,user preference mining,etc.Although many App delivery platforms have already provided the category information of the Apps,this information can be not used by third parties and cannot adapt to customized taxonomy.Thus,we need an App classification method,which is independent of the App delivery platforms.The main challenges include limited descriptive information(only the name of an App is available)and limited labeled samples(especially when customized taxonomy is required).Aiming at these challenges,this paper proposes an App classification method based on multi-view collaborative learning.Firstly,it enriches the descriptive information through exploiting the additional Web knowledge.Secondly,it represents the App classification problem from two views(i.e.,word frequency view and latent topic view),and extracts word frequency distribution features and latent topic distribution features respectively.Finally,it fully utilizes unlabeled samples to train and fuse the classifiers based on these two views using a collaborative learning approach.The experiments based on real App data show that the proposed method has a competitive App classification accuracy over state-of-the-art methods.
作者 陈铁明 张凯一 吕明琪 CHEN Tieming;ZHANG Kaiyi;Lü Mingqi(College of Computer Science and Technology,Zhejiang University of Technology,Hangzhou 310023,China)
出处 《浙江工业大学学报》 CAS 北大核心 2018年第6期591-597,共7页 Journal of Zhejiang University of Technology
基金 国家自然科学基金资助项目(61772026) 国家自然科学基金浙江两化融合联系基金项目(U1509214) 浙江省自然科学基金资助项目(LY18F020033)
关键词 App分类 互联网知识 多视图学习 协同学习 App classification web knowledge multi-view learning collaborative learning
  • 相关文献

参考文献5

二级参考文献113

  • 1车万翔,刘挺,李生.实体关系自动抽取[J].中文信息学报,2005,19(2):1-6. 被引量:115
  • 2陆奇斌,赵平,王高,黄劲松.消费者满意度测量中的光环效应[J].心理学报,2005,37(4):524-534. 被引量:28
  • 3董静,孙乐,冯元勇,黄瑞红.中文实体关系抽取中的特征选择研究[J].中文信息学报,2007,21(4):80-85. 被引量:55
  • 4刘克彬,李芳,刘磊,韩颖.基于核函数中文关系自动抽取系统的实现[J].计算机研究与发展,2007,44(8):1406-1411. 被引量:58
  • 5Sundheim B, Chinchor N. Survey of the message understanding conferences[CJ Jjproc of HLT'93. Stroudsburg, PA: ACL, 1993: 56-60.
  • 6Banko M, Cafarella M, Soderland S, et a1. Open information ext ract ion from the Web[CJ JjProc of I]CAI 2007. New York: ACM, 2007: 2670-2676.
  • 7Fader A, Soderland S, Etzioni D. Identifying relations for open information extraction[CJ JjProc of EMNLP 2011. Stroudshurg, PA: ACL, 2011: 1535-1545.
  • 8Carlson A, BetteridgeJ, Kisiel B, et a1. Toward an architecture for never-ending language learning[CJ JjProc of AAAI2010. Palo Alto, CA: AAAL 2010: 1306-1313.
  • 9Craven M, KumlienJ. Constructing biological knowledge bases by extracting information from text sources[CJ /lProc of the 7th Int Conf on Intelligent Systems for Molecular Biology. Palo Alto, CA: AAAL 1999: 77-86.
  • 10Blum A, Mitchell T. Combining labeled and unlabeled data with co-training[CJ Jjproc of ICML 1998. New York: ACM, 1998: 92-100.

共引文献127

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部