期刊文献+

基于主题偏好的在线健康社区用户兴趣群体识别研究——以医享网为例 被引量:9

Research on the Identification of Users’ Interest Groups in Online Health Community Based on Topic Preference :an Investigation of yx129.com
原文传递
导出
摘要 【目的/意义】用户对健康信息的需求存在一定的差异,并形成不同的兴趣群体,对用户兴趣群体的发现以及组成机构的分析,有助于探索在线健康社区中知识转移与扩散的相关规律。【方法/过程】本研究借助ATM模型,通过文本聚类识别用户偏好主题,引入用户映射表和海林格距离算法,对用户兴趣群进行识别,并其分布结构和偏好主题内容进行了分析;同时,将医享网作为案例进行研究,进一步识别和分析健康社区的用户兴趣群及其相关特征。【结果/结论】采用本研究框架识别出来医享网痛风社区可分为保健、症状与治疗三个偏好主题;在此基础上结合用户兴趣群体结构分析和偏好主题具体内容,发现不同兴趣群体规模有所差别,组成结构具有一定的交叉性,并在一定程度上根据疾病发展的阶段性特征,存在动态迁移性。【创新/局限】基于主题-用户映射关系,对在线健康社区中偏好相似用户群和主题偏好分布规律进行探索,今后可进一步完善模型对不均衡文本的处理能力。 【Purpose/significance】Users have different needs for health information and form different interest groups. The discovery of user interest groups and the analysis of their constituent organizations will help to explore the relevant laws of knowledge transfer and diffusion in online health communities.【Method/process】This study uses ATM model to identify user preference topics through text clustering, introduces user mapping table and Hellinger distance algorithm, identifies user interest groups, and analyzes the structure and preference content of user interest groups. At the same time, the yx129.com website was used as a case to further identify and analyze the user interest groups of the healthy community and their related characteristics.【Result/conclusion】the gout community of yx129.com website can be divided into three preference themes: health care, symptoms and treatment. On the basis of this, combined with the user interest group structure analysis and the specific content of preference theme. It is found that the scale of different interest groups is different, the composition structure has certain cross-cutting, and to some extent, according to the stage characteristics of disease development, there is dynamic mobility.【Innovation/limitation】Based on the topic user mapping relationship, this paper explores the distribution of user groups with similar preferences and topic preferences in online health community, In the future, we need to further improve the ability of the model to deal with unbalanced text in the future.
作者 董伟 陶金虎 DONG Wei;TAO Jin-hu(School of Education,Tianjin University,Tianjin 300350,China)
出处 《情报科学》 CSSCI 北大核心 2021年第3期88-93,119,共7页 Information Science
基金 国家社科青年项目“在线健康社区用户交互行为及其对用户健康效用影响研究”(16CTQ029)。
关键词 主题偏好 群体识别 ATM 作者主题模型 在线健康社区 topic preference group identification ATM author topic model online health community
  • 相关文献

参考文献11

二级参考文献99

  • 1王妙娅.国内图书馆微博应用现状及建议[J].图书馆学研究(应用版),2010(12):37-41. 被引量:123
  • 2张青,陈翀,向勇.深度分组检测技术研究及在流量经营中的应用[J].电信科学,2013,29(S1):234-239. 被引量:2
  • 3郭岩,白硕,杨志峰,张凯.网络日志规模分析和用户兴趣挖掘[J].计算机学报,2005,28(9):1483-1496. 被引量:62
  • 4王继民,彭波.搜索引擎用户点击行为分析[J].情报学报,2006,25(2):154-162. 被引量:45
  • 5熊文新,宋柔.信息检索用户查询语句的停用词过滤[J].计算机工程,2007,33(6):195-197. 被引量:16
  • 6Stadnyk I, Kass R. Modeling users' interests in information filters. Communications of the ACM, 1992, 35(12): 49-50.
  • 7Halavais A, Lackaff D. An analysis of topical coverage of Wikipedia. Journal of Computer-Mediated Communication, 2008, 13(2): 429-440.
  • 8Kittur A, Chi E H, Suh B. What's in wikipedia: Mapping topics and conflict using socially annotated category struc- ture//Proceedings of the ACM Conference on Human Factors in Computing Systems. Boston, USA, 2009:1509-1512.
  • 9Lin Tsun-Chen, Liu Ru-Sheng, Chen Shu-Yuan, Liu Chen- Chung, Chen Chieh-Yu. Genetic algorithms and silhouette measures applied to microarray data classification//Proceed- ings of the 3rd Asia-Pacific Bioinformatics Conference. Singapore, 2005:229-238.
  • 10Plangprasopchok A, Lerman K, Getoor L. Growing a tree in the forest: Constructing folksonomies by integrating struc- tured metadata//Proceedings of the Conference on Knowl- edge Discovery and Data Mining. Washington, USA, 2010: 949-958.

共引文献178

同被引文献180

引证文献9

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部