AN EMPIRICAL FEASIBILITY STUDY OF SOCIETAL RISK CLASSIFICATION TOWARD BBS POSTS (Cited by: 3)


Abstract: Societal risk classification is a fundamental task for online societal risk monitoring. To show the challenge and feasibility of societal risk classification of BBS posts, this paper conducts an empirical analysis. Following an effectiveness analysis, a Support Vector Machine based on Bag-of-Words (BOW-SVM) is adopted to validate the challenge, and distributed document embeddings of BBS posts generated by Paragraph Vector are applied to the feasibility study. Using BOW-SVM, cross-validations are conducted on BBS posts labeled by different groups and annotators. The large fluctuation in the cross-validation results indicates differences in individual risk perception, which make societal risk classification more challenging. Furthermore, based on the distributed document embeddings, the pairwise similarities of more than 300 thousand BBS posts from different societal risk categories are compared. The higher similarities among posts in the same societal risk category show that such posts share more features than posts from different categories, which demonstrates the feasibility of societal risk classification of BBS posts and suggests that the performance of societal risk monitoring can be improved.
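The pairwise-similarity comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes cosine similarity as the similarity measure, and the toy vectors, category names, and helper functions `cosine` and `mean_pairwise_similarity` are hypothetical stand-ins for Paragraph Vector embeddings and the paper's societal risk categories.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_pairwise_similarity(embeddings, labels):
    """Return (mean same-category similarity, mean cross-category similarity)."""
    same, cross = [], []
    for i, j in combinations(range(len(embeddings)), 2):
        sim = cosine(embeddings[i], embeddings[j])
        # Route each pair by whether the two posts share a category label.
        (same if labels[i] == labels[j] else cross).append(sim)
    return sum(same) / len(same), sum(cross) / len(cross)


# Toy 3-dimensional vectors standing in for document embeddings (made up).
toy_embeddings = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.1],
    [0.1, 0.9, 0.2],
    [0.0, 0.8, 0.3],
]
# Hypothetical category names, not taken from the paper.
toy_labels = ["category_a", "category_a", "category_b", "category_b"]

same_mean, cross_mean = mean_pairwise_similarity(toy_embeddings, toy_labels)
print(f"same-category mean: {same_mean:.3f}, cross-category mean: {cross_mean:.3f}")
```

A same-category mean that is clearly higher than the cross-category mean is the kind of evidence the abstract reads as supporting feasibility. The loop is quadratic in the number of posts, so a corpus of several hundred thousand posts would need sampling or batched matrix operations.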

Authors: Jindong Chen, Xiaoji Zhou, Xijin Tang

Affiliations: School of Economics and Management; China Academy of Aerospace Systems Science and Engineering; Academy of Mathematics and Systems Science; Beijing Key Lab of Green Development Decision Based on Big Data; University of Chinese Academy of Sciences

Source: Journal of Systems Science and Systems Engineering (系统科学与系统工程学报（英文版）), indexed in SCIE, EI, CSCD; 2018, Issue 6, pp. 709-726 (18 pages)

Keywords: societal risk classification; Tianya Forum; cross validation; pairwise similarity; individual risk perception

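Classification codes: TP393 [Automation and Computer Technology: Computer Application Technology]; TU323.1 [Architectural Science: Structural Engineering]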

