Research on the Application of the Chameleon Clustering Algorithm in Web Open Source Intelligence Topic Mining


Abstract: In the information age, open source intelligence spreads quickly, is large in volume, and is highly time-sensitive, so most of the data is difficult to analyze manually. To improve the efficiency of analyzing massive amounts of data, a Web open source intelligence information processing method is designed. The method first uses a web crawler to fetch the target intelligence by URL, then organizes the page content with a DOM tree, extracts keywords with the TextRank algorithm, and builds a topic mining model with the Chameleon clustering algorithm. The model generates intelligence topics and analyzes them automatically. Performance tests show that the Chameleon-based Web open source intelligence information processing method can analyze open source intelligence effectively.
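The keyword-extraction step named in the abstract can be illustrated with a minimal sketch. It assumes the page text has already been tokenized and filtered for stop words. The function name `textrank_keywords` and its parameters (`window`, `damping`, `iterations`, `top_k`) are illustrative choices, not taken from the paper, and the paper's actual graph construction and settings may differ. Only the TextRank-style ranking is shown here.

```python
from collections import defaultdict


def textrank_keywords(tokens, window=3, damping=0.85, iterations=50, top_k=5):
    """Rank tokens with a TextRank-style PageRank over a co-occurrence graph.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    # Build an undirected graph: two distinct tokens are linked when they
    # fall inside the same sliding window of `window` consecutive tokens.
    neighbors = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + window]:
            if other != word:
                neighbors[word].add(other)
                neighbors[other].add(word)

    # Start from a uniform score and apply the PageRank update repeatedly.
    scores = {word: 1.0 for word in neighbors}
    for _ in range(iterations):
        scores = {
            word: (1 - damping)
            + damping * sum(scores[n] / len(neighbors[n]) for n in neighbors[word])
            for word in neighbors
        }

    # Sort by score (descending); ties are broken alphabetically.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:top_k]
```

In a pipeline like the one described, the keywords returned for each page would then serve as the features that the clustering stage groups into topics.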

Author: 方世敏 FANG Shi-min (School of Politics, National Defence University, Shanghai 200433, China)

Affiliation: School of Politics, National Defence University

Source: Information Technology (《信息技术》), 2024, No. 11, pp. 63-68, 76 (7 pages)

Funding: National Social Science Fund of China, Military Science Youth Project (2019-SKJJ-C-064).

Keywords: Chameleon; Web open source intelligence; topic mining; Web crawler

Classification: TP399 [Automation and Computer Technology: Computer Application Technology]; G350.7 [Culture and Science: Information Science]

