摘要
为了提高用户业务识别精度和效率,提升APP标签数据抓取效果,提出基于智能化的HTTPS协议深度解析率提升方法。在HTTPS协议轻度解析中,引入自适应免疫进化算法的聚焦爬虫方法,抓取APP标签数据作为训练数据;在HTTPS协议中度解析中,构建业务粗、细分类模型,实现已有业务识别;在HTTPS协议深度解析中,深度解析新增业务并获取ID,通过深度学习训练服务器,实现新增业务流数据快速识别。实验结果表明,所提方法的APP标签数据抓取效果较好。
In order to improve the accuracy and efficiency of user business identification and improve the effect of APP tag data capture,this paper proposes an intelligent method to improve the depth resolution rate of HTTPS protocol.In the mild parsing of HTTPS protocol,the focused crawler method of adaptive immune evolutionary algorithm is introduced to grab APP tag data as training data.In the moderate resolution of the HTTPS protocol,the coarse and fine classification models of services are constructed to realize the identification of existing services.In the depth analysis of the HTTPS protocol,the new business is deeply analyzed and the ID is obtained.Through the depth learning training server,the new business flow data is quickly identified.Experiment results show that the proposed method has a good effect in capturing APP tag data.
作者
罗贤坤
LUO Xian-kun(China Mobile Communications Group Jiangxi Co.,Ltd.,Nanchang 330000,China)
出处
《信息技术》
2022年第6期145-150,156,共7页
Information Technology