期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
A Fast Multi-Pattern Matching Algorithm for Mining Big Network Data 被引量:3
1
作者 Jun Liu Guangkuo Bian +1 位作者 Chao Qin Wenhui Lin 《China Communications》 SCIE CSCD 2019年第5期121-136,共16页
The rapid development of mobile network brings opportunities for researchers to analyze user behaviors based on largescale network traffic data. It is important for Internet Service Providers(ISP) to optimize resource... The rapid development of mobile network brings opportunities for researchers to analyze user behaviors based on largescale network traffic data. It is important for Internet Service Providers(ISP) to optimize resource allocation and provide customized services to users. The first step of analyzing user behaviors is to extract information of user actions from HTTP traffic data by multi-pattern URL matching. However, the efficiency is a huge problem when performing this work on massive network traffic data. To solve this problem, we propose a novel and accurate algorithm named Multi-Pattern Parallel Matching(MPPM) that takes advantage of HashMap in data searching for extracting user behaviors from big network data more effectively. Extensive experiments based on real-world traffic data prove the ability of MPPM algorithm to deal with massive HTTP traffic with better performance on accuracy, concurrency and efficiency. We expect the proposed algorithm and it parallelized implementation would be a solid base to build a high-performance analysis engine of user behavior based on massive HTTP traffic data processing. 展开更多
关键词 HTTP TRAFFIC multi-patterns matching SPARK URL matching USER behavior
下载PDF
Multi-Pattern Matching Algorithm with Wildcards Based on Bit-Parallelism
2
作者 Ahmed A. F. Saif HU Liang CHU Jianfeng 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2017年第2期178-184,共7页
Multi-pattern matching with wildcards is a problem of finding the occurrence of all patterns in a pattern set {p^1,… ,p^k} in a given text t. If the percentage of wildcards in pattern set is not high, this problem ca... Multi-pattern matching with wildcards is a problem of finding the occurrence of all patterns in a pattern set {p^1,… ,p^k} in a given text t. If the percentage of wildcards in pattern set is not high, this problem can be solved using finite automata. We introduce a multi-pattern matching algorithm with a fixed number of wildcards to overcome the high percentage of the occurrence of wildcards in patterns. In our proposed method, patterns are matched as bit patterns using a sliding window approach. The window is a bit window that slides along the given text, matching against stored bit patterns. Matching process is executed using bit wise operations. The experimental results demonstrate that the percentage of wildcard occurrence does not affect the proposed algorithm's performance and the proposed algorithm is more efficient than the algorithms based on the fast Fourier transform. The proposed algorithm is simple to implement and runs efficiently in O(n + d(n/σ )(m/w)) time, where n is text length, d is symbol distribution over k patterns, m is pattern length, and σ is alphabet size. 展开更多
关键词 multi-pattern string matching WILDCARD bitparallelism
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部