摘要
针对搜索引擎查询结果集中的相同记录出现次数的统计问题,提出了分档统计的算法。该算法在时间上比逐个字符统计频率快,能够达到O(n)的时间代价,算法还针对长字符串(字串的长度与字串的个数相差不多)进行了优化,降低了计算规模。
A new algorithm is proposed aiming at search engine's result set calculating frequency, which has a higher frequency than calculating and has the time complexity of O(n) The algorithm for long strings (the length of the string is nearly the same as with the number of the string of) is optimized to reduce the size of the calculation.
出处
《天津工程师范学院学报》
2008年第4期40-42,共3页
Journal of Tianji University of Technology and Education
关键词
分档
字符串频率
算法
classification
frequency of the string
algorithm