A String Sort Algorithm in Custom Character Order

Cited by: 4
Abstract: Existing sorting algorithms make it difficult to sort strings in a custom character order. This paper proposes a fast string sorting method for custom character orders. Consecutive numbers define the sort order of the characters, a hash table converts each string into the corresponding integer array, and the largest character number serves as the new radix of the radix sort algorithm, which then sorts the strings. Analysis and experiments show that the method effectively sorts strings in a custom order, has linear time and space complexity, achieves better time performance than Quick Sort, and extends readily to string sorting in other languages.
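Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), CSCD, Peking University Core Journal, 2012, No. 9, pp. 1968-1971 (4 pages)
Funding: Supported by the National Natural Science Foundation of China (61163045, 31040050) and the Xinjiang Normal University Postdoctoral Research Start-up Fund (XJNUBS1111)
Keywords: string sort; custom order; radix sort; hash table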
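
The pipeline outlined in the abstract (consecutive character numbers, a hash table that turns each string into an integer array, then radix sort with the largest number as the radix) might look roughly like the sketch below. This is not the paper's implementation: the function names `build_rank_table` and `custom_radix_sort`, the use of code 0 as padding for shorter strings, and the least-significant-position-first pass order are all assumptions made for illustration.

```python
from typing import Dict, List, Sequence


def build_rank_table(alphabet: Sequence[str]) -> Dict[str, int]:
    """Hash table from character to consecutive number, starting at 1.

    Code 0 is reserved for padding (an assumption of this sketch).
    """
    return {ch: i for i, ch in enumerate(alphabet, start=1)}


def custom_radix_sort(words: List[str], alphabet: Sequence[str]) -> List[str]:
    """Sort words by the character order given in `alphabet`.

    Raises KeyError if a word contains a character missing from `alphabet`.
    """
    rank = build_rank_table(alphabet)
    radix = len(rank) + 1  # largest character number, plus one slot for padding

    # Convert every string into its array of integer codes.
    keys = [[rank[ch] for ch in w] for w in words]
    max_len = max((len(k) for k in keys), default=0)

    # Stable bucket passes from the last position to the first.
    order = list(range(len(words)))
    for pos in range(max_len - 1, -1, -1):
        buckets: List[List[int]] = [[] for _ in range(radix)]
        for idx in order:
            key = keys[idx]
            code = key[pos] if pos < len(key) else 0  # pad short strings
            buckets[code].append(idx)
        order = [idx for bucket in buckets for idx in bucket]

    return [words[i] for i in order]


if __name__ == "__main__":
    reverse_alphabet = "zyxwvutsrqponmlkjihgfedcba"
    print(custom_radix_sort(["abc", "zb", "za", "z", "b"], reverse_alphabet))
```

Each pass touches every string once and every bucket once, so the sketch costs on the order of `max_len * (n + radix)` steps. That is linear in the number of strings only when string length and alphabet size are treated as bounded, which is one reading of the linear-complexity claim above.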