Abstract
This paper proposes an equal-probability segment sorting algorithm for data. Building on previous probabilistic-statistical sorting algorithms, the algorithm makes full use of the distribution information of the data, so that the data to be sorted are distributed as evenly as possible across different intervals; the data in each interval are then sorted separately, yielding an ordered sequence. A quantitative study of the effectiveness of the algorithm is presented, which theoretically quantifies and analyzes the choice of the number of segments m, the degree of approximation of the distribution type, and several factors that affect them; this quantification can guide sorting in practice. Some important conclusions are derived.
This paper proposes a subsection insertion-sorting algorithm based on equal-probability data segments. The algorithm combines traditional sorting algorithms with techniques from modern statistics to sort data drawn from a general distribution, making full use of the distribution information of the data. The approach consists mainly of the following steps. First, the distribution type of the data is determined experimentally. Second, the distribution parameters of the data are estimated. Third, the data are assigned evenly, on the whole, to different segments. Finally, the data in each segment are sorted with a traditional sorting algorithm. The time complexity of this algorithm is limited to O(n). The theory of non-parametric hypothesis testing is used to quantify the number of segments and the degree of approximation of the distribution type, and to derive the factors that affect them. Some important theoretical results are deduced. Let b denote the ratio of the number of data items to the number of segments. The main results are as follows: for large n, the algorithm is time-optimal when b is a constant, and the value of b required to guarantee O(n) time complexity depends on how closely the assumed distribution type approximates the true one. Experiments on these results show that the theoretical conclusions are consistent with practice.
Source
《计算机学报》
EI
CSCD
Peking University Core Journals
2003, No. 1, pp. 45-50 (6 pages)
Chinese Journal of Computers
Funding
Supported by the National Natural Science Foundation of China (69973016, 69733010)