期刊文献+

P2P多线程动态容错模型的研究与应用

Research and Application of P2P Multithread Dynamic Fault-tolerance Model
下载PDF
导出
摘要 在多线程并行程序中,如只由主线程负责容错处理,主线程本身则成为潜在的单点故障点。针对该问题,设计一种对等多线程动态容错模型。该模型包含多轮容错过程,每轮容错过程采用容错控制线程随机决定容错模型动态生成的方法。在EasyPDP系统中进行实际应用测试,结果证明,该容错模型能够避免主线程单点故障,同时保证系统加速比与负载平衡性能。 In a multithread parallel program, the master thread is an underlying single point failure under the consumption that the fault-tolerance is handled by the mast thread only. To solve this problem, a Peer-to-Peer(P2P) multithread dynamic fault-tolerance model is designed and implemented. It contains several rounds of fault-tolerance process, fault-tolerance control thread is decided randomly and the fault-tolerance model is generated dynamically in every round. Test work is conducted in the system of EasyPDP, and results shows that this model is able to avoid the single point failure of the master thread, as well as maintain the system speed-up ratio and load balance performance.
出处 《计算机工程》 CAS CSCD 2013年第9期104-108,共5页 Computer Engineering
基金 国家自然科学基金资助项目(10978016 11003027) 天津市科技支撑计划基金资助重点项目(11ZCKFGX01000 11ZCKFGX04200)
关键词 多线程 容错 单点故障 对等 EasyPDP系统 multithread fault-tolerance single point failure Peer-to-Peer(P2P) EasyPDP system
  • 相关文献

参考文献10

  • 1Li Xiaobin, Gaudiot J L. Design Trade-offs and Deadlock Prevention in Transient Fault-tolerant SMT Processors[C]// Proc. of the 12th International Symposium on Dependable Computing. Riverside, USA: Is. n.], 2006.
  • 2Madan N, Balasubramonian R. Power-efficient Approaches to Redundant Multithreading[J]. IEEE Transactions on Parallel and Distributed Systems, 2007, 18(8): 1066-1079.
  • 3Ma Yi, Zhou Huiyang. Efficient Transient-fault Tolerance for Multithread Processors Using Dual-thread Execution[C]//Proc. of International Conference on Computer Design. Las Vegas, USA: Is. n.], 2007.
  • 4Dieter W, Lumpp J J. A User-level Cheekpointing Library for POSIX Threads Programs[C]//Proe. of Symposium on Fault- tolerant Computing. Los Alamitos, USA: IEEE Computer, 1999.
  • 5富弘毅,丁滟,宋伟,杨学军.一种利用并行复算实现的OpenMP容错机制[J].软件学报,2012,23(2):411-427. 被引量:7
  • 6Tang Shanjiang, Yu Ce, Sun Jizhou, et al. EasyPDP: An Efficient Parallel Dynamic Programming Runtime System for Computational Biology[J]. IEEE Transactions on Parallel and Distributed Systems, 2012, 23(5): 862-872.
  • 7Androutsellis T S, Spinellis D. A Survey of Peer-to-Peer Content Distribution Technologies[J]. ACM Computing Surveys, 2004, 36(4): 335-371.
  • 8Canon L C, Jeannot E. Evaluation and Optimization of the Robustness of DAG Schedules in Heterogeneous Environ- ments[J]. IEEE Transactions on Parallel and Distributed Systems, 2010, 21(4): 532-546.
  • 9郝水侠,曾国荪,谭一鸣.一种基于DAG图的异构可重构任务划分方法[J].同济大学学报(自然科学版),2011,39(11):1693-1698. 被引量:4
  • 10周佳祥,郑纬民.基于DAG图解-重构的机群系统静态调度算法[J].软件学报,2000,11(8):1097-1104. 被引量:7

二级参考文献24

  • 1沈轶炜,曾国荪.异构计算中一种图的非均衡划分算法[J].计算机科学,2006,33(6):260-263. 被引量:7
  • 2Freund R F. Optimal selection theory for superconcurrency [C]//Proceedings of Conference on Supercomputing. New York : ACM, 1989: 699 - 703.
  • 3Chen S,Eshaghian M,Khokhar A, et al. A selection theory and methodology for heterogeneous supercomputing [C ] // Proceedings of Workshop on Heterogeneous Processing. Los Alamitos: IEEE CS Press, 1993:15 - 22.
  • 4Khokhar A, Prasamma V K, Shaaban M E. Heterogeneous computing: challenges and opportunities[J]. Computer, 1993,26 (6):18.
  • 5Estrin G, Bussellt B, Turn R, et al. Parallel processing in a restructure computer system [ J ]. IEEE Transactions on Electronic Computers, 1963,12 (5) : 747.
  • 6Kartashev S I, Kartashev S P. A multicomputer system with dynamic architecture [J]. IEEE Transactions on Computers, 1979,28(10) :704.
  • 7Bruce H,TamaraG. Graph partitioning models for parallel computing[J]. Parallel Computing, 2000,26(1) : 1519.
  • 8Selvakkumaran N, George K. Multiobjective hypergraph partitioning algorithms for cut and maximum subdomain-degree minimization[J]. IEEE Transactions on Computer Aided Design of Intergrated Circuits and System, 2006,25 (3) :504.
  • 9HendricksonB, Kolda T G, Partitioning nonsquare and nonsymmetric matrices for parallel processing [J ]. SIAM Journal on Scientics Computer, 2000 (21) :2048.
  • 10Catalyurek U V, Aykanat C. Hypergraph-partitioning based decomposing for parallel sparsematrix vector multiplication[J]. IEEE Transactions on Parallel Distribution System, 1999, 10 (5) :673.

共引文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部