In order to improve the real-time performance of the real-time HLA(high level architecture) in the application of massive data communication volume,multi-thread processing was adopted,thread pool structure was introdu...In order to improve the real-time performance of the real-time HLA(high level architecture) in the application of massive data communication volume,multi-thread processing was adopted,thread pool structure was introduced into the system,different threads to handle corresponding message queues was utilized to respond different message requests.Furthermore,an allocation strategy of semi-complete deprivation of priority was adopted,which reduces thread switching cost and processing burden in the system,provided that the message requests with high priority can be responded in time,thus improves the system's overall performance.The design and experiment results indicate that the method proposed in this paper can improve the real-time performance of HLA in distributed system applications greatly.展开更多
This article describes three algorithms for distance field generation on triangulated model: brute force algorithm, single-threaded algorithm based on spatial partition and multi-threaded algorithm based on spatial pa...This article describes three algorithms for distance field generation on triangulated model: brute force algorithm, single-threaded algorithm based on spatial partition and multi-threaded algorithm based on spatial partition. Spatial partition algorithm use equidistant network divide the bounding box into equal-sized cubes, calculates the maximum and minimum distances between the sample point and each of the small cubes,taking the minimum value from the maximum distance as the minimum distance from the sample point to the model named d1, comparing d1 with the distance from sample point to every little cube's minimum distance d2, if d1 <d2, the sample point's distance to all triangles inside this cube are greater than d1, skip this cube, otherwise, calculated the distance from the point to all the triangles intersect with the cube, then alternative d1 with the minimum value, circulate all small cubes intersect with the model. Comparing the calculation results, it can be seen that the algorithm about the multi-threaded distance field relative to the other two algorithms in computational speed is greatly improved especially for complex models.展开更多
mc211vm is a process-level ARM-to-x86 binary translator developed in our lab in the past several years. Currently, it is able to emulate singlethreaded programs. We extend mc211vm to emulate multi-threaded programs. O...mc211vm is a process-level ARM-to-x86 binary translator developed in our lab in the past several years. Currently, it is able to emulate singlethreaded programs. We extend mc211vm to emulate multi-threaded programs. Our main task is to reconstruct its architecture for multi-threaded programs. Register mapping, code cache management, and address mapping in mc2llvm have all been modified. In addition, to further speed up the emulation, we collect hot paths, aggressively optimize and generate code for them at run time. Additional threads are used to alleviate the overhead. Thus, when the same hot path is walked through again, the corresponding optimized native code will be executed instead. In our experiments, our system is 8.8X faster than QEMU (quick emulator) on average when emulating the specified benchmarks with 8 guest threads.展开更多
In order to realize the efficiency, reliability and safety tests on the complex cable network of an electronic system, an efficient cable network resistance tester is designed. Firstly, the design background and hardw...In order to realize the efficiency, reliability and safety tests on the complex cable network of an electronic system, an efficient cable network resistance tester is designed. Firstly, the design background and hardware structure are briefly described. Then aiming at the multi task parallelism considering real time measurement of parameters and real time control of the system in the tester testing, a real time muhi task control software is developed by using multi thread testing technology in parallel test to realize multi task complex control. Finally, the least squares method is used to improve the test accuracyof the tester. The test results show that the test error is basically within 0.3%, and the test speed can reach 345 point/min.展开更多
Due to current technology enhancement,molecular databases have exponentially grown requesting faster efficient methods that can handle these amounts of huge data.There-fore,Multi-processing CPUs technology can be used...Due to current technology enhancement,molecular databases have exponentially grown requesting faster efficient methods that can handle these amounts of huge data.There-fore,Multi-processing CPUs technology can be used including physical and logical processors(Hyper Threading)to significantly increase the performance of computations.Accordingly,sequence comparison and pairwise alignment were both found contributing significantly in calculating the resemblance between sequences for constructing optimal alignments.This research used the Hash Table-NGram-Hirschberg(HT-NGH)algo-rithm to represent this pairwise alignment utilizing hashing capabilities.The authors propose using parallel shared memory architecture via Hyper Threading to improve the performance of molecular dataset protein pairwise alignment.The proposed parallel hyper threading method targeted the transformation of the HT-NGH on the datasets decomposition for sequence level efficient utilization within the processing units,that is,reducing idle processing unit situations.The authors combined hyper threading within the multicore architecture processing on shared memory utilization remarking perfor-mance of 24.8%average speed up to 34.4%as the highest boosting rate.The benefit of this work improvement is shown preserving acceptable accuracy,that is,reaching 2.08,2.88,and 3.87 boost-up as well as the efficiency of 1.04,0.96,and 0.97,using 2,3,and 4 cores,respectively,as attractive remarkable results.展开更多
We are developing a speed reducer that can be considered a transformation of a worm gear reducer: the worm is replaced by an inverted roller screw, and the gear is replaced by a threaded chain drive. This configuratio...We are developing a speed reducer that can be considered a transformation of a worm gear reducer: the worm is replaced by an inverted roller screw, and the gear is replaced by a threaded chain drive. This configuration lessens wear, increases load capacity, and improves efficiency. The threaded chain consists of nut-shaped links. This paper presents the results of tests carried out on a prototype with a reduction ratio of 46.展开更多
针对基于Linux和TCG软件栈(Trusted computing group Software Stack,TSS)的复杂性问题,提出一种轻量级的可信软件栈。分析了TSS的基本结构与TSS在嵌入式系统的局限,总结出基于嵌入式系统的可信软件栈设计需求,设计出软件栈命令调用的...针对基于Linux和TCG软件栈(Trusted computing group Software Stack,TSS)的复杂性问题,提出一种轻量级的可信软件栈。分析了TSS的基本结构与TSS在嵌入式系统的局限,总结出基于嵌入式系统的可信软件栈设计需求,设计出软件栈命令调用的机制和软件栈的结构。此外,分析了TSS密钥管理缓存算法,在flash中定义一块密钥槽空间,方便密钥管理中直接访问,阐述密钥生成的逻辑过程,实现面向嵌入式系统的可信软件系统。经实验验证,该软件栈可以结合RT-Thread实时系统实现基本的可信计算功能。展开更多
基金Sponsored by the National Defence SciTech Key Lab Fundation(51457040204BQ0102)
文摘In order to improve the real-time performance of the real-time HLA(high level architecture) in the application of massive data communication volume,multi-thread processing was adopted,thread pool structure was introduced into the system,different threads to handle corresponding message queues was utilized to respond different message requests.Furthermore,an allocation strategy of semi-complete deprivation of priority was adopted,which reduces thread switching cost and processing burden in the system,provided that the message requests with high priority can be responded in time,thus improves the system's overall performance.The design and experiment results indicate that the method proposed in this paper can improve the real-time performance of HLA in distributed system applications greatly.
文摘This article describes three algorithms for distance field generation on triangulated model: brute force algorithm, single-threaded algorithm based on spatial partition and multi-threaded algorithm based on spatial partition. Spatial partition algorithm use equidistant network divide the bounding box into equal-sized cubes, calculates the maximum and minimum distances between the sample point and each of the small cubes,taking the minimum value from the maximum distance as the minimum distance from the sample point to the model named d1, comparing d1 with the distance from sample point to every little cube's minimum distance d2, if d1 <d2, the sample point's distance to all triangles inside this cube are greater than d1, skip this cube, otherwise, calculated the distance from the point to all the triangles intersect with the cube, then alternative d1 with the minimum value, circulate all small cubes intersect with the model. Comparing the calculation results, it can be seen that the algorithm about the multi-threaded distance field relative to the other two algorithms in computational speed is greatly improved especially for complex models.
基金supported by NSC under Grant No.NSC 100-2218-E-009-009MY3 and NSC 100-2218-E-009-010-MY3
文摘mc211vm is a process-level ARM-to-x86 binary translator developed in our lab in the past several years. Currently, it is able to emulate singlethreaded programs. We extend mc211vm to emulate multi-threaded programs. Our main task is to reconstruct its architecture for multi-threaded programs. Register mapping, code cache management, and address mapping in mc2llvm have all been modified. In addition, to further speed up the emulation, we collect hot paths, aggressively optimize and generate code for them at run time. Additional threads are used to alleviate the overhead. Thus, when the same hot path is walked through again, the corresponding optimized native code will be executed instead. In our experiments, our system is 8.8X faster than QEMU (quick emulator) on average when emulating the specified benchmarks with 8 guest threads.
文摘In order to realize the efficiency, reliability and safety tests on the complex cable network of an electronic system, an efficient cable network resistance tester is designed. Firstly, the design background and hardware structure are briefly described. Then aiming at the multi task parallelism considering real time measurement of parameters and real time control of the system in the tester testing, a real time muhi task control software is developed by using multi thread testing technology in parallel test to realize multi task complex control. Finally, the least squares method is used to improve the test accuracyof the tester. The test results show that the test error is basically within 0.3%, and the test speed can reach 345 point/min.
基金Deanship of Scientific Research(DSR),King Abdulaziz University,Grant/Award Number:D-139-137-1441。
文摘Due to current technology enhancement,molecular databases have exponentially grown requesting faster efficient methods that can handle these amounts of huge data.There-fore,Multi-processing CPUs technology can be used including physical and logical processors(Hyper Threading)to significantly increase the performance of computations.Accordingly,sequence comparison and pairwise alignment were both found contributing significantly in calculating the resemblance between sequences for constructing optimal alignments.This research used the Hash Table-NGram-Hirschberg(HT-NGH)algo-rithm to represent this pairwise alignment utilizing hashing capabilities.The authors propose using parallel shared memory architecture via Hyper Threading to improve the performance of molecular dataset protein pairwise alignment.The proposed parallel hyper threading method targeted the transformation of the HT-NGH on the datasets decomposition for sequence level efficient utilization within the processing units,that is,reducing idle processing unit situations.The authors combined hyper threading within the multicore architecture processing on shared memory utilization remarking perfor-mance of 24.8%average speed up to 34.4%as the highest boosting rate.The benefit of this work improvement is shown preserving acceptable accuracy,that is,reaching 2.08,2.88,and 3.87 boost-up as well as the efficiency of 1.04,0.96,and 0.97,using 2,3,and 4 cores,respectively,as attractive remarkable results.
文摘We are developing a speed reducer that can be considered a transformation of a worm gear reducer: the worm is replaced by an inverted roller screw, and the gear is replaced by a threaded chain drive. This configuration lessens wear, increases load capacity, and improves efficiency. The threaded chain consists of nut-shaped links. This paper presents the results of tests carried out on a prototype with a reduction ratio of 46.
文摘针对基于Linux和TCG软件栈(Trusted computing group Software Stack,TSS)的复杂性问题,提出一种轻量级的可信软件栈。分析了TSS的基本结构与TSS在嵌入式系统的局限,总结出基于嵌入式系统的可信软件栈设计需求,设计出软件栈命令调用的机制和软件栈的结构。此外,分析了TSS密钥管理缓存算法,在flash中定义一块密钥槽空间,方便密钥管理中直接访问,阐述密钥生成的逻辑过程,实现面向嵌入式系统的可信软件系统。经实验验证,该软件栈可以结合RT-Thread实时系统实现基本的可信计算功能。