期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
Active probing based Internet service fault management in uncertain and noisy environment 被引量:2
1
作者 CHU LingWei ZOU ShiHong CHENG ShiDuan WANG WenDong 《Science in China(Series F)》 2008年第11期1857-1870,共14页
In Internet service fault management based on active probing, uncertainty and noises will affect service fault management. In order to reduce the impact, challenges of Internet service fault management are analyzed in... In Internet service fault management based on active probing, uncertainty and noises will affect service fault management. In order to reduce the impact, challenges of Internet service fault management are analyzed in this paper. Bipartite Bayesian network is chosen to model the dependency relationship between faults and probes, binary symmetric channel is chosen to model noises, and a service fault management approach using active probing is proposed for such an environment. This approach is composed of two phases: fault detection and fault diagnosis. In first phase, we propose a greedy approximation probe selection algorithm (GAPSA), which selects a minimal set of probes while remaining a high probability of fault detection. In second phase, we propose a fault diagnosis probe selection algorithm (FDPSA), which selects probes to obtain more system information based on the symptoms observed in previous phase. To deal with dynamic fault set caused by fault recovery mechanism, we propose a hypothesis inference algorithm based on fault persistent time statistic (FPTS). Simulation results prove the validity and efficiency of our approach. 展开更多
关键词 service management fault management active probing bipartite Bayesian network binary symmetricchannel
原文传递
Intelligent Fault-tolerant Management of Electromechanical Equipment 被引量:1
2
作者 Wang Zhongsheng Lei Yong Jin Weihua Northwestern Polytechnical University Xi’an 710072, P.R.China 《International Journal of Plant Engineering and Management》 1997年第3期31-35,共5页
In this paper, a method of intelligent fault tolerant management on electromechanical equipment is presented. It is based on condition monitoring of equipment and realized by condition prediction and condition contro... In this paper, a method of intelligent fault tolerant management on electromechanical equipment is presented. It is based on condition monitoring of equipment and realized by condition prediction and condition control. An example is introduced and analyzed in this paper. 展开更多
关键词 electromechanical equipment condition identification intelligent fault tolerant management
下载PDF
Network Fault Diagnosis Using DSM 被引量:1
3
作者 JiangHao YanPu-liu ChenXiao WuJing 《Wuhan University Journal of Natural Sciences》 CAS 2004年第1期63-67,共5页
Difference similitude matrix (DSM) is effective in reducing information system with its higher reduction rate and higher validity. We use DSM method to analyze the fault data of computer networks and obtain the fault ... Difference similitude matrix (DSM) is effective in reducing information system with its higher reduction rate and higher validity. We use DSM method to analyze the fault data of computer networks and obtain the fault diagnosis rules. Through discretizing the relative value of fault data, we get the information system of the fault data. DSM method reduces the information system and gets the diagnosis rules. The simulation with the actual scenario shows that the fault diagnosis based on DSM can obtain few and effective rules. Key words computer networks - data reduction - fault management - difference-similitude matrix CLC number TP 393 Foundation item: Supported by the National Natural Science Foundation of China (90204008)Biography: Jiang Hao (1976-), male, Ph. D candidate, research direction: computer network, data mine. 展开更多
关键词 computer networks data reduction fault management difference-similitude matrix
下载PDF
FAULT IDENTIFICATION IN HETEROGENEOUS NETWORKS USING TIME SERIES ANALYSIS
4
作者 孙钦东 张德运 孙朝晖 《Journal of Pharmaceutical Analysis》 SCIE CAS 2004年第2期101-105,共5页
Fault management is crucial to pro vi de quality of service grantees for the future networks, and fault identification is an essential part of it. A novel fault identification algorithm is proposed in this paper, wh... Fault management is crucial to pro vi de quality of service grantees for the future networks, and fault identification is an essential part of it. A novel fault identification algorithm is proposed in this paper, which focuses on the anomaly detection of network traffic. Since the fault identification has been achieved using statistical information in mana gement information base, the algorithm is compatible with the existing simple ne twork management protocol framework. The network traffic time series is verified to be non-stationary. By fitting the adaptive autoregressive model, the series is transformed into a multidimensional vector. The training samples and identif iers are acquired from the network simulation. A k-nearest neighbor classif ier identifies the system faults after being trained. The experiment results are consistent with the given fault scenarios, which prove the accuracy of the algo rithm. The identification errors are discussed to illustrate that the novel faul t identification algorithm is adaptive in the fault scenarios with network traff ic change. 展开更多
关键词 fault management fault identification time seri es analysis adaptive autoregressive
下载PDF
Fault Aware Dynamic Resource Manager for Fault Recognition and Avoidance in Cloud
5
作者 Nandhini Jembu Mohanram Gnanasekaran Thangavel N.M.Jothi Swaroopan 《Computer Systems Science & Engineering》 SCIE EI 2021年第8期215-228,共14页
Fault tolerance(FT)schemes are intended to work on a minimized and static amount of physical resources.When a host failure occurs,the conventional FT frequently proceeds with the execution on the accessible working ho... Fault tolerance(FT)schemes are intended to work on a minimized and static amount of physical resources.When a host failure occurs,the conventional FT frequently proceeds with the execution on the accessible working hosts.This methodology saves the execution state and applications to complete without disruption.However,the dynamicity of open cloud assets is not seen when taking scheduling choices.Existing optimization techniques are intended in dealing with resource scheduling.This method will be utilized for distributing the approaching tasks to the VMs.However,the dynamic scheduling for this procedure doesn’t accomplish the objective of adaptation of internal failure.The scheme prefers jobs in the activity list with the most elevated execution time on resources that can execute in a shorter timeframe,but it suffers with higher makespan;poor resource usage and unbalance load concerns.To overcome the above mentioned issue,Fault Aware Dynamic Resource Manager(FADRM)is proposed that enhances the mechanism to Multi-stage Resilience Manager at an application-level FT arrangement.Proposed FADRM method gives FT a Multi-stage Resilience Manager(MRM)in the client and application layers,and simultaneously decreases the over-head and degradations.It additionally provides safety to the application execution considering the clients,application and framework necessities.Based on experimental evaluations,Proposed Fault Aware Dynamic Resource Manager(FADRM)method 157.5 MakeSpan(MS)time,0.38 Fault Rate(FR),0.25 Failure Delay(FD)and improves 5.5 Performance Improvement Ratio(PIR)for 25,50,75 and 100 tasks and 475 MakeSpan(MS)time,0.40 Fault Rate(FR),1.30 Failure Delay(FD)and improves 6.75 improves Performance Improvement Ratio(PER)for 100,200,300 and 500 Tasks compare than existing methodologies. 展开更多
关键词 Cloud computing fault aware dynamic resource manager fault tolerance MAKESPAN fault rate failure delay performance improvement ratio
下载PDF
Iaso: an autonomous fault-tolerant management system for supercomputers 被引量:1
6
作者 Kai LU Xiaoping WANG +6 位作者 Gen LI Ruibo WANG Wanqing CHI Yongpeng LIU Hongwei TANG Hua FENG Yinghui GAO 《Frontiers of Computer Science》 SCIE EI CSCD 2014年第3期378-390,共13页
With the increase of system scale, the inherent reliability of supercomputers becomes lower and lower. The cost of fault handling and task recovery increases so rapidly that the reliability issue will soon harm the us... With the increase of system scale, the inherent reliability of supercomputers becomes lower and lower. The cost of fault handling and task recovery increases so rapidly that the reliability issue will soon harm the usability of supercomputers. This issue is referred to as the "reliability wall", which is regarded as a critical problem for current and future supercomputers. To address this problem, we propose an autonomous fault-tolerant system, named Iaso, in MilkyWay- 2 system. Iaso introduces the concept of autonomous management in supercomputers. By autonomous management, the computer itself, rather than manpower, takes charge of the fault management work. Iaso automatically manage the whole lifecycle of faults, including fault detection, fault diagnosis, fault isolation, and task recovery. Iaso endows the autonomous features with MilkyWay-2 system, such as self-awareness, self-diagnosis, self-healing, and self-protection. With the help of Iaso, the cost of fault handling in supercomputers reduces from several hours to a few seconds. Iaso greatly improves the usability and reliability of MilkyWay-2 system. 展开更多
关键词 SUPERCOMPUTER autonomous management fault tolerant fault management MilkyWay-2 system
原文传递
A novel testability model for health management of heading attitude system 被引量:2
7
作者 Liu Guanjun Yang Shuming +1 位作者 Qiu Jing Yang Peng 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2013年第1期201-208,共8页
Prognostics and health management (PHM) is very important to guarantee the reliability and safety of aerospace systems, and sensing and test are the precondition of PHM. Integrating design for testability into early... Prognostics and health management (PHM) is very important to guarantee the reliability and safety of aerospace systems, and sensing and test are the precondition of PHM. Integrating design for testability into early design stage of system early design stage is deemed as a fundamental way to improve PHM performance, and testability model is the base of testability analysis and design. This paper discusses a hierarchical model-based approach to testability modeling and analysis for heading attitude system health management. Quantified directed graph, of which the nodes represent components and tests and the directed edges represent fault propagation paths, is used to describe fault-test dependency, and quantitative testability information is assigned to nodes and directed edges. The fault dependencies between nodes can be obtained by functional fault analysis methodology that captures the physical architecture and material flows such as energy, heat, data, and so on. By incorporating physics of failure models into component, the dynamic process of a failing or degrading component can be projected onto system behavior, i.e., system symptoms. Then, the analysis of extended failure modes, mechanisms and effects is utilized to construct fault evolution-test dependency. Using this integrated model, the designers and system analysts can assess the test suite's fault detectability, fault isolability and fault predictability. And heading attitude system application results show that the proposed model can support testability analysis and design for PHM very well. 展开更多
关键词 fault evolution Functional fault analysis Physics of failure model Prognostics and health management Quantified directed graph Testability analysis
原文传递
Performability analysis of avionics system with multilayer HM/FM using stochastic Petri nets 被引量:4
8
作者 Wan Jianxiong Xiang Xudong +3 位作者 Bai Xiaoying Lin Chuang Kong Xiangzhen Li Jianxiang 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2013年第2期363-377,共15页
The integrated modular avionics (IMA) architecture is an open standard in avionics industry, in which the number of functionalities implemented by software is greater than ever before. In the IMA architecture, the r... The integrated modular avionics (IMA) architecture is an open standard in avionics industry, in which the number of functionalities implemented by software is greater than ever before. In the IMA architecture, the reliability of the avionics system is highly affected by the software applications. In order to enhance the fault tolerance feature with regard to software application failures, many industrial standards propose a layered health monitoring/fault management (HM/FM) scheme to periodically check the health status of software application processes and recover the malfunctioning software process whenever an error is located. In this paper, we make an analytical study of the HM/FM system for avionics application software. We use the stochastic Petri nets (SPN) to build a formal model of each component and present a method to combine the components together to form a complete system model with respect to three interlayer query strategies. We further investigate the effectiveness of these strategies in an illustrative system. 展开更多
关键词 Health monitoring/fault management system Integrated modular avionics Multilayer Performability analysis Stochastic Petri nets
原文传递
Electronictization-A Foundation for Grid Modernization 被引量:1
9
作者 Don Tan 《Chinese Journal of Electrical Engineering》 2015年第1期1-8,共8页
Smart grid is the flag under which the US DoE has been mobilizing efforts to modernize the grid.Electronictization is the first step towards a smart modern grid.It is a process that transforms the grid from electrical... Smart grid is the flag under which the US DoE has been mobilizing efforts to modernize the grid.Electronictization is the first step towards a smart modern grid.It is a process that transforms the grid from electrical and electromechanical(EE)to electronic,electrical and electromechanical(EEE),laying down the very basic foundation for the modern grid.All things grid connected(ATGC)has five groups of essential hardware:1)Grid interface(smart)inverters;2)Hardware for flexible AC transmissions;3)Intelligent electronic power transformers(grid scale);4)Solid-state circuit breaker,current limiters,smart fuses and sensors;and 5)Multi-port bidirectional power&control units.Development and deployment of ATGC will be a grassroots drive to transform the grid from an old passive technology to a new active technology based on electronic power transmission,distribution,processing and protection.Grid modernization represents a win-win-win situation for the environment(Government),consumers,and grid owners/operators. 展开更多
关键词 Grid modernization electronictization fractal grid structure energy system resiliency fault management fault isolation renewable energy integration all things grid connected
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部