Aim To detect sensor failure in control system using a single sensor signal. Methods A neural predictor was designed based on a radial basis function network(RBFN), and the neural predictor learned the sensor sig...Aim To detect sensor failure in control system using a single sensor signal. Methods A neural predictor was designed based on a radial basis function network(RBFN), and the neural predictor learned the sensor signal on line with a hybrid algorithm composed of n means clustering and Kalman filter and then gave the estimation of the sensor signal at the next step. If the difference between the estimation and the actural values of the sensor signal exceeded a threshold, the sensor could be declared to have a failure. The choice of the failure detection threshold depends on the noise variance and the possible prediction error of neural predictor. Results and Conclusion\ The computer simulation results show the proposed method can detect sensor failure correctly for a gyro in an automotive engine.展开更多
Failure detection module is one of important components in fault-tolerant distributed systems,especially cloud platform.However,to achieve fast and accurate detection of failure becomes more and more difficult especia...Failure detection module is one of important components in fault-tolerant distributed systems,especially cloud platform.However,to achieve fast and accurate detection of failure becomes more and more difficult especially when network and other resources' status keep changing.This study presented an efficient adaptive failure detection mechanism based on volterra series,which can use a small amount of data for predicting.The mechanism uses a volterra filter for time series prediction and a decision tree for decision making.Major contributions are applying volterra filter in cloud failure prediction,and introducing a user factor for different QoS requirements in different modules and levels of IaaS.Detailed implementation is proposed,and an evaluation is performed in Beijing and Guangzhou experiment environment.展开更多
Predicting heartbeat message arrival time is crucial for the quality of failure detection service over intemet. However, intemet dynamic characteristics make it very difficult to understand message behavior and accura...Predicting heartbeat message arrival time is crucial for the quality of failure detection service over intemet. However, intemet dynamic characteristics make it very difficult to understand message behavior and accurately predict heartbeat arrival time. To solve this problem, a novel black-box model is proposed to predict the next heartbeat arrival time. Heartbeat arrival time is modeled as auto-regressive process, heartbeat sending time is modeled as exogenous variable, the model' s coefficients are estimated based on the sliding window of observations and this result is used to predict the next heartbeat arrival time. Simulation shows that this adaptive auto-regressive exogenous (ARX) model can accurately capture heartbeat arrival dynamics and minimize prediction error in different network environments.展开更多
As the mean-time-between-failures(MTBF)continues to decline with the increasing number of components on large-scale high performance computing(HPC)systems,program failures might occur during the execution period with ...As the mean-time-between-failures(MTBF)continues to decline with the increasing number of components on large-scale high performance computing(HPC)systems,program failures might occur during the execution period with high probability.Ensuring successful execution of the HPC programs has become an issue that the unprivileged users should be concerned.From the user perspective,if the program failure cannot be detected and handled in time,it would waste resources and delay the progress of program execution.Unfortunately,the unprivileged users are unable to perform program state checking due to execution control by the job management system as well as the limited privilege.Currently,automated tools for supporting user-level failure detection and autorecovery of parallel programs in HPC systems are missing.This paper proposes an innovative method for the unprivileged user to achieve failure detection of job execution and automatic resubmission of failed jobs.The state checker in our method is encapsulated as an independent job to reduce interference with the user jobs.In addition,we propose a dual-checker mechanism to improve the robustness of our approach.We implement the proposed method as a tool named automatic re-launcher(ARL)and evaluate it on the Tianhe-2 system.Experiment results show that ARL can detect the execution failures effectively on Tianhe-2 system.In addition,the communication and performance overhead caused by ARL is negligible.The good scalability of ARL makes it applicable for large-scale HPC systems.展开更多
This work presents the application of the technique named signal analysis based on chaos using density of maxima to analyze brushless direct current motors.It uses a correlation coefficient estimated from the density ...This work presents the application of the technique named signal analysis based on chaos using density of maxima to analyze brushless direct current motors.It uses a correlation coefficient estimated from the density of maxima of the current signal.This study demonstrates in experiments the speed estimation of a brushless motor on a testbench and failure detection in a small flying drone.The experimental results demonstrate that it is possible to estimate the speed in 97.8%of the cases and to detect failure in 82.75%of the analyzed cases.展开更多
A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes ...A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes guides a link failure.This link failure creates more data packet drops that can cause a long time delay.As a result,measuring accurate link failure time is the key factor in the MANET.This paper presents a Fuzzy Linear Regression Method to measure Link Failure(FLRLF)and provide an optimal route in the MANET-Internet of Things(IoT).This work aims to predict link failure and improve routing efficiency in MANET.The Fuzzy Linear Regression Method(FLRM)measures the long lifespan link based on the link failure.The mobile node group is built by the Received Signal Strength(RSS).The Hill Climbing(HC)method selects the Group Leader(GL)based on node mobility,node degree and node energy.Additionally,it uses a Data Gathering node forward the infor-mation from GL to the sink node through multiple GL.The GL is identified by linking lifespan and energy using the Particle Swarm Optimization(PSO)algo-rithm.The simulation results demonstrate that the FLRLF approach increases the GL lifespan and minimizes the link failure time in the MANET.展开更多
A new integrity metric for navigation systems is proposed based on the measurement domain. Proba-hilistic optimization design offers tools for fault detection by considering the required navigation performance (RNP)...A new integrity metric for navigation systems is proposed based on the measurement domain. Proba-hilistic optimization design offers tools for fault detection by considering the required navigation performance (RNP) parameter and the uncertainty noise. The choice of the proper performance parameter provided the single-valued mapping with the missed detection probability estimates the probability of failure. The desirable characteristics of the residual sensitivity matrix are exploited to increase the efficiency for identifying erroneous observations. The algorithm can be used to support the performance specification and the efficient calculation of the integrity monitoring process. The simulation for non-precision approach (NPA) validates both the viability and the effectiveness of the proposed algorithm.展开更多
To deal with the adverse influence of model failures on Kalman filtering (KF) estimation, it is necessary to investigate the generalized reliability theory, including the model failure detection and identification m...To deal with the adverse influence of model failures on Kalman filtering (KF) estimation, it is necessary to investigate the generalized reliability theory, including the model failure detection and identification method as well as the separability and reliability theories. Although the generalized reliability theory for the least square has been discussed for many decades, the generalized reliability theory of KF is not widely discussed. Compared with the least square, KF includes not only the measurement model, but also the dynamic model. In KF, the predicted value of the state parameters from the dynamic model is considered as pseudomeasurements and combined with the observed measurements to compose the form of the least square. According to the reliability of the least square, the generalized reliability of KF is derived. Then, the dynamic model failure of precise point positioning is simulated to demonstrate the usage of the generalized reliability theory. The results show that the adverse influence of the dynamic model failure is more severe than that of the measurement model. Moreover, it is recommended that the model failure identification should always be used even if the overall model test passes. It is shown that the derived generalized reliability measures are suitable for the generalized KF estimation.展开更多
With the advancement of technology in recent years, effective fault diagnosis became a necessity to verify the performance and ensure the quality of complex systems. In this paper, an original verification methodology...With the advancement of technology in recent years, effective fault diagnosis became a necessity to verify the performance and ensure the quality of complex systems. In this paper, an original verification methodology for complex consumer electronic devices is presented. Verification of the system which consists of hardware (integrated circuit) and corresponding software within a flat panel TV set is in the focus. Proposed methodology provides reliable functional failure detection using the concept of black box testing. Further, the approach is fully automated, improving the reliability and speed of failure detection. The methodology effectiveness has been experimentally evaluated and the analysis results have been reported.展开更多
An integration processing system of three-dimensional laser scanning information visualization in goaf was developed. It is provided with multiple functions, such as laser scanning information management for goaf, clo...An integration processing system of three-dimensional laser scanning information visualization in goaf was developed. It is provided with multiple functions, such as laser scanning information management for goaf, cloud data de-noising optimization, construction, display and operation of three-dimensional model, model editing, profile generation, calculation of goaf volume and roof area, Boolean calculation among models and interaction with the third party soft ware. Concerning this system with a concise interface, plentiful data input/output interfaces, it is featured with high integration, simple and convenient operations of applications. According to practice, in addition to being well-adapted, this system is favorably reliable and stable.展开更多
Malfunction or breakdown of certain mission critical systems(MCSs) may cause losses of life, damage the environments, and/or lead to high costs. Therefore, recognition of emerging failures and preventive maintenance a...Malfunction or breakdown of certain mission critical systems(MCSs) may cause losses of life, damage the environments, and/or lead to high costs. Therefore, recognition of emerging failures and preventive maintenance are essential for reliable operation of MCSs. There is a practical approach for identifying and forecasting failures based on the indicators obtained from real life processes. We aim to develop means for performing active failure diagnosis and forecasting based on monitoring statistical changes of generic signal features in the specific operation modes of the system. In this paper, we present a new approach for identifying emerging failures based on their manifestations in system signals. Our approach benefits from the dynamic management of the system operation modes and from simultaneous processing and characterization of multiple heterogeneous signal sources. It improves the reliability of failure diagnosis and forecasting by investigating system performance in various operation modes, includes reasoning about failures and forming of failures using a failure indicator matrix which is composed of statistical deviation of signal characteristics between normal and failed operations, and implements a failure indicator concept that can be used as a plug and play failure diagnosis and failure forecasting feature of cyber-physical systems. We demonstrate that our method can automate failure diagnosis in the MCSs and lend the MCSs to the development of decision support systems for preventive maintenance.展开更多
The connectivity of a strongly connected network may be destroyed after link damage.Since many net- works are connected by directed links,the reachability may be restored by altering the direction of one or more of th...The connectivity of a strongly connected network may be destroyed after link damage.Since many net- works are connected by directed links,the reachability may be restored by altering the direction of one or more of the links and thus reconfigoring the network.The location of the failed link must first be determined.In this paper,we examine new methods to determine the location of failed links and nodes in networks.A routing test approach is proposed and the conditions under which communication networks may be tested are discussed. Finally,an adaptive algorithm and a heuristic algorithm that can locate a single failed llnk or a single failed node are presented.展开更多
文摘Aim To detect sensor failure in control system using a single sensor signal. Methods A neural predictor was designed based on a radial basis function network(RBFN), and the neural predictor learned the sensor signal on line with a hybrid algorithm composed of n means clustering and Kalman filter and then gave the estimation of the sensor signal at the next step. If the difference between the estimation and the actural values of the sensor signal exceeded a threshold, the sensor could be declared to have a failure. The choice of the failure detection threshold depends on the noise variance and the possible prediction error of neural predictor. Results and Conclusion\ The computer simulation results show the proposed method can detect sensor failure correctly for a gyro in an automotive engine.
基金supported by the National High-tech Research and Development Program(863) of China under Grant No. 2011AA01A102
文摘Failure detection module is one of important components in fault-tolerant distributed systems,especially cloud platform.However,to achieve fast and accurate detection of failure becomes more and more difficult especially when network and other resources' status keep changing.This study presented an efficient adaptive failure detection mechanism based on volterra series,which can use a small amount of data for predicting.The mechanism uses a volterra filter for time series prediction and a decision tree for decision making.Major contributions are applying volterra filter in cloud failure prediction,and introducing a user factor for different QoS requirements in different modules and levels of IaaS.Detailed implementation is proposed,and an evaluation is performed in Beijing and Guangzhou experiment environment.
基金the National Basic Research Program of China(No.2003CB314806)China Next Generation Intemet Project(CNGI-04-6-2T)
文摘Predicting heartbeat message arrival time is crucial for the quality of failure detection service over intemet. However, intemet dynamic characteristics make it very difficult to understand message behavior and accurately predict heartbeat arrival time. To solve this problem, a novel black-box model is proposed to predict the next heartbeat arrival time. Heartbeat arrival time is modeled as auto-regressive process, heartbeat sending time is modeled as exogenous variable, the model' s coefficients are estimated based on the sliding window of observations and this result is used to predict the next heartbeat arrival time. Simulation shows that this adaptive auto-regressive exogenous (ARX) model can accurately capture heartbeat arrival dynamics and minimize prediction error in different network environments.
基金This work was supported by National Key R&D Program of China(2020YFB150001)the National Natural Science Foundation of China(Grant No.62072018).
文摘As the mean-time-between-failures(MTBF)continues to decline with the increasing number of components on large-scale high performance computing(HPC)systems,program failures might occur during the execution period with high probability.Ensuring successful execution of the HPC programs has become an issue that the unprivileged users should be concerned.From the user perspective,if the program failure cannot be detected and handled in time,it would waste resources and delay the progress of program execution.Unfortunately,the unprivileged users are unable to perform program state checking due to execution control by the job management system as well as the limited privilege.Currently,automated tools for supporting user-level failure detection and autorecovery of parallel programs in HPC systems are missing.This paper proposes an innovative method for the unprivileged user to achieve failure detection of job execution and automatic resubmission of failed jobs.The state checker in our method is encapsulated as an independent job to reduce interference with the user jobs.In addition,we propose a dual-checker mechanism to improve the robustness of our approach.We implement the proposed method as a tool named automatic re-launcher(ARL)and evaluate it on the Tianhe-2 system.Experiment results show that ARL can detect the execution failures effectively on Tianhe-2 system.In addition,the communication and performance overhead caused by ARL is negligible.The good scalability of ARL makes it applicable for large-scale HPC systems.
基金the Higher Education Improvement Coordination(CAPES)the National Council for the Scientific and Technological Development(CNPq),Brazil。
文摘This work presents the application of the technique named signal analysis based on chaos using density of maxima to analyze brushless direct current motors.It uses a correlation coefficient estimated from the density of maxima of the current signal.This study demonstrates in experiments the speed estimation of a brushless motor on a testbench and failure detection in a small flying drone.The experimental results demonstrate that it is possible to estimate the speed in 97.8%of the cases and to detect failure in 82.75%of the analyzed cases.
文摘A Mobile Ad-hoc NETwork(MANET)contains numerous mobile nodes,and it forms a structure-less network associated with wireless links.But,the node movement is the key feature of MANETs;hence,the quick action of the nodes guides a link failure.This link failure creates more data packet drops that can cause a long time delay.As a result,measuring accurate link failure time is the key factor in the MANET.This paper presents a Fuzzy Linear Regression Method to measure Link Failure(FLRLF)and provide an optimal route in the MANET-Internet of Things(IoT).This work aims to predict link failure and improve routing efficiency in MANET.The Fuzzy Linear Regression Method(FLRM)measures the long lifespan link based on the link failure.The mobile node group is built by the Received Signal Strength(RSS).The Hill Climbing(HC)method selects the Group Leader(GL)based on node mobility,node degree and node energy.Additionally,it uses a Data Gathering node forward the infor-mation from GL to the sink node through multiple GL.The GL is identified by linking lifespan and energy using the Particle Swarm Optimization(PSO)algo-rithm.The simulation results demonstrate that the FLRLF approach increases the GL lifespan and minimizes the link failure time in the MANET.
基金Supported by the National High Technology Research and Development Program of China (‘863’Program) (2006AA12Z313)~~
文摘A new integrity metric for navigation systems is proposed based on the measurement domain. Proba-hilistic optimization design offers tools for fault detection by considering the required navigation performance (RNP) parameter and the uncertainty noise. The choice of the proper performance parameter provided the single-valued mapping with the missed detection probability estimates the probability of failure. The desirable characteristics of the residual sensitivity matrix are exploited to increase the efficiency for identifying erroneous observations. The algorithm can be used to support the performance specification and the efficient calculation of the integrity monitoring process. The simulation for non-precision approach (NPA) validates both the viability and the effectiveness of the proposed algorithm.
基金supported by the National Natural Science Foundation of China (41074010)the National Science and Technology Planning Projects (2012BAC25B01)+1 种基金the Knowledge Innovation Program of the Chinese Academy of Sciences (KZCX2-EW-QN605)the President Fund of University of Chinese Academy of Sciences
文摘To deal with the adverse influence of model failures on Kalman filtering (KF) estimation, it is necessary to investigate the generalized reliability theory, including the model failure detection and identification method as well as the separability and reliability theories. Although the generalized reliability theory for the least square has been discussed for many decades, the generalized reliability theory of KF is not widely discussed. Compared with the least square, KF includes not only the measurement model, but also the dynamic model. In KF, the predicted value of the state parameters from the dynamic model is considered as pseudomeasurements and combined with the observed measurements to compose the form of the least square. According to the reliability of the least square, the generalized reliability of KF is derived. Then, the dynamic model failure of precise point positioning is simulated to demonstrate the usage of the generalized reliability theory. The results show that the adverse influence of the dynamic model failure is more severe than that of the measurement model. Moreover, it is recommended that the model failure identification should always be used even if the overall model test passes. It is shown that the derived generalized reliability measures are suitable for the generalized KF estimation.
文摘With the advancement of technology in recent years, effective fault diagnosis became a necessity to verify the performance and ensure the quality of complex systems. In this paper, an original verification methodology for complex consumer electronic devices is presented. Verification of the system which consists of hardware (integrated circuit) and corresponding software within a flat panel TV set is in the focus. Proposed methodology provides reliable functional failure detection using the concept of black box testing. Further, the approach is fully automated, improving the reliability and speed of failure detection. The methodology effectiveness has been experimentally evaluated and the analysis results have been reported.
基金Project(51274250)supported by the National Natural Science Foundation of ChinaProject(2012BAK09B02-05)supported by the National Key Technology R&D Program during the 12th Five-year Plan of China
文摘An integration processing system of three-dimensional laser scanning information visualization in goaf was developed. It is provided with multiple functions, such as laser scanning information management for goaf, cloud data de-noising optimization, construction, display and operation of three-dimensional model, model editing, profile generation, calculation of goaf volume and roof area, Boolean calculation among models and interaction with the third party soft ware. Concerning this system with a concise interface, plentiful data input/output interfaces, it is featured with high integration, simple and convenient operations of applications. According to practice, in addition to being well-adapted, this system is favorably reliable and stable.
文摘Malfunction or breakdown of certain mission critical systems(MCSs) may cause losses of life, damage the environments, and/or lead to high costs. Therefore, recognition of emerging failures and preventive maintenance are essential for reliable operation of MCSs. There is a practical approach for identifying and forecasting failures based on the indicators obtained from real life processes. We aim to develop means for performing active failure diagnosis and forecasting based on monitoring statistical changes of generic signal features in the specific operation modes of the system. In this paper, we present a new approach for identifying emerging failures based on their manifestations in system signals. Our approach benefits from the dynamic management of the system operation modes and from simultaneous processing and characterization of multiple heterogeneous signal sources. It improves the reliability of failure diagnosis and forecasting by investigating system performance in various operation modes, includes reasoning about failures and forming of failures using a failure indicator matrix which is composed of statistical deviation of signal characteristics between normal and failed operations, and implements a failure indicator concept that can be used as a plug and play failure diagnosis and failure forecasting feature of cyber-physical systems. We demonstrate that our method can automate failure diagnosis in the MCSs and lend the MCSs to the development of decision support systems for preventive maintenance.
文摘The connectivity of a strongly connected network may be destroyed after link damage.Since many net- works are connected by directed links,the reachability may be restored by altering the direction of one or more of the links and thus reconfigoring the network.The location of the failed link must first be determined.In this paper,we examine new methods to determine the location of failed links and nodes in networks.A routing test approach is proposed and the conditions under which communication networks may be tested are discussed. Finally,an adaptive algorithm and a heuristic algorithm that can locate a single failed llnk or a single failed node are presented.