Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-f...Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-form solu-tion due to the nonlinearity of HJI equation,and many iterative algorithms are proposed to solve the HJI equation.Simultane-ous policy updating algorithm(SPUA)is an effective algorithm for solving HJI equation,but it is an on-policy integral reinforce-ment learning(IRL).For online implementation of SPUA,the dis-turbance signals need to be adjustable,which is unrealistic.In this paper,an off-policy IRL algorithm based on SPUA is pro-posed without making use of any knowledge of the systems dynamics.Then,a neural-network based online adaptive critic implementation scheme of the off-policy IRL algorithm is pre-sented.Based on the online off-policy IRL method,a computa-tional intelligence interception guidance(CIIG)law is developed for intercepting high-maneuvering target.As a model-free method,intercepting targets can be achieved through measur-ing system data online.The effectiveness of the CIIG is verified through two missile and target engagement scenarios.展开更多
In real-time hybrid simulation(RTHS), it is difficult if not impossible to completely erase the error in restoring force due to actuator response delay using existing displacement-based compensation methods. This pa...In real-time hybrid simulation(RTHS), it is difficult if not impossible to completely erase the error in restoring force due to actuator response delay using existing displacement-based compensation methods. This paper proposes a new force correction method based on online discrete tangent stiffness estimation(online DTSE) to provide accurate online estimation of the instantaneous stiffness of the physical substructure. Following the discrete curve parameter recognition theory, the online DTSE method estimates the instantaneous stiffness mainly through adaptively building a fuzzy segment with the latest measurements, constructing several strict bounding lines of the segment and calculating the slope of the strict bounding lines, which significantly improves the calculation efficiency and accuracy for the instantaneous stiffness estimation. The results of both computational simulation and real-time hybrid simulation show that:(1) the online DTSE method has high calculation efficiency, of which the relatively short computation time will not interrupt RTHS; and(2) the online DTSE method provides better estimation for the instantaneous stiffness, compared with other existing estimation methods. Due to the quick and accurate estimation of instantaneous stiffness, the online DTSE method therefore provides a promising technique to correct restoring forces in RTHS.展开更多
Current public-opinion propagation research usually focused on closed network topologies without considering the fluctuation of the number of network users or the impact of social factors on propagation. Thus, it rema...Current public-opinion propagation research usually focused on closed network topologies without considering the fluctuation of the number of network users or the impact of social factors on propagation. Thus, it remains difficult to accurately describe the public-opinion propagation rules of social networks. In order to study the rules of public opinion spread on dynamic social networks, by analyzing the activity of social-network users and the regulatory role of relevant departments in the spread of public opinion, concepts of additional user and offline rates are introduced, and the direct immune-susceptible, contacted, infected, and refractory (DI-SCIR) public-opinion propagation model based on real-time online users is established. The interventional force of relevant departments, credibility of real information, and time of intervention are considered, and a public-opinion propagation control strategy based on direct immunity is proposed. The equilibrium point and the basic reproduction number of the model are theoretically analyzed to obtain boundary conditions for public-opinion propagation. Simulation results show that the new model can accurately reflect the propagation rules of public opinion. When the basic reproduction number is less than 1, public opinion will eventually disappear in the network. Social factors can significantly influence the time and scope of public opinion spread on social networks. By controlling social factors, relevant departments can analyze the rules of public opinion spread on social networks to suppress the propagate of negative public opinion and provide a powerful tool to ensure security and stability of society.展开更多
Failures are very common during the online real-time monitoring of large quantities of complex liquids in industrial processes, and can result in excessive resource consumption and pollution. In this study, we introdu...Failures are very common during the online real-time monitoring of large quantities of complex liquids in industrial processes, and can result in excessive resource consumption and pollution. In this study, we introduce a monitoring method capable of non-contact original-state online real-time monitoring for strongly coated, high-salinity, and multi-component liquids. The principle of the method is to establish the relationship among the concentration of the target substance in the liquid (C), the color space coor- dinates of the target substance at different concentrations (L*, a*, b*), and the maximum absorption wave- length (λmax); subsequently, the optimum wavelength λT of the liquid is determined by a high-precision scanning-type monitoring system that is used to detect the instantaneous concentration of the target substance in the flowing liquid. Unlike traditional monitoring methods and existing online monitoring methods, the proposed method does not require any pretreatment of the samples (i.e., filtration, dilution, oxidation/reduction, addition of chromogenic agent, constant volume, etc.), and it is capable of original- state online real-time monitoring. This method is employed at a large electrolytic manganese plant to monitor the Fe3. concentration in the colloidal process of the plant's aging liquid (where the concentra- tions of Fe3+, Mn2+, and (NH4)2SO4 are 0.5-18 mg.L 1, 35-39 g.L 1, and 90-110 g.L 1, respectively). The relative error of this monitoring method compared with an off-line laboratory monitoring is less than 2%.展开更多
This paper describes a cooperative decentralized architecture for reactive real-time route guidance. The architecture is cooperative in the sense that it allows adjacent local controllers to exchange information regar...This paper describes a cooperative decentralized architecture for reactive real-time route guidance. The architecture is cooperative in the sense that it allows adjacent local controllers to exchange information regarding the traffic conditions in their territories. A set of local decision rules and associated heuristic functions to support the cooperative architecture are specified. A protocol governing the knowledge exchange among local adjacent controllers is developed. A simulation-assignment modeling framework is used for assessing the effectiveness of this cooperative architecture under various levels of controller knowledge and network traffic congestion. The cooperative decentralized system is tested under various scenarios of knowledge and cooperation and network traffic demand levels. The cooperative system is compared against the shortest path algorithm as a benchmark.展开更多
An impact point prediction(IPP) guidance based on supervised learning is proposed to address the problem of precise guidance for the ballistic missile in high maneuver penetration condition.An accurate ballistic traje...An impact point prediction(IPP) guidance based on supervised learning is proposed to address the problem of precise guidance for the ballistic missile in high maneuver penetration condition.An accurate ballistic trajectory model is applied to generate training samples,and ablation experiments are conducted to determine the mapping relationship between the flight state and the impact point.At the same time,the impact point coordinates are decoupled to improve the prediction accuracy,and the sigmoid activation function is improved to ameliorate the prediction efficiency.Therefore,an IPP neural network model,which solves the contradiction between the accuracy and the speed of the IPP,is established.In view of the performance deviation of the divert control system,the mapping relationship between the guidance parameters and the impact deviation is analysed based on the variational principle.In addition,a fast iterative model of guidance parameters is designed for reference to the Newton iteration method,which solves the nonlinear strong coupling problem of the guidance parameter solution.Monte Carlo simulation results show that the prediction accuracy of the impact point is high,with a 3 σ prediction error of 4.5 m,and the guidance method is robust,with a 3 σ error of 7.5 m.On the STM32F407 singlechip microcomputer,a single IPP takes about 2.374 ms,and a single guidance solution takes about9.936 ms,which has a good real-time performance and a certain engineering application value.展开更多
This paper presents a neighborhood optimal trajectory online correction algorithm considering terminal time variation,and investigates its application range.Firstly,the motion model of midcourse guidance is establishe...This paper presents a neighborhood optimal trajectory online correction algorithm considering terminal time variation,and investigates its application range.Firstly,the motion model of midcourse guidance is established,and the online trajectory correction-regenerating strategy is introduced.Secondly,based on the neighborhood optimal control theory,a neighborhood optimal trajectory online correction algorithm considering the terminal time variation is proposed by adding the consideration of terminal time variation to the traditional neighborhood optimal trajectory correction method.Thirdly,the Monte Carlo simulation method is used to analyze the application range of the algorithm,which provides a basis for the division of application domain of the online correction algorithm and the online regeneration algorithm of midcourse guidance trajectory.Finally,the simulation results show that the algorithm has high real-time performance,and the online correction trajectory can meet the requirements of terminal constraint change.The application range of the algorithm is obtained through Monte Carlo simulation.展开更多
The probabilistic real-time automaton (PRTA) is a representation of dynamic processes arising in the sciences and industry. Currently, the induction of automata is divided into two steps: the creation of the prefix...The probabilistic real-time automaton (PRTA) is a representation of dynamic processes arising in the sciences and industry. Currently, the induction of automata is divided into two steps: the creation of the prefix tree acceptor (PTA) and the merge procedure based on clustering of the states. These two steps can be very time intensive when a PRTA is to be induced for massive or even unbounded datasets. The latter one can be efficiently processed, as there exist scalable online clustering algorithms. However, the creation of the PTA still can be very time consuming. To overcome this problem, we propose a genuine online PRTA induction approach that incorporates new instances by first collapsing them and then using a maximum frequent pattern based clustering. The approach is tested against a predefined synthetic automaton and real world datasets, for which the approach is scalable and stable. Moreover, we present a broad evaluation on a real world disease group dataset that shows the applicability of such a model to the analysis of medical processes.展开更多
文摘Missile interception problem can be regarded as a two-person zero-sum differential games problem,which depends on the solution of Hamilton-Jacobi-Isaacs(HJI)equa-tion.It has been proved impossible to obtain a closed-form solu-tion due to the nonlinearity of HJI equation,and many iterative algorithms are proposed to solve the HJI equation.Simultane-ous policy updating algorithm(SPUA)is an effective algorithm for solving HJI equation,but it is an on-policy integral reinforce-ment learning(IRL).For online implementation of SPUA,the dis-turbance signals need to be adjustable,which is unrealistic.In this paper,an off-policy IRL algorithm based on SPUA is pro-posed without making use of any knowledge of the systems dynamics.Then,a neural-network based online adaptive critic implementation scheme of the off-policy IRL algorithm is pre-sented.Based on the online off-policy IRL method,a computa-tional intelligence interception guidance(CIIG)law is developed for intercepting high-maneuvering target.As a model-free method,intercepting targets can be achieved through measur-ing system data online.The effectiveness of the CIIG is verified through two missile and target engagement scenarios.
基金Priority Academic Program Development of Jiangsu Higher Education Institutions under Grant No.1105007002National Natural Science Foundation of China under Grant No.51378107 and No.51678147
文摘In real-time hybrid simulation(RTHS), it is difficult if not impossible to completely erase the error in restoring force due to actuator response delay using existing displacement-based compensation methods. This paper proposes a new force correction method based on online discrete tangent stiffness estimation(online DTSE) to provide accurate online estimation of the instantaneous stiffness of the physical substructure. Following the discrete curve parameter recognition theory, the online DTSE method estimates the instantaneous stiffness mainly through adaptively building a fuzzy segment with the latest measurements, constructing several strict bounding lines of the segment and calculating the slope of the strict bounding lines, which significantly improves the calculation efficiency and accuracy for the instantaneous stiffness estimation. The results of both computational simulation and real-time hybrid simulation show that:(1) the online DTSE method has high calculation efficiency, of which the relatively short computation time will not interrupt RTHS; and(2) the online DTSE method provides better estimation for the instantaneous stiffness, compared with other existing estimation methods. Due to the quick and accurate estimation of instantaneous stiffness, the online DTSE method therefore provides a promising technique to correct restoring forces in RTHS.
基金Project supported by the National Natural Science Foundation of China (Grant No. 61471080)the Equipment Development Department Research Foundation of China (Grant No. 61400010303)+2 种基金the Natural Science Research Project of Liaoning Education Department of China (Grant Nos. JDL2019019 and JDL2020002)the Surface Project for Natural Science Foundation in Guangdong Province of China (Grant No. 2019A1515011164)the Science and Technology Plan Project in Zhanjiang, China (Grant No. 2018A06001)。
文摘Current public-opinion propagation research usually focused on closed network topologies without considering the fluctuation of the number of network users or the impact of social factors on propagation. Thus, it remains difficult to accurately describe the public-opinion propagation rules of social networks. In order to study the rules of public opinion spread on dynamic social networks, by analyzing the activity of social-network users and the regulatory role of relevant departments in the spread of public opinion, concepts of additional user and offline rates are introduced, and the direct immune-susceptible, contacted, infected, and refractory (DI-SCIR) public-opinion propagation model based on real-time online users is established. The interventional force of relevant departments, credibility of real information, and time of intervention are considered, and a public-opinion propagation control strategy based on direct immunity is proposed. The equilibrium point and the basic reproduction number of the model are theoretically analyzed to obtain boundary conditions for public-opinion propagation. Simulation results show that the new model can accurately reflect the propagation rules of public opinion. When the basic reproduction number is less than 1, public opinion will eventually disappear in the network. Social factors can significantly influence the time and scope of public opinion spread on social networks. By controlling social factors, relevant departments can analyze the rules of public opinion spread on social networks to suppress the propagate of negative public opinion and provide a powerful tool to ensure security and stability of society.
文摘Failures are very common during the online real-time monitoring of large quantities of complex liquids in industrial processes, and can result in excessive resource consumption and pollution. In this study, we introduce a monitoring method capable of non-contact original-state online real-time monitoring for strongly coated, high-salinity, and multi-component liquids. The principle of the method is to establish the relationship among the concentration of the target substance in the liquid (C), the color space coor- dinates of the target substance at different concentrations (L*, a*, b*), and the maximum absorption wave- length (λmax); subsequently, the optimum wavelength λT of the liquid is determined by a high-precision scanning-type monitoring system that is used to detect the instantaneous concentration of the target substance in the flowing liquid. Unlike traditional monitoring methods and existing online monitoring methods, the proposed method does not require any pretreatment of the samples (i.e., filtration, dilution, oxidation/reduction, addition of chromogenic agent, constant volume, etc.), and it is capable of original- state online real-time monitoring. This method is employed at a large electrolytic manganese plant to monitor the Fe3. concentration in the colloidal process of the plant's aging liquid (where the concentra- tions of Fe3+, Mn2+, and (NH4)2SO4 are 0.5-18 mg.L 1, 35-39 g.L 1, and 90-110 g.L 1, respectively). The relative error of this monitoring method compared with an off-line laboratory monitoring is less than 2%.
文摘This paper describes a cooperative decentralized architecture for reactive real-time route guidance. The architecture is cooperative in the sense that it allows adjacent local controllers to exchange information regarding the traffic conditions in their territories. A set of local decision rules and associated heuristic functions to support the cooperative architecture are specified. A protocol governing the knowledge exchange among local adjacent controllers is developed. A simulation-assignment modeling framework is used for assessing the effectiveness of this cooperative architecture under various levels of controller knowledge and network traffic congestion. The cooperative decentralized system is tested under various scenarios of knowledge and cooperation and network traffic demand levels. The cooperative system is compared against the shortest path algorithm as a benchmark.
基金supported by the National Natural Science Foundation of China (Grant No.62103432)supported by Young Talent fund of University Association for Science and Technology in Shaanxi, China(Grant No.20210108)。
文摘An impact point prediction(IPP) guidance based on supervised learning is proposed to address the problem of precise guidance for the ballistic missile in high maneuver penetration condition.An accurate ballistic trajectory model is applied to generate training samples,and ablation experiments are conducted to determine the mapping relationship between the flight state and the impact point.At the same time,the impact point coordinates are decoupled to improve the prediction accuracy,and the sigmoid activation function is improved to ameliorate the prediction efficiency.Therefore,an IPP neural network model,which solves the contradiction between the accuracy and the speed of the IPP,is established.In view of the performance deviation of the divert control system,the mapping relationship between the guidance parameters and the impact deviation is analysed based on the variational principle.In addition,a fast iterative model of guidance parameters is designed for reference to the Newton iteration method,which solves the nonlinear strong coupling problem of the guidance parameter solution.Monte Carlo simulation results show that the prediction accuracy of the impact point is high,with a 3 σ prediction error of 4.5 m,and the guidance method is robust,with a 3 σ error of 7.5 m.On the STM32F407 singlechip microcomputer,a single IPP takes about 2.374 ms,and a single guidance solution takes about9.936 ms,which has a good real-time performance and a certain engineering application value.
基金supported by the National Natural Science Foundation of China(61873278,62173339)。
文摘This paper presents a neighborhood optimal trajectory online correction algorithm considering terminal time variation,and investigates its application range.Firstly,the motion model of midcourse guidance is established,and the online trajectory correction-regenerating strategy is introduced.Secondly,based on the neighborhood optimal control theory,a neighborhood optimal trajectory online correction algorithm considering the terminal time variation is proposed by adding the consideration of terminal time variation to the traditional neighborhood optimal trajectory correction method.Thirdly,the Monte Carlo simulation method is used to analyze the application range of the algorithm,which provides a basis for the division of application domain of the online correction algorithm and the online regeneration algorithm of midcourse guidance trajectory.Finally,the simulation results show that the algorithm has high real-time performance,and the online correction trajectory can meet the requirements of terminal constraint change.The application range of the algorithm is obtained through Monte Carlo simulation.
文摘The probabilistic real-time automaton (PRTA) is a representation of dynamic processes arising in the sciences and industry. Currently, the induction of automata is divided into two steps: the creation of the prefix tree acceptor (PTA) and the merge procedure based on clustering of the states. These two steps can be very time intensive when a PRTA is to be induced for massive or even unbounded datasets. The latter one can be efficiently processed, as there exist scalable online clustering algorithms. However, the creation of the PTA still can be very time consuming. To overcome this problem, we propose a genuine online PRTA induction approach that incorporates new instances by first collapsing them and then using a maximum frequent pattern based clustering. The approach is tested against a predefined synthetic automaton and real world datasets, for which the approach is scalable and stable. Moreover, we present a broad evaluation on a real world disease group dataset that shows the applicability of such a model to the analysis of medical processes.