A gait control method for a biped robot based on the deep Q-network (DQN) algorithm is proposed to enhance the stability of walking on uneven ground. This control strategy is an intelligent learning method of posture ...A gait control method for a biped robot based on the deep Q-network (DQN) algorithm is proposed to enhance the stability of walking on uneven ground. This control strategy is an intelligent learning method of posture adjustment. A robot is taken as an agent and trained to walk steadily on an uneven surface with obstacles, using a simple reward function based on forward progress. The reward-punishment (RP) mechanism of the DQN algorithm is established after obtaining the offline gait which was generated in advance foot trajectory planning. Instead of implementing a complex dynamic model, the proposed method enables the biped robot to learn to adjust its posture on the uneven ground and ensures walking stability. The performance and effectiveness of the proposed algorithm was validated in the V-REP simulation environment. The results demonstrate that the biped robot's lateral tile angle is less than 3° after implementing the proposed method and the walking stability is obviously improved.展开更多
Deep beam anchorage structures based on spatial distribution analysis of the cable prestressed field have been proposed for roadway roof support, Stability and other factors that influence deep beam structures are stu...Deep beam anchorage structures based on spatial distribution analysis of the cable prestressed field have been proposed for roadway roof support, Stability and other factors that influence deep beam structures are studied in this paper using mechanical calculations, numerical analysis and field measurements, A mechanical model of deep beam structure subjected to multiple loading is established, including analysis of roof support in the return airway of S1203 working face in the Yuwu coal mine, China, The expression of maximum shear stress in the deep beam structure is deduced according to the stress superposition criterion, It is found that the primary factors affecting deep beam structure stability are deep beam thickness, cable pre-tension and cable spacing, The variation of maximum shear stress distribution and prestressed field diffusion effects according to various factors are analyzed using Matlah and FLAC3DTM software, and practical support parameters of the S1203 return airway roof are determined, According to the observations of rock pressure, there is no evidence of roof separation, and the maximum values of roof subsidence and convergence of wall rock are 72 and 48 mm, respectively, The results show that the proposed roof support design with a deep beam structure is feasible and achieves effective control of the roadway roof,展开更多
Focused on the lost circulation control in deep naturally fractured reservoirs, the multiscale structure of fracture plugging zone is proposed based on the theory of granular matter mechanics, and the structural failu...Focused on the lost circulation control in deep naturally fractured reservoirs, the multiscale structure of fracture plugging zone is proposed based on the theory of granular matter mechanics, and the structural failure pattern of plugging zone is developed to reveal the plugging zone failure mechanisms in deep, high temperature, high pressure, and high in-situ stress environment. Based on the fracture plugging zone strength model, key performance parameters are determined for the optimal selection of loss control material(LCM). Laboratory fracture plugging experiments with new LCM are carried out to evaluate the effect of the key performance parameters of LCM on fracture plugging quality. LCM selection strategy for fractured reservoirs is developed. The results show that the force chain formed by LCMs determines the pressure stabilization of macro-scale fracture plugging zone. Friction failure and shear failure are the two major failure patterns of fracture plugging zone. The strength of force chain depends on the performance of micro-scale LCM, and the LCM key performance parameters include particle size distribution, fiber aspect ratio, friction coefficient, compressive strength, soluble ability and high temperature resistance. Results of lab experiments and field test show that lost circulation control quality can be effectively improved with the optimal material selection based on the extracted key performance parameters of LCMs.展开更多
Preventive transient stability control is an effective measure for the power system to withstand high-probability severe contingencies.It is mathematically an optimal power flow problem with transient stability constr...Preventive transient stability control is an effective measure for the power system to withstand high-probability severe contingencies.It is mathematically an optimal power flow problem with transient stability constraints.Due to the constraints involved for differential algebraic equations of transient stability,it is difficult and time-consuming to solve this problem.To address these issues,this paper presents a novel deep reinforcement learning(DRL)framework for preventive transient stability control of power systems.A distributed deep deterministic policy gradient is utilized to train a DRL agent that can learn its control policy through massive interactions with a grid simulator.Once properly trained,the DRL agent can instantaneously provide effective strategies to adjust the system to a safe operating position with a near-optimal operational cost.The effectiveness of the proposed method is verified through numerical experiments conducted on a New England 39-bus system and NPCC 140-bus system.展开更多
The real-time transient stability assessment(TSA)and emergency control are effective measures to suppress accident expansion,prevent system instability,and avoid large-scale power outages in the event of power system ...The real-time transient stability assessment(TSA)and emergency control are effective measures to suppress accident expansion,prevent system instability,and avoid large-scale power outages in the event of power system failure.However,real-time assessment is extremely demanding on computing speed,and the traditional method is not competent.In this paper,an improved deep belief network(DBN)is proposed for the fast assessment of transient stability,which considers the structural characteristics of power system in the construction of loss function.Deep learning has been effective in many fields,but usually is considered as a black-box model.From the perspective of machine learning interpretation,this paper proposes a local linear interpreter(LLI)model,and tries to give a reasonable interpretation of the relationship between the system features and the assessment result,and illustrates the conversion process from the input feature space to the high-dimension representation space.The proposed method is tested on an IEEE new England test system and demonstrated on a regional power system in China.The result demonstrates that the proposed method has rapidity,high accuracy and good interpretability in transient stability assessment.展开更多
Emergency control is an essential means to help system maintain synchronism after fault clearance.Traditional“offline calculation,online matching”scheme faces significant challenges on adaptiveness and robustness pr...Emergency control is an essential means to help system maintain synchronism after fault clearance.Traditional“offline calculation,online matching”scheme faces significant challenges on adaptiveness and robustness problems.To address these challenges,this paper proposes a novel closed-loop framework of transient stability prediction(TSP)and emergency control based on Deep Belief Network(DBN).First,a hierarchical real-time anti-jitter TSP method using sliding time windows is adopted,which takes into account accuracy and rapidity at the same time.Next,a sensitivity regression model is established to mine the implicit relationship between power angles and sensitivity.When impending instability of the system is foreseen,optimal emergency control strategy can be determined in time.Lastly,responses after emergency control are fed back to the TSP model.If prediction result is still unstable,an additional control strategy will be implemented.Comprehensive numerical case studies are conducted on New England IEEE 39-bus system and Northeast Power Coordinated Council(NPCC)140-bus system.Results show the proposed method can detect instability of system as soon as possible and assist in maintaining reliable system synchronism.展开更多
基金Supported by the National Ministries and Research Funds(3020020221111)
文摘A gait control method for a biped robot based on the deep Q-network (DQN) algorithm is proposed to enhance the stability of walking on uneven ground. This control strategy is an intelligent learning method of posture adjustment. A robot is taken as an agent and trained to walk steadily on an uneven surface with obstacles, using a simple reward function based on forward progress. The reward-punishment (RP) mechanism of the DQN algorithm is established after obtaining the offline gait which was generated in advance foot trajectory planning. Instead of implementing a complex dynamic model, the proposed method enables the biped robot to learn to adjust its posture on the uneven ground and ensures walking stability. The performance and effectiveness of the proposed algorithm was validated in the V-REP simulation environment. The results demonstrate that the biped robot's lateral tile angle is less than 3° after implementing the proposed method and the walking stability is obviously improved.
基金provided by the National Natural Science Foundation of China (Nos. 51504259 and 51234005)the Fundamental Research Funds for the Central Universities (No. 2010QZ06)
文摘Deep beam anchorage structures based on spatial distribution analysis of the cable prestressed field have been proposed for roadway roof support, Stability and other factors that influence deep beam structures are studied in this paper using mechanical calculations, numerical analysis and field measurements, A mechanical model of deep beam structure subjected to multiple loading is established, including analysis of roof support in the return airway of S1203 working face in the Yuwu coal mine, China, The expression of maximum shear stress in the deep beam structure is deduced according to the stress superposition criterion, It is found that the primary factors affecting deep beam structure stability are deep beam thickness, cable pre-tension and cable spacing, The variation of maximum shear stress distribution and prestressed field diffusion effects according to various factors are analyzed using Matlah and FLAC3DTM software, and practical support parameters of the S1203 return airway roof are determined, According to the observations of rock pressure, there is no evidence of roof separation, and the maximum values of roof subsidence and convergence of wall rock are 72 and 48 mm, respectively, The results show that the proposed roof support design with a deep beam structure is feasible and achieves effective control of the roadway roof,
基金Supported by the National Natural Science Foundation of China(Grant No.51604236)Science and Technology Program of Sichuan Province(Grant No.2018JY0436)the Sichuan Province Youth Science and Technology Innovation Team Project(Grant No.2016TD0016)
文摘Focused on the lost circulation control in deep naturally fractured reservoirs, the multiscale structure of fracture plugging zone is proposed based on the theory of granular matter mechanics, and the structural failure pattern of plugging zone is developed to reveal the plugging zone failure mechanisms in deep, high temperature, high pressure, and high in-situ stress environment. Based on the fracture plugging zone strength model, key performance parameters are determined for the optimal selection of loss control material(LCM). Laboratory fracture plugging experiments with new LCM are carried out to evaluate the effect of the key performance parameters of LCM on fracture plugging quality. LCM selection strategy for fractured reservoirs is developed. The results show that the force chain formed by LCMs determines the pressure stabilization of macro-scale fracture plugging zone. Friction failure and shear failure are the two major failure patterns of fracture plugging zone. The strength of force chain depends on the performance of micro-scale LCM, and the LCM key performance parameters include particle size distribution, fiber aspect ratio, friction coefficient, compressive strength, soluble ability and high temperature resistance. Results of lab experiments and field test show that lost circulation control quality can be effectively improved with the optimal material selection based on the extracted key performance parameters of LCMs.
基金This work is supported by National Natural Science Foundation of China Authorized Number:U22B2097。
文摘Preventive transient stability control is an effective measure for the power system to withstand high-probability severe contingencies.It is mathematically an optimal power flow problem with transient stability constraints.Due to the constraints involved for differential algebraic equations of transient stability,it is difficult and time-consuming to solve this problem.To address these issues,this paper presents a novel deep reinforcement learning(DRL)framework for preventive transient stability control of power systems.A distributed deep deterministic policy gradient is utilized to train a DRL agent that can learn its control policy through massive interactions with a grid simulator.Once properly trained,the DRL agent can instantaneously provide effective strategies to adjust the system to a safe operating position with a near-optimal operational cost.The effectiveness of the proposed method is verified through numerical experiments conducted on a New England 39-bus system and NPCC 140-bus system.
基金supported by National Natural Science Foundation of China(No.51777104)the Science and Technology Project of the State Grid Corporation of China.
文摘The real-time transient stability assessment(TSA)and emergency control are effective measures to suppress accident expansion,prevent system instability,and avoid large-scale power outages in the event of power system failure.However,real-time assessment is extremely demanding on computing speed,and the traditional method is not competent.In this paper,an improved deep belief network(DBN)is proposed for the fast assessment of transient stability,which considers the structural characteristics of power system in the construction of loss function.Deep learning has been effective in many fields,but usually is considered as a black-box model.From the perspective of machine learning interpretation,this paper proposes a local linear interpreter(LLI)model,and tries to give a reasonable interpretation of the relationship between the system features and the assessment result,and illustrates the conversion process from the input feature space to the high-dimension representation space.The proposed method is tested on an IEEE new England test system and demonstrated on a regional power system in China.The result demonstrates that the proposed method has rapidity,high accuracy and good interpretability in transient stability assessment.
基金supported in part by the Fundamental Research Funds for the Central Universities(No.2020YJS162)the National Key R&D Program of China(No.2018YFB0904500)Science and Technology Projects of State Grid Corporation of China(No.SGLNDK00KJJS1800236).
文摘Emergency control is an essential means to help system maintain synchronism after fault clearance.Traditional“offline calculation,online matching”scheme faces significant challenges on adaptiveness and robustness problems.To address these challenges,this paper proposes a novel closed-loop framework of transient stability prediction(TSP)and emergency control based on Deep Belief Network(DBN).First,a hierarchical real-time anti-jitter TSP method using sliding time windows is adopted,which takes into account accuracy and rapidity at the same time.Next,a sensitivity regression model is established to mine the implicit relationship between power angles and sensitivity.When impending instability of the system is foreseen,optimal emergency control strategy can be determined in time.Lastly,responses after emergency control are fed back to the TSP model.If prediction result is still unstable,an additional control strategy will be implemented.Comprehensive numerical case studies are conducted on New England IEEE 39-bus system and Northeast Power Coordinated Council(NPCC)140-bus system.Results show the proposed method can detect instability of system as soon as possible and assist in maintaining reliable system synchronism.