This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while ...This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while the corrosion rate as the output.6 dif-ferent ML algorithms were used to construct the proposed model.Through optimization and filtering,the eXtreme gradient boosting(XG-Boost)model exhibited good corrosion rate prediction accuracy.The features of material properties were then transformed into atomic and physical features using the proposed property transformation approach,and the dominant descriptors that affected the corrosion rate were filtered using the recursive feature elimination(RFE)as well as XGBoost methods.The established ML models exhibited better predic-tion performance and generalization ability via property transformation descriptors.In addition,the SHapley additive exPlanations(SHAP)method was applied to analyze the relationship between the descriptors and corrosion rate.The results showed that the property transformation model could effectively help with analyzing the corrosion behavior,thereby significantly improving the generalization ability of corrosion rate prediction models.展开更多
The optimization of network topologies to retain the generalization ability by deciding when to stop overtraining an artificial neural network(ANN)is an existing vital challenge in ANN prediction works.The larger the ...The optimization of network topologies to retain the generalization ability by deciding when to stop overtraining an artificial neural network(ANN)is an existing vital challenge in ANN prediction works.The larger the dataset the ANN is trained with,the better generalization the prediction can give.In this paper,a large dataset of atmospheric corrosion data of carbon steel compiled from several resources is used to train and test a multilayer backpropagation ANN model as well as two conventional corrosion prediction models(linear and Klinesmith models).Unlike previous related works,a grid search-based hyperparameter tuning is performed to develop multiple hyperparameter combinations(network topologies)to train multiple ANNs with mini-batch stochastic gradient descent optimization algorithm to facilitate the training of a large dataset.After that,one selection strategy for the optimal hyperparameter combination is applied by an early stopping method to guarantee the generalization ability of the optimal network model.The correlation coefficients(R)of the ANN model can explain about 80%(more than 75%)of the variance of atmospheric corrosion of carbon steel,and the root mean square errors(RMSE)of three models show that the ANN model gives a better performance than the other two models with acceptable generalization.The influence of input parameters on the output is highlighted by using the fuzzy curve analysis method.The result reveals that TOW,Cl-and SO2 are the most important atmospheric chemical variables,which have a well-known nonlinear relationship with atmospheric corrosion.展开更多
The system periphery ("jieke" in Chinese) is defined as a part of the system and is adjacent to its environment. The periphery is an in- termediary agent between the system and its environment, and has two functi...The system periphery ("jieke" in Chinese) is defined as a part of the system and is adjacent to its environment. The periphery is an in- termediary agent between the system and its environment, and has two functions: defending system itself and exchanging with the environment. Generally, the periphery is defined on space dimension. We will investigate the periphery from the time dimension, and study a time jieke based on set theory viewpoint; initial values and forecast lead in weather forecast are clarified. Further predictability of weather forecast on the basis of periph- ery theory is defined; its calculation formulae are given, with which computing for day-to-day forecast were carded out. The results have been com- pared with present researches of atmospheric predictability, and it shows advancing the predictability study. Most interesting is that atmospheric pre- dictability possesses rule of gold ratio 0.618, and it is found firstly in research of weather and climate predictability.展开更多
This paper makes a review on the predictability of the atmosphere. The essential problems of predictability theory, i.e., how a deterministic system changes to an undeterministic system (chaos) and how is the opposite...This paper makes a review on the predictability of the atmosphere. The essential problems of predictability theory, i.e., how a deterministic system changes to an undeterministic system (chaos) and how is the opposite (order within chaos), are discussed. Some applications of predictability theory are given.展开更多
In this paper the concept of Chaos and its applications to the study of predictability theory is introduced. The author's attempt is to give a general overview of ideas and methods involved in this problem to scie...In this paper the concept of Chaos and its applications to the study of predictability theory is introduced. The author's attempt is to give a general overview of ideas and methods involved in this problem to scientists,who are interested in the problem of predictability but not familiar with the theory of chaos. The problem is discussed in 4 sections. In the first section, the concept of chaos and the study methods are outlined briefly; in the second section, the methods of quantitatively measuring the main characteristics of chaos which are the basis for the predictability theory are introduced; the third section discusses the time series analysis for directly studying chaotic phenomena in practical problems; and the last section presents some research results on the chaotic characteristics and the predictability of the real atmosphere.展开更多
The article is to report some results of numerical experiments on the error growth and the atmospheric predictability Experiments with two-level global baroclinic primitive equation spectral model have main results as...The article is to report some results of numerical experiments on the error growth and the atmospheric predictability Experiments with two-level global baroclinic primitive equation spectral model have main results as follows.The magnitude of initial errors directly affects the error growth,but its distribution form has little effect on the growth.The loss of predictability resulting from small-scale error is much greater than that from large-scale error.The small-scale error rapidly grows and is transferred to the large-scale error by interaction between different scale waves,which stimulates the growth of error for the whole system Orographic forcing restrains planetary-scale error(wavenumbers 0—3)but enhances the small-scale error (wavenumbers 8 or greater).Hence,orographic effects on the error growth closely depend on the characteris- tic scale of initial errors,and there may be a critical wavenumber between 4 and 7.The error growth is great- er in Northern Hemisphere than in Southern Hemisphere if initial errors are the same.In the end we give some discussions about model,initialization scheme,etc.,to improve model prediction.展开更多
Despite a specific data assimilation method,data assimilation(DA)in general can be decomposed into components of the prior information,observation forward operator that is given by the observation type,observation err...Despite a specific data assimilation method,data assimilation(DA)in general can be decomposed into components of the prior information,observation forward operator that is given by the observation type,observation error covariances,and background error covariances.In a classic Lorenz model,the influences of the DA components on the initial conditions(ICs)and subsequent forecasts are systematically investigated,which could provide a theoretical basis for the design of DA for different scales of interests.The forecast errors undergo three typical stages:a slow growth stage from 0 h to 5 d,a fast growth stage from 5 d to around 15 d with significantly different error growth rates for ensemble and deterministic forecasts,and a saturation stage after 15 d.Assimilation strategies that provide more accurate ICs can improve the predictability.Cycling assimilation is superior to offline assimilation,and a flow-dependent background error covariance matrix(Pf)provides better analyses than a static background error covariance matrix(B)for instantaneous observations and frequent time-averaged observations;but the opposite is true for infrequent time-averaged observations,since cycling simulation cannot construct informative priors when the model lacks predictive skills and the flow-dependent Pf cannot effectively extract information from low-informative observations as the static B.Instantaneous observations contain more information than time-averaged observations,thus the former is preferred,especially for infrequent observing systems.Moreover,ensemble forecasts have advantages over deterministic forecasts,and the advantages are enlarged with less informative observations and lower predictive-skill model priors.展开更多
The growth of small errors in weather prediction is exponential on average. As an error becomes larger, its growth slows down and then stops with the magnitude of the error saturating at about the average distance bet...The growth of small errors in weather prediction is exponential on average. As an error becomes larger, its growth slows down and then stops with the magnitude of the error saturating at about the average distance between two states chosen randomly.This paper studies the error growth in a low-dimensional atmospheric model before, during and after the initial exponential divergence occurs. We test cubic, quartic and logarithmic hypotheses by ensemble prediction method. Furthermore, the quadratic hypothesis suggested by Lorenz in 1969 is compared with the ensemble prediction method. The study shows that a small error growth is best modeled by the quadratic hypothesis. After the error exceeds about a half of the average value of variables, logarithmic approximation becomes superior. It is also shown that the time length of the exponential growth in the model data is a function of the size of small initial error and the largest Lyapunov exponent. We conclude that the size of the error at the least upper bound(supremum) of time length is equal to 1 and it is invariant to these variables. Predictability, as a time interval, where the model error is growing, is for small initial error, the sum of the least upper bound of time interval of exponential growth and predictability for the size of initial error equal to 1.展开更多
基金the National Key R&D Program of China(No.2021YFB3701705).
文摘This work constructed a machine learning(ML)model to predict the atmospheric corrosion rate of low-alloy steels(LAS).The material properties of LAS,environmental factors,and exposure time were used as the input,while the corrosion rate as the output.6 dif-ferent ML algorithms were used to construct the proposed model.Through optimization and filtering,the eXtreme gradient boosting(XG-Boost)model exhibited good corrosion rate prediction accuracy.The features of material properties were then transformed into atomic and physical features using the proposed property transformation approach,and the dominant descriptors that affected the corrosion rate were filtered using the recursive feature elimination(RFE)as well as XGBoost methods.The established ML models exhibited better predic-tion performance and generalization ability via property transformation descriptors.In addition,the SHapley additive exPlanations(SHAP)method was applied to analyze the relationship between the descriptors and corrosion rate.The results showed that the property transformation model could effectively help with analyzing the corrosion behavior,thereby significantly improving the generalization ability of corrosion rate prediction models.
基金supported by National Key R&D Program of China[Grant Number 2017YFB0203703]111 Project[Grant Number B12012]Fundamental Research Funds for the Central Universities[Grant Number FRF-GF-19-029B].
文摘The optimization of network topologies to retain the generalization ability by deciding when to stop overtraining an artificial neural network(ANN)is an existing vital challenge in ANN prediction works.The larger the dataset the ANN is trained with,the better generalization the prediction can give.In this paper,a large dataset of atmospheric corrosion data of carbon steel compiled from several resources is used to train and test a multilayer backpropagation ANN model as well as two conventional corrosion prediction models(linear and Klinesmith models).Unlike previous related works,a grid search-based hyperparameter tuning is performed to develop multiple hyperparameter combinations(network topologies)to train multiple ANNs with mini-batch stochastic gradient descent optimization algorithm to facilitate the training of a large dataset.After that,one selection strategy for the optimal hyperparameter combination is applied by an early stopping method to guarantee the generalization ability of the optimal network model.The correlation coefficients(R)of the ANN model can explain about 80%(more than 75%)of the variance of atmospheric corrosion of carbon steel,and the root mean square errors(RMSE)of three models show that the ANN model gives a better performance than the other two models with acceptable generalization.The influence of input parameters on the output is highlighted by using the fuzzy curve analysis method.The result reveals that TOW,Cl-and SO2 are the most important atmospheric chemical variables,which have a well-known nonlinear relationship with atmospheric corrosion.
基金Supported by Natural Science Foundation of China(41375079,40375025)
文摘The system periphery ("jieke" in Chinese) is defined as a part of the system and is adjacent to its environment. The periphery is an in- termediary agent between the system and its environment, and has two functions: defending system itself and exchanging with the environment. Generally, the periphery is defined on space dimension. We will investigate the periphery from the time dimension, and study a time jieke based on set theory viewpoint; initial values and forecast lead in weather forecast are clarified. Further predictability of weather forecast on the basis of periph- ery theory is defined; its calculation formulae are given, with which computing for day-to-day forecast were carded out. The results have been com- pared with present researches of atmospheric predictability, and it shows advancing the predictability study. Most interesting is that atmospheric pre- dictability possesses rule of gold ratio 0.618, and it is found firstly in research of weather and climate predictability.
文摘This paper makes a review on the predictability of the atmosphere. The essential problems of predictability theory, i.e., how a deterministic system changes to an undeterministic system (chaos) and how is the opposite (order within chaos), are discussed. Some applications of predictability theory are given.
基金This project is supported by National Natural Science Foundation of China
文摘In this paper the concept of Chaos and its applications to the study of predictability theory is introduced. The author's attempt is to give a general overview of ideas and methods involved in this problem to scientists,who are interested in the problem of predictability but not familiar with the theory of chaos. The problem is discussed in 4 sections. In the first section, the concept of chaos and the study methods are outlined briefly; in the second section, the methods of quantitatively measuring the main characteristics of chaos which are the basis for the predictability theory are introduced; the third section discusses the time series analysis for directly studying chaotic phenomena in practical problems; and the last section presents some research results on the chaotic characteristics and the predictability of the real atmosphere.
文摘The article is to report some results of numerical experiments on the error growth and the atmospheric predictability Experiments with two-level global baroclinic primitive equation spectral model have main results as follows.The magnitude of initial errors directly affects the error growth,but its distribution form has little effect on the growth.The loss of predictability resulting from small-scale error is much greater than that from large-scale error.The small-scale error rapidly grows and is transferred to the large-scale error by interaction between different scale waves,which stimulates the growth of error for the whole system Orographic forcing restrains planetary-scale error(wavenumbers 0—3)but enhances the small-scale error (wavenumbers 8 or greater).Hence,orographic effects on the error growth closely depend on the characteris- tic scale of initial errors,and there may be a critical wavenumber between 4 and 7.The error growth is great- er in Northern Hemisphere than in Southern Hemisphere if initial errors are the same.In the end we give some discussions about model,initialization scheme,etc.,to improve model prediction.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.42192553,41922036&41775057)the Frontiers Science Center for Critical Earth Material Cycling Fund(Grant No.JBGS2102)the Fundamental Research Funds for the Central Universities(Grant No.0209-14380097).
文摘Despite a specific data assimilation method,data assimilation(DA)in general can be decomposed into components of the prior information,observation forward operator that is given by the observation type,observation error covariances,and background error covariances.In a classic Lorenz model,the influences of the DA components on the initial conditions(ICs)and subsequent forecasts are systematically investigated,which could provide a theoretical basis for the design of DA for different scales of interests.The forecast errors undergo three typical stages:a slow growth stage from 0 h to 5 d,a fast growth stage from 5 d to around 15 d with significantly different error growth rates for ensemble and deterministic forecasts,and a saturation stage after 15 d.Assimilation strategies that provide more accurate ICs can improve the predictability.Cycling assimilation is superior to offline assimilation,and a flow-dependent background error covariance matrix(Pf)provides better analyses than a static background error covariance matrix(B)for instantaneous observations and frequent time-averaged observations;but the opposite is true for infrequent time-averaged observations,since cycling simulation cannot construct informative priors when the model lacks predictive skills and the flow-dependent Pf cannot effectively extract information from low-informative observations as the static B.Instantaneous observations contain more information than time-averaged observations,thus the former is preferred,especially for infrequent observing systems.Moreover,ensemble forecasts have advantages over deterministic forecasts,and the advantages are enlarged with less informative observations and lower predictive-skill model priors.
基金supported by Research Plan(No.MSM0021620860)by project(No.SVV-2013-267308)
文摘The growth of small errors in weather prediction is exponential on average. As an error becomes larger, its growth slows down and then stops with the magnitude of the error saturating at about the average distance between two states chosen randomly.This paper studies the error growth in a low-dimensional atmospheric model before, during and after the initial exponential divergence occurs. We test cubic, quartic and logarithmic hypotheses by ensemble prediction method. Furthermore, the quadratic hypothesis suggested by Lorenz in 1969 is compared with the ensemble prediction method. The study shows that a small error growth is best modeled by the quadratic hypothesis. After the error exceeds about a half of the average value of variables, logarithmic approximation becomes superior. It is also shown that the time length of the exponential growth in the model data is a function of the size of small initial error and the largest Lyapunov exponent. We conclude that the size of the error at the least upper bound(supremum) of time length is equal to 1 and it is invariant to these variables. Predictability, as a time interval, where the model error is growing, is for small initial error, the sum of the least upper bound of time interval of exponential growth and predictability for the size of initial error equal to 1.