The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine l...The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine learning(ML)models effectively deal with such challenges.This research paper comprehensively analyses recent advancements in crop yield prediction from January 2016 to March 2024.In addition,it analyses the effectiveness of various input parameters considered in crop yield prediction models.We conducted an in-depth search and gathered studies that employed crop modeling and AI-based methods to predict crop yield.The total number of articles reviewed for crop yield prediction using ML,meta-modeling(Crop models coupled with ML/DL),and DL-based prediction models and input parameter selection is 125.We conduct the research by setting up five objectives for this research and discussing them after analyzing the selected research papers.Each study is assessed based on the crop type,input parameters employed for prediction,the modeling techniques adopted,and the evaluation metrics used for estimatingmodel performance.We also discuss the ethical and social impacts of AI on agriculture.However,various approaches presented in the scientific literature have delivered impressive predictions,they are complicateddue to intricate,multifactorial influences oncropgrowthand theneed for accuratedata-driven models.Therefore,thorough research is required to deal with challenges in predicting agricultural output.展开更多
Pre-harvest yield prediction of ratoon rice is critical for guiding crop interventions in precision agriculture.However,the unique agronomic practice(i.e.,varied stubble height treatment)in rice ratooning could lead t...Pre-harvest yield prediction of ratoon rice is critical for guiding crop interventions in precision agriculture.However,the unique agronomic practice(i.e.,varied stubble height treatment)in rice ratooning could lead to inconsistent rice phenology,which had a significant impact on yield prediction of ratoon rice.Multi-temporal unmanned aerial vehicle(UAV)-based remote sensing can likely monitor ratoon rice productivity and reflect maximum yield potential across growing seasons for improving the yield prediction compared with previous methods.Thus,in this study,we explored the performance of combination of agronomic practice information(API)and single-phase,multi-spectral features[vegetation indices(VIs)and texture(Tex)features]in predicting ratoon rice yield,and developed a new UAV-based method to retrieve yield formation process by using multi-temporal features which were effective in improving yield forecasting accuracy of ratoon rice.The results showed that the integrated use of VIs,Tex and API(VIs&Tex+API)improved the accuracy of yield prediction than single-phase UAV imagery-based feature,with the panicle initiation stage being the best period for yield prediction(R^(2) as 0.732,RMSE as 0.406,RRMSE as 0.101).More importantly,compared with previous multi-temporal UAV-based methods,our proposed multi-temporal method(multi-temporal model VIs&Tex:R^(2) as 0.795,RMSE as 0.298,RRMSE as 0.072)can increase R^(2) by 0.020-0.111 and decrease RMSE by 0.020-0.080 in crop yield forecasting.This study provides an effective method for accurate pre-harvest yield prediction of ratoon rice in precision agriculture,which is of great significance to take timely means for ensuring ratoon rice production and food security.展开更多
Agriculture plays a vital role in the Indian economy.Crop recommen-dation for a specific region is a tedious process as it can be affected by various variables such as soil type and climatic parameters.At the same time...Agriculture plays a vital role in the Indian economy.Crop recommen-dation for a specific region is a tedious process as it can be affected by various variables such as soil type and climatic parameters.At the same time,crop yield prediction was based on several features like area,irrigation type,temperature,etc.The recent advancements of artificial intelligence(AI)and machine learning(ML)models pave the way to design effective crop recommendation and crop pre-diction models.In this view,this paper presents a novel Multimodal Machine Learning Based Crop Recommendation and Yield Prediction(MMML-CRYP)technique.The proposed MMML-CRYP model mainly focuses on two processes namely crop recommendation and crop prediction.At the initial stage,equilibrium optimizer(EO)with kernel extreme learning machine(KELM)technique is employed for effectual recommendation of crops.Next,random forest(RF)tech-nique was executed for predicting the crop yield accurately.For reporting the improved performance of the MMML-CRYP system,a wide range of simulations were carried out and the results are investigated using benchmark dataset.Experi-mentation outcomes highlighted the significant performance of the MMML-CRYP approach on the compared approaches with maximum accuracy of 97.91%.展开更多
Cornstalks show promise as a raw material for polysaccharide production through xylanase.Rapid and accurate prediction of polysaccharide yield can facilitate process optimization,eliminating the need for extensive exp...Cornstalks show promise as a raw material for polysaccharide production through xylanase.Rapid and accurate prediction of polysaccharide yield can facilitate process optimization,eliminating the need for extensive experimentation in actual production to refine reaction conditions,thereby saving time and costs.However,the intricate interplay of enzymatic factors poses challenges in predicting and optimizing polysaccharide yield accurately.Here,we introduce an innovative data-driven approach leveraging multiple artificial intelligence techniques to enhance polysaccharide production.We propose a machine learning framework to identify highly accurate polysaccharide yield prediction modeling methods and uncover optimal enzymatic parameter combinations.Notably,Random Forest(RF)and eXtreme Gradient Boost(XGB)demonstrate robust performance,achieving prediction accuracies of 93.0%and 95.6%,respectively,while an independently developed deep neural network(DNN)model achieves 91.1%accuracy.A feature importance analysis of XGB reveals the enzyme solution volume's dominant role(43.7%),followed by time(20.7%),substrate concentration(15%),temperature(15%),and pH(5.6%).Further interpretability analysis unveils complex parameter interactions and potential optimization strategies.This data-driven approach,incorporating machine learning,deep learning,and interpretable analysis,offers a viable pathway for polysaccharide yield prediction and the potential recovery of various agricultural residues.展开更多
Predicting crop yield timely can considerably accelerate agricultural production management and food policy-making,which are also important requirements for precise agricultural development.Given the development of hy...Predicting crop yield timely can considerably accelerate agricultural production management and food policy-making,which are also important requirements for precise agricultural development.Given the development of hyperspectral imaging technology,a simple and efficient modeling method is convenient for predicting crop yield by using airborne hyperspectral images.In this study,the Unmanned Aerial Vehicle(UAV)hyperspectral and maturity yield data in 2014-2015 and 2017-2018 were collected.The winter wheat yield prediction model was established by optimizing Vegetation Indices(VIs)feature scales and sample scales,incorporating Partial Least Squares Regression(PLSR),Random Forest algorithm(RF),and Back Propagation Neural Network algorithm(BPN).Results showed that PLSR stands out as the optimal wheat yield prediction model considering stability and accuracy(RMSE=948.88 kg/hm2).Contrary to the belief that more input features result in higher accuracy,PLSR,RF,and BPN models performed best when trained with the top 3,8,and 4 VIs with the highest correlation,respectively.With an increase in training samples,model accuracy improves,reaching stability when the training samples reach 70.Using PLSR and optimal feature scales,UAV yield prediction maps were generated,holding significant value for field management in precision agriculture.展开更多
The accurate prediction of soybean yield is of great significance for agricultural production, monitoring and early warning.Although previous studies have used machine learning algorithms to predict soybean yield base...The accurate prediction of soybean yield is of great significance for agricultural production, monitoring and early warning.Although previous studies have used machine learning algorithms to predict soybean yield based on meteorological data,it is not clear how different models can be used to effectively separate soybean meteorological yield from soybean yield in various regions. In addition, comprehensively integrating the advantages of various machine learning algorithms to improve the prediction accuracy through ensemble learning algorithms has not been studied in depth. This study used and analyzed various daily meteorological data and soybean yield data from 173 county-level administrative regions and meteorological stations in two principal soybean planting areas in China(Northeast China and the Huang–Huai region), covering 34 years.Three effective machine learning algorithms(K-nearest neighbor, random forest, and support vector regression) were adopted as the base-models to establish a high-precision and highly-reliable soybean meteorological yield prediction model based on the stacking ensemble learning framework. The model's generalizability was further improved through 5-fold crossvalidation, and the model was optimized by principal component analysis and hyperparametric optimization. The accuracy of the model was evaluated by using the five-year sliding prediction and four regression indicators of the 173 counties, which showed that the stacking model has higher accuracy and stronger robustness. The 5-year sliding estimations of soybean yield based on the stacking model in 173 counties showed that the prediction effect can reflect the spatiotemporal distribution of soybean yield in detail, and the mean absolute percentage error(MAPE) was less than 5%. The stacking prediction model of soybean meteorological yield provides a new approach for accurately predicting soybean yield.展开更多
The prediction of crop yield is one of the important factor and also challenging,to predict the future crop yield based on various criteria’s.Many advanced technologies are incorporated in the agricultural processes,...The prediction of crop yield is one of the important factor and also challenging,to predict the future crop yield based on various criteria’s.Many advanced technologies are incorporated in the agricultural processes,which enhances the crop yield production efficiency.The process of predicting the crop yield can be done by taking agriculture data,which helps to analyze and make important decisions before and during cultivation.This paper focuses on the prediction of crop yield,where two models of machine learning are developed for this work.One is Modified Convolutional Neural Network(MCNN),and the other model is TLBO(Teacher Learning Based Optimization)-a Genetic algorithm which reduces the input size of data.In this work,some spatial information used for analysis is the Normalized Difference Vegetation Index,Standard Precipitation Index and Vegetation Condition Index.TLBO finds some best feature value set in the data that represents the specific yield of the crop.So,these selected feature valued set is passed in the Error Back Propagation Neural Network for learning.Here,the training was done in such a way that all set of features were utilized in pair with their yield value as output.For increasing the reliability of the work whole experiment was done on a real dataset from Madhya Pradesh region of country India.The result shows that the proposed model has overcome various evaluation parameters on different scales as compared to previous approaches adopted by researchers.展开更多
Though studies showed the potential of high-resolution optical sensors for crop yield prediction,several factors have limited their wider application.The main factors are obstruction of cloud,identification of phenolo...Though studies showed the potential of high-resolution optical sensors for crop yield prediction,several factors have limited their wider application.The main factors are obstruction of cloud,identification of phenology,demand for high computing infrastructure and the complexity of statistical methods.In this research,we created a novel approach by combining four methods.First,we implemented the cloud restoration algorithm called gapfill to restore missed Normalized Difference Vegetation Index(NDVI)values derived from Sentinel-2 sensor(S2)due to cloud obstruction.Second,we created square tiles as a solution for high computing infrastructure demand due to the use of high-resolution sensor.Third,we implemented gapfill following critical crop phenology stage.Fourth,observations from restored images combined with original(from cloud-free images)values and applied for winter wheat prediction.We applied seven base machine learning as well as two groups of super learning ensembles.The study successfully applied gapfill on high-resolution image to get good quality estimates for cloudy pixels.Consequently,yield prediction accuracy increased due to the incorporation of restored values in the regression process.Base models such as Generalized Linear Regression(GLM)and Random Forest(RF)showed improved capacity compared to other base and ensemble models.The two models revealed RMSE of 0.001 t/ha and 0.136 t/ha on the holdout group.The twomodels also revealed consistent and better performance using scatter plot analysis across three datasets.The approach developed is useful to predict wheat yield at field scale,which is a rarely available but vital in many developmental projects,using optical sensors.展开更多
The effects of Si content on the microstructure and yield strength of Al-(1.44-12.40)Si-0.7 Mg(wt.%)alloy sheets under the T4 condition were systematically studied via laser scanning confocal microscopy(LSCM),DSC,TEM ...The effects of Si content on the microstructure and yield strength of Al-(1.44-12.40)Si-0.7 Mg(wt.%)alloy sheets under the T4 condition were systematically studied via laser scanning confocal microscopy(LSCM),DSC,TEM and tensile tests.The results show that the recrystallization grain of the alloy sheets becomes more refined with an increase in Si content.When the Si content increases from 1.44 to 12.4 wt.%,the grain size of the alloy sheets decreases from approximately 47 to 10μm.Further,with an increase in Si content,the volume fraction of the GP zones in the matrix increases slightly.Based on the existing model,a yield strength model for alloy sheets was proposed.The predicted results are in good agreement with the actual experimental results and reveal the strengthening mechanisms of the Al-(1.44-12.40)Si-0.7 Mg alloy sheets under the T4 condition and how they are influenced by the Si content.展开更多
Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an...Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an effective model for a target task by leveraging knowledge from a different,but related,source domain and is considered a great potential method for improving yield prediction by integrating multi-trait data.However,it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework.We therefore developed TrG2P,a transfer-learning-based framework.TrG2P first employs convolutional neural networks(CNN)to train models using non-yield-trait phenotypic and genotypic data,thus obtaining pre-trained models.Subsequently,the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task,and the fully connected layers are retrained,thus obtaining fine-tuned models.Finally,the convolutional layer and the first fully connected layer of the fine-tuned models are fused,and the last fully connected layer is trained to enhance prediction performance.We applied TrG2P to five sets of genotypic and phenotypic data from maize(Zea mays),rice(Oryza sativa),and wheat(Triticum aestivum)and compared its model precision to that of seven other popular GS tools:ridge regression best linear unbiased prediction(rrBLUP),random forest,support vector regression,light gradient boosting machine(LightGBM),CNN,DeepGS,and deep neural network for genomic prediction(DNNGP).TrG2P improved the accuracy of yield prediction by 39.9%,6.8%,and 1.8%in rice,maize,and wheat,respectively,compared with predictions generated by the best-performing comparison model.Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data.We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation.The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P.The web-based tool is available at http://trg2p.ebreed.cn:81.展开更多
Maize(Zea mays L.) stands prominently as one of the major cereal crops in China as well as in the rest of the world.Therefore,predicting the growth and yield of maize for large areas through yield components under hig...Maize(Zea mays L.) stands prominently as one of the major cereal crops in China as well as in the rest of the world.Therefore,predicting the growth and yield of maize for large areas through yield components under high-yielding environments will help in understanding the process of yield formation and yield potential under different environmental conditions.This accurate early assessment of yield requires accuracy in the formation process of yield components as well.In order to formulate the quantitative design for high yields of maize in China,yield performance parameters of quantitative design for high grain yields were evaluated in this study,by utilizing the yield performance equation with normalization of planting density.Planting density was evaluated by parameters including the maximum leaf area index and the maximum leaf area per plant.Results showed that the variation of the maximum leaf area per plant with varying plant density conformed to the Reciprocal Model,which proved to have excellent prediction with root mean square error(RMSE) value of 5.95%.Yield model estimation depicted that the best optimal maximum leaf area per plant was 0.63 times the potential maximum leaf area per plant of hybrids.Yield performance parameters for different yield levels were quantitatively designed based on the yield performance equation.Through validation of the yield performance model by simulating high yields of spring maize in the Inner Mongolia Autonomous Region and Jilin Province,China,and summer maize in Shandong Province,the yield performance equation showed excellent prediction with the satisfactory mean RMSE value(7.72%) of all the parameters.The present study provides theoretical support for the formulation of quantitative design for sustainable high yield of maize in China,through consideration of planting density normalization in the yield prediction process,providing there is no water and nutrient limitation.展开更多
Agricultural system is very complex since it deals with large data situation which comes from a number of factors. A lot of techniques and approaches have been used to identify any interactions between factors that af...Agricultural system is very complex since it deals with large data situation which comes from a number of factors. A lot of techniques and approaches have been used to identify any interactions between factors that affecting yields with the crop performances. The application of neural network to the task of solving non-linear and complex systems is promising. This paper presents a review on the use of artificial neural network (ANN) in predicting crop yield using various crop performance factors. General overview on the application of ANN and the basic concept of neural network architecture are also presented. From the literature, it has been shown that ANN provides better interpretation of crop variability compared to the other methods.展开更多
In this study,we analyzed the potential of using dry matter content for determining ethanol yield of sweet potatoes as one of the raw materials for bioethanol production.We tested dry matter content,total starch conte...In this study,we analyzed the potential of using dry matter content for determining ethanol yield of sweet potatoes as one of the raw materials for bioethanol production.We tested dry matter content,total starch content,crude protein content,glucose content,fructose content,sucrose content and fermentation indicators of 29 sweet potato varieties in Henan province.Correlation analysis between main component contents of sweet potato and the fermentation indicators were carried on.The results showed that there was strong linear correlation between dry matter content and bioethanol yield(R^2=0.935).In order to prove the conclusion,we also tested dry matter content and ethanol yield of another24 sweet potato varieties.Based on the dry matter content and linear correlations,we predicted the ethanol yields.We performed correlation analysis between the predicted values and the measured values of bioethanol yield of the 24 sweet potato varieties,and found highly significant positive correlation between the predicted values and the measured values.These results confirmed the reliability of using dry matter content for bioethanol production prediction for sweet potatoes.展开更多
Grain yield security is a basic national policy of China,and changes in grain yield are influenced by a variety of factors,which often have a complex,non-linear relationship with each other.Therefore,this paper propos...Grain yield security is a basic national policy of China,and changes in grain yield are influenced by a variety of factors,which often have a complex,non-linear relationship with each other.Therefore,this paper proposes a Grey Relational Analysis-Adaptive Boosting-Support Vector Regression(GRA-AdaBoost-SVR)model,which can ensure the prediction accuracy of the model under small sample,improve the generalization ability,and enhance the prediction accuracy.SVR allows mapping to high-dimensional spaces using kernel functions,good for solving nonlinear problems.Grain yield datasets generally have small sample sizes and many features,making SVR a promising application for grain yield datasets.However,the SVR algorithm’s own problems with the selection of parameters and kernel functions make the model less generalizable.Therefore,the Adaptive Boosting(AdaBoost)algorithm can be used.Using the SVR algorithm as a training method for base learners in the AdaBoost algorithm.Effectively address the generalization capability problem in SVR algorithms.In addition,to address the problem of sensitivity to anomalous samples in the AdaBoost algorithm,the GRA method is used to extract influence factors with higher correlation and reduce the number of anomalous samples.Finally,applying the GRA-AdaBoost-SVR model to grain yield forecasting in China.Experiments were conducted to verify the correctness of the model and to compare the effectiveness of several traditional models applied to the grain yield data.The results show that the GRA-AdaBoost-SVR algorithm improves the prediction accuracy,the model is smoother,and confirms that the model possesses better prediction performance and better generalization ability.展开更多
Starting from the supposition of time-space substitution, the Langbein-Schumm's Law was applied to deal with response of fluvial erosion System to the changes in mean annual Precipitation induced by global green-h...Starting from the supposition of time-space substitution, the Langbein-Schumm's Law was applied to deal with response of fluvial erosion System to the changes in mean annual Precipitation induced by global green-house warming. As a result, a simple method was put forward to predict change in sediment yield, with Ningxia Hui Autonomous Region in the northern fringe of the Loess Plateau of China as an example. Results show that, even the change in mean annual precipitation is the same, the direction and magnitude of the resultant chang in sediment yteld would be quite different in fferent physico-geographical zones. When mean annual precipitation is increased, sediment yield in arid or semi-arid areas with a mean anntal Peripitation of less than 400 mm will be increased, while sediment yield in sub-humid or humid areas with a mean annual precipitation of more than 400 mm will be decreased.Additionally, the complex response of fluvial erosion system in time series due to the lag of change in vegetation behind the changn in precipitation has also been qualitatively discussed in this paper.展开更多
Prediction of reaction yields using machine learning(ML)can help chemists select high-yielding reactions and provide prior experience before wet-lab experimenting to improve efficiency.However,the exploration of a mul...Prediction of reaction yields using machine learning(ML)can help chemists select high-yielding reactions and provide prior experience before wet-lab experimenting to improve efficiency.However,the exploration of a multicomponent organic reaction features many complex variables and limited number of experimental data,which are challenging for the application of ML.Herein,we perform yield prediction for the synthesis of 2-oxazolidones via Cu-catalyzed radical-type oxy-alkylation of allylamines and herteroaryl-methylamines with CO_(2),which is a three-component reaction.Using physicochemical descriptors as features to launch ML modelling,we find that XGBoost shows significantly improved performance over linear models and these features are effective for the yield prediction.Moreover,out-of-sample prediction indicates the application potential of the model.This study demonstrates great potential of regression-modelling-based ML in organic synthesis even with complex factors and a general small size of reaction data,which are generated from the classical research pattern of method for the inquiry of multicomponent reactions.展开更多
Due to continuous process scaling, process, voltage, and temperature (PVT) parameter variations have become one of the most problematic issues in circuit design. The resulting correlations among performance metrics ...Due to continuous process scaling, process, voltage, and temperature (PVT) parameter variations have become one of the most problematic issues in circuit design. The resulting correlations among performance metrics lead to a significant parametric yield loss. Previous algorithms on parametric yield prediction are limited to predicting a single-parametric yield or performing balanced optimization for several single-parametric yields. Consequently, these methods fail to predict the multiparametric yield that optimizes multiple performance metrics simultaneously, which may result in significant accuracy loss. In this paper we suggest an efficient multi-parametric yield prediction framework, in which multiple performance metrics are considered as simultaneous constraint conditions for parametric yield prediction, to maintain the correlations among metrics. First, the framework models the performance metrics in terms of PVT parameter variations by using the adaptive elastic net (AEN) method. Then the parametric yield for a single performance metric can be predicted through the computation of the cumulative distribution function (CDF) based on the multiplication theorem and the Markov chain Monte Carlo (MCMC) method. Finally, a copula-based parametric yield prediction procedure has been developed to solve the multi-parametric yield prediction problem, and to generate an accurate yield estimate. Experimental results demonstrate that the proposed multi-parametric yield prediction framework is able to provide the designer with either an accurate value for parametric yield under specific performance limits, or a multi-parametric yield surface under all ranges of performance limits.展开更多
Variation in phenological stage is the major nonlinearity in monitoring, modeling and various estimations of agricultural systems. Indices are used as a common means of evaluating agricultural monitoring data from rem...Variation in phenological stage is the major nonlinearity in monitoring, modeling and various estimations of agricultural systems. Indices are used as a common means of evaluating agricultural monitoring data from remote sensing and terrestrial observation systems, and many of these indices have linear characteristics. The analysis of and relationships between indices are dependent on the type of plant, but they are also highly variable with respect to its phenologicat stage. For this reason, variations in the phenologica! stage affect the performance of spatiotemporal crop status monitoring. We hereby propose an adaptive event-triggered model for monitoring crop status based on remote sensing data and terrestrial observations. In the proposed model, the estimation of phenological stage is a part of predicting crop status, and spatially distributed remote sensing parameters and temporal terrestrial monitoring data are used together as inputs in a state space system model. The temporal data are segmented with respect to the phenological stage-oriented timing of the spatial data, so instead of a generalized discrete state space model, we used logical states combined with analog inputs and adaptive trigger functions, as in the case of a Mealy machine model. This provides the necessary nonlinearity for the state transi- tions. The results showed that observation parameters have considerably greater significance in crop status monitoring with respect to conventional agricultural data fusion techniques.展开更多
Estimation of yield performance for crop products is a topic of interest in agriculture.In breeding programs,we cannot test all possible hybrids created by crossing two parents(inbred and tester)since it would be too ...Estimation of yield performance for crop products is a topic of interest in agriculture.In breeding programs,we cannot test all possible hybrids created by crossing two parents(inbred and tester)since it would be too time consuming and costly.In this paper,we exploit different machine learning algorithms including decision tree,gradient boosting machine,random forest,adaptive boosting,XGBoost and neural network to predict the yield of corn hybrids using data provided in the 2020 Syngenta Crop Challenge.The participants were asked to predict the yield of missing hybrids which were not tested before.Our results show that the prediction obtained by XGBoost is more accurate than other models with a root mean square error equal to 0.0524.Therefore,we use XGBoost model to estimate the yield performance for untested combinations of inbreds and testers.Using this approach,we identify hybrids with high predicted yield that can be bred to increase corn production.展开更多
As die size and complexity increase, accurate and efficient extraction of the critical area is essential for yield prediction. Aiming at eliminating the potential integration errors of the traditional shape shifting m...As die size and complexity increase, accurate and efficient extraction of the critical area is essential for yield prediction. Aiming at eliminating the potential integration errors of the traditional shape shifting method, an improved shape shifting method is proposed for Manhattan layouts. By mathematical analyses of the relevance of critical areas to defect sizes, the critical area for all defect sizes is modeled as a piecewise quadratic polynomial function of defect size, which can be obtained by extracting critical area for some certain defect sizes. Because the improved method calculates critical areas for all defect sizes instead of several discrete values with traditional shape shifting method, it eliminates the integration error of the average critical area. Experiments on industrial layouts show that the improved shape shifting method can improve the accuracy of the average critical area calculation by 24.3% or reduce about 59.7% computational expense compared with the traditional method.展开更多
文摘The growing global requirement for food and the need for sustainable farming in an era of a changing climate and scarce resources have inspired substantial crop yield prediction research.Deep learning(DL)and machine learning(ML)models effectively deal with such challenges.This research paper comprehensively analyses recent advancements in crop yield prediction from January 2016 to March 2024.In addition,it analyses the effectiveness of various input parameters considered in crop yield prediction models.We conducted an in-depth search and gathered studies that employed crop modeling and AI-based methods to predict crop yield.The total number of articles reviewed for crop yield prediction using ML,meta-modeling(Crop models coupled with ML/DL),and DL-based prediction models and input parameter selection is 125.We conduct the research by setting up five objectives for this research and discussing them after analyzing the selected research papers.Each study is assessed based on the crop type,input parameters employed for prediction,the modeling techniques adopted,and the evaluation metrics used for estimatingmodel performance.We also discuss the ethical and social impacts of AI on agriculture.However,various approaches presented in the scientific literature have delivered impressive predictions,they are complicateddue to intricate,multifactorial influences oncropgrowthand theneed for accuratedata-driven models.Therefore,thorough research is required to deal with challenges in predicting agricultural output.
基金supported by the Key Research and Development Program of Heilongjiang,China(Grant No.2022ZX01A25)Cooperative Funding between Huazhong Agricultural University and Shenzhen Institute of Agricultural Genomics(Grant No.SZYJY2022014)+2 种基金Fundamental Research Funds for the Central Universities,Beijing,China(Grant Nos.2662022JC006 and 2662022ZHYJ002)National Natural Science Foundation of China(Grant No.32101819)Huazhong Agriculture University Research Startup Fund,China(Grant Nos.11041810340 and 11041810341).
文摘Pre-harvest yield prediction of ratoon rice is critical for guiding crop interventions in precision agriculture.However,the unique agronomic practice(i.e.,varied stubble height treatment)in rice ratooning could lead to inconsistent rice phenology,which had a significant impact on yield prediction of ratoon rice.Multi-temporal unmanned aerial vehicle(UAV)-based remote sensing can likely monitor ratoon rice productivity and reflect maximum yield potential across growing seasons for improving the yield prediction compared with previous methods.Thus,in this study,we explored the performance of combination of agronomic practice information(API)and single-phase,multi-spectral features[vegetation indices(VIs)and texture(Tex)features]in predicting ratoon rice yield,and developed a new UAV-based method to retrieve yield formation process by using multi-temporal features which were effective in improving yield forecasting accuracy of ratoon rice.The results showed that the integrated use of VIs,Tex and API(VIs&Tex+API)improved the accuracy of yield prediction than single-phase UAV imagery-based feature,with the panicle initiation stage being the best period for yield prediction(R^(2) as 0.732,RMSE as 0.406,RRMSE as 0.101).More importantly,compared with previous multi-temporal UAV-based methods,our proposed multi-temporal method(multi-temporal model VIs&Tex:R^(2) as 0.795,RMSE as 0.298,RRMSE as 0.072)can increase R^(2) by 0.020-0.111 and decrease RMSE by 0.020-0.080 in crop yield forecasting.This study provides an effective method for accurate pre-harvest yield prediction of ratoon rice in precision agriculture,which is of great significance to take timely means for ensuring ratoon rice production and food security.
文摘Agriculture plays a vital role in the Indian economy.Crop recommen-dation for a specific region is a tedious process as it can be affected by various variables such as soil type and climatic parameters.At the same time,crop yield prediction was based on several features like area,irrigation type,temperature,etc.The recent advancements of artificial intelligence(AI)and machine learning(ML)models pave the way to design effective crop recommendation and crop pre-diction models.In this view,this paper presents a novel Multimodal Machine Learning Based Crop Recommendation and Yield Prediction(MMML-CRYP)technique.The proposed MMML-CRYP model mainly focuses on two processes namely crop recommendation and crop prediction.At the initial stage,equilibrium optimizer(EO)with kernel extreme learning machine(KELM)technique is employed for effectual recommendation of crops.Next,random forest(RF)tech-nique was executed for predicting the crop yield accurately.For reporting the improved performance of the MMML-CRYP system,a wide range of simulations were carried out and the results are investigated using benchmark dataset.Experi-mentation outcomes highlighted the significant performance of the MMML-CRYP approach on the compared approaches with maximum accuracy of 97.91%.
基金supported by the Academic Core Project of Northeast Agricultural University Scholars Program(20YJ5B01)Heilongjiang Postdoctoral General Fund Project(LBH-Z21110)Key Laboratory of Swine Facilities Engineering,Ministry of Agriculture and Rural Affairs,Northeast Agricultural University 150030,P.R.China.
文摘Cornstalks show promise as a raw material for polysaccharide production through xylanase.Rapid and accurate prediction of polysaccharide yield can facilitate process optimization,eliminating the need for extensive experimentation in actual production to refine reaction conditions,thereby saving time and costs.However,the intricate interplay of enzymatic factors poses challenges in predicting and optimizing polysaccharide yield accurately.Here,we introduce an innovative data-driven approach leveraging multiple artificial intelligence techniques to enhance polysaccharide production.We propose a machine learning framework to identify highly accurate polysaccharide yield prediction modeling methods and uncover optimal enzymatic parameter combinations.Notably,Random Forest(RF)and eXtreme Gradient Boost(XGB)demonstrate robust performance,achieving prediction accuracies of 93.0%and 95.6%,respectively,while an independently developed deep neural network(DNN)model achieves 91.1%accuracy.A feature importance analysis of XGB reveals the enzyme solution volume's dominant role(43.7%),followed by time(20.7%),substrate concentration(15%),temperature(15%),and pH(5.6%).Further interpretability analysis unveils complex parameter interactions and potential optimization strategies.This data-driven approach,incorporating machine learning,deep learning,and interpretable analysis,offers a viable pathway for polysaccharide yield prediction and the potential recovery of various agricultural residues.
基金financially supported by the National Natural Science Foundation of China(Grant No.42271396)the Key R&D project of Hebei Province(Grant No.22326406D).
文摘Predicting crop yield timely can considerably accelerate agricultural production management and food policy-making,which are also important requirements for precise agricultural development.Given the development of hyperspectral imaging technology,a simple and efficient modeling method is convenient for predicting crop yield by using airborne hyperspectral images.In this study,the Unmanned Aerial Vehicle(UAV)hyperspectral and maturity yield data in 2014-2015 and 2017-2018 were collected.The winter wheat yield prediction model was established by optimizing Vegetation Indices(VIs)feature scales and sample scales,incorporating Partial Least Squares Regression(PLSR),Random Forest algorithm(RF),and Back Propagation Neural Network algorithm(BPN).Results showed that PLSR stands out as the optimal wheat yield prediction model considering stability and accuracy(RMSE=948.88 kg/hm2).Contrary to the belief that more input features result in higher accuracy,PLSR,RF,and BPN models performed best when trained with the top 3,8,and 4 VIs with the highest correlation,respectively.With an increase in training samples,model accuracy improves,reaching stability when the training samples reach 70.Using PLSR and optimal feature scales,UAV yield prediction maps were generated,holding significant value for field management in precision agriculture.
基金supported by the Science and Technology Innovation Project of Chinese Academy of Agricultural Sciences (CAAS-ASTIP-2016-AII)。
文摘The accurate prediction of soybean yield is of great significance for agricultural production, monitoring and early warning.Although previous studies have used machine learning algorithms to predict soybean yield based on meteorological data,it is not clear how different models can be used to effectively separate soybean meteorological yield from soybean yield in various regions. In addition, comprehensively integrating the advantages of various machine learning algorithms to improve the prediction accuracy through ensemble learning algorithms has not been studied in depth. This study used and analyzed various daily meteorological data and soybean yield data from 173 county-level administrative regions and meteorological stations in two principal soybean planting areas in China(Northeast China and the Huang–Huai region), covering 34 years.Three effective machine learning algorithms(K-nearest neighbor, random forest, and support vector regression) were adopted as the base-models to establish a high-precision and highly-reliable soybean meteorological yield prediction model based on the stacking ensemble learning framework. The model's generalizability was further improved through 5-fold crossvalidation, and the model was optimized by principal component analysis and hyperparametric optimization. The accuracy of the model was evaluated by using the five-year sliding prediction and four regression indicators of the 173 counties, which showed that the stacking model has higher accuracy and stronger robustness. The 5-year sliding estimations of soybean yield based on the stacking model in 173 counties showed that the prediction effect can reflect the spatiotemporal distribution of soybean yield in detail, and the mean absolute percentage error(MAPE) was less than 5%. The stacking prediction model of soybean meteorological yield provides a new approach for accurately predicting soybean yield.
文摘The prediction of crop yield is one of the important factor and also challenging,to predict the future crop yield based on various criteria’s.Many advanced technologies are incorporated in the agricultural processes,which enhances the crop yield production efficiency.The process of predicting the crop yield can be done by taking agriculture data,which helps to analyze and make important decisions before and during cultivation.This paper focuses on the prediction of crop yield,where two models of machine learning are developed for this work.One is Modified Convolutional Neural Network(MCNN),and the other model is TLBO(Teacher Learning Based Optimization)-a Genetic algorithm which reduces the input size of data.In this work,some spatial information used for analysis is the Normalized Difference Vegetation Index,Standard Precipitation Index and Vegetation Condition Index.TLBO finds some best feature value set in the data that represents the specific yield of the crop.So,these selected feature valued set is passed in the Error Back Propagation Neural Network for learning.Here,the training was done in such a way that all set of features were utilized in pair with their yield value as output.For increasing the reliability of the work whole experiment was done on a real dataset from Madhya Pradesh region of country India.The result shows that the proposed model has overcome various evaluation parameters on different scales as compared to previous approaches adopted by researchers.
文摘Though studies showed the potential of high-resolution optical sensors for crop yield prediction,several factors have limited their wider application.The main factors are obstruction of cloud,identification of phenology,demand for high computing infrastructure and the complexity of statistical methods.In this research,we created a novel approach by combining four methods.First,we implemented the cloud restoration algorithm called gapfill to restore missed Normalized Difference Vegetation Index(NDVI)values derived from Sentinel-2 sensor(S2)due to cloud obstruction.Second,we created square tiles as a solution for high computing infrastructure demand due to the use of high-resolution sensor.Third,we implemented gapfill following critical crop phenology stage.Fourth,observations from restored images combined with original(from cloud-free images)values and applied for winter wheat prediction.We applied seven base machine learning as well as two groups of super learning ensembles.The study successfully applied gapfill on high-resolution image to get good quality estimates for cloudy pixels.Consequently,yield prediction accuracy increased due to the incorporation of restored values in the regression process.Base models such as Generalized Linear Regression(GLM)and Random Forest(RF)showed improved capacity compared to other base and ensemble models.The two models revealed RMSE of 0.001 t/ha and 0.136 t/ha on the holdout group.The twomodels also revealed consistent and better performance using scatter plot analysis across three datasets.The approach developed is useful to predict wheat yield at field scale,which is a rarely available but vital in many developmental projects,using optical sensors.
基金Project(2016YFB0300801)supported by the National Key Research and Development Program of ChinaProject(51871043)supported by the National Natural Science Foundation of ChinaProject(N180212010)supported by the Fundamental Research Funds for the Central Universities of China。
文摘The effects of Si content on the microstructure and yield strength of Al-(1.44-12.40)Si-0.7 Mg(wt.%)alloy sheets under the T4 condition were systematically studied via laser scanning confocal microscopy(LSCM),DSC,TEM and tensile tests.The results show that the recrystallization grain of the alloy sheets becomes more refined with an increase in Si content.When the Si content increases from 1.44 to 12.4 wt.%,the grain size of the alloy sheets decreases from approximately 47 to 10μm.Further,with an increase in Si content,the volume fraction of the GP zones in the matrix increases slightly.Based on the existing model,a yield strength model for alloy sheets was proposed.The predicted results are in good agreement with the actual experimental results and reveal the strengthening mechanisms of the Al-(1.44-12.40)Si-0.7 Mg alloy sheets under the T4 condition and how they are influenced by the Si content.
基金This research was funded by the STI2030-Major Projects(no.2023ZD0406104)the Beijing Postdoctoral Research Foundation(no.2023-ZZ-116).
文摘Yield prediction is the primary goal of genomic selection(GS)-assisted crop breeding.Because yield is a complex quantitative trait,making predictions from genotypic data is challenging.Transfer learning can produce an effective model for a target task by leveraging knowledge from a different,but related,source domain and is considered a great potential method for improving yield prediction by integrating multi-trait data.However,it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework.We therefore developed TrG2P,a transfer-learning-based framework.TrG2P first employs convolutional neural networks(CNN)to train models using non-yield-trait phenotypic and genotypic data,thus obtaining pre-trained models.Subsequently,the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task,and the fully connected layers are retrained,thus obtaining fine-tuned models.Finally,the convolutional layer and the first fully connected layer of the fine-tuned models are fused,and the last fully connected layer is trained to enhance prediction performance.We applied TrG2P to five sets of genotypic and phenotypic data from maize(Zea mays),rice(Oryza sativa),and wheat(Triticum aestivum)and compared its model precision to that of seven other popular GS tools:ridge regression best linear unbiased prediction(rrBLUP),random forest,support vector regression,light gradient boosting machine(LightGBM),CNN,DeepGS,and deep neural network for genomic prediction(DNNGP).TrG2P improved the accuracy of yield prediction by 39.9%,6.8%,and 1.8%in rice,maize,and wheat,respectively,compared with predictions generated by the best-performing comparison model.Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data.We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation.The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P.The web-based tool is available at http://trg2p.ebreed.cn:81.
基金supported by the National Key Research and Development Program of China(2018YFD020060 and 2017YFD0301307)the National Natural Science Foundation of China(31971851)the earmarked fund for China Agriculture Research System(CARS-02-12)
文摘Maize(Zea mays L.) stands prominently as one of the major cereal crops in China as well as in the rest of the world.Therefore,predicting the growth and yield of maize for large areas through yield components under high-yielding environments will help in understanding the process of yield formation and yield potential under different environmental conditions.This accurate early assessment of yield requires accuracy in the formation process of yield components as well.In order to formulate the quantitative design for high yields of maize in China,yield performance parameters of quantitative design for high grain yields were evaluated in this study,by utilizing the yield performance equation with normalization of planting density.Planting density was evaluated by parameters including the maximum leaf area index and the maximum leaf area per plant.Results showed that the variation of the maximum leaf area per plant with varying plant density conformed to the Reciprocal Model,which proved to have excellent prediction with root mean square error(RMSE) value of 5.95%.Yield model estimation depicted that the best optimal maximum leaf area per plant was 0.63 times the potential maximum leaf area per plant of hybrids.Yield performance parameters for different yield levels were quantitatively designed based on the yield performance equation.Through validation of the yield performance model by simulating high yields of spring maize in the Inner Mongolia Autonomous Region and Jilin Province,China,and summer maize in Shandong Province,the yield performance equation showed excellent prediction with the satisfactory mean RMSE value(7.72%) of all the parameters.The present study provides theoretical support for the formulation of quantitative design for sustainable high yield of maize in China,through consideration of planting density normalization in the yield prediction process,providing there is no water and nutrient limitation.
文摘Agricultural system is very complex since it deals with large data situation which comes from a number of factors. A lot of techniques and approaches have been used to identify any interactions between factors that affecting yields with the crop performances. The application of neural network to the task of solving non-linear and complex systems is promising. This paper presents a review on the use of artificial neural network (ANN) in predicting crop yield using various crop performance factors. General overview on the application of ANN and the basic concept of neural network architecture are also presented. From the literature, it has been shown that ANN provides better interpretation of crop variability compared to the other methods.
基金supported by National key research and development program(No.2016YFD0401302)the fundamental research funds for the universities of Henan province(No.2015QNJH03)Key Technological Project of Henan Science and Technology Department(No.182102110394)。
文摘In this study,we analyzed the potential of using dry matter content for determining ethanol yield of sweet potatoes as one of the raw materials for bioethanol production.We tested dry matter content,total starch content,crude protein content,glucose content,fructose content,sucrose content and fermentation indicators of 29 sweet potato varieties in Henan province.Correlation analysis between main component contents of sweet potato and the fermentation indicators were carried on.The results showed that there was strong linear correlation between dry matter content and bioethanol yield(R^2=0.935).In order to prove the conclusion,we also tested dry matter content and ethanol yield of another24 sweet potato varieties.Based on the dry matter content and linear correlations,we predicted the ethanol yields.We performed correlation analysis between the predicted values and the measured values of bioethanol yield of the 24 sweet potato varieties,and found highly significant positive correlation between the predicted values and the measured values.These results confirmed the reliability of using dry matter content for bioethanol production prediction for sweet potatoes.
基金This work was support in part by Research on Key Technologies of Intelligent Decision-Making for Food Big Data under Grant 2018A01038in part by the National Science Fund for Youth of Hubei Province of China under Grant 2018CFB408+2 种基金in part by the Natural Science Foundation of Hubei Province of China under Grant 2015CFA061in part by the National Nature Science Foundation of China under Grant 61272278in part by the Major Technical Innovation Projects of Hubei Province under Grant 2018ABA099。
文摘Grain yield security is a basic national policy of China,and changes in grain yield are influenced by a variety of factors,which often have a complex,non-linear relationship with each other.Therefore,this paper proposes a Grey Relational Analysis-Adaptive Boosting-Support Vector Regression(GRA-AdaBoost-SVR)model,which can ensure the prediction accuracy of the model under small sample,improve the generalization ability,and enhance the prediction accuracy.SVR allows mapping to high-dimensional spaces using kernel functions,good for solving nonlinear problems.Grain yield datasets generally have small sample sizes and many features,making SVR a promising application for grain yield datasets.However,the SVR algorithm’s own problems with the selection of parameters and kernel functions make the model less generalizable.Therefore,the Adaptive Boosting(AdaBoost)algorithm can be used.Using the SVR algorithm as a training method for base learners in the AdaBoost algorithm.Effectively address the generalization capability problem in SVR algorithms.In addition,to address the problem of sensitivity to anomalous samples in the AdaBoost algorithm,the GRA method is used to extract influence factors with higher correlation and reduce the number of anomalous samples.Finally,applying the GRA-AdaBoost-SVR model to grain yield forecasting in China.Experiments were conducted to verify the correctness of the model and to compare the effectiveness of several traditional models applied to the grain yield data.The results show that the GRA-AdaBoost-SVR algorithm improves the prediction accuracy,the model is smoother,and confirms that the model possesses better prediction performance and better generalization ability.
文摘Starting from the supposition of time-space substitution, the Langbein-Schumm's Law was applied to deal with response of fluvial erosion System to the changes in mean annual Precipitation induced by global green-house warming. As a result, a simple method was put forward to predict change in sediment yield, with Ningxia Hui Autonomous Region in the northern fringe of the Loess Plateau of China as an example. Results show that, even the change in mean annual precipitation is the same, the direction and magnitude of the resultant chang in sediment yteld would be quite different in fferent physico-geographical zones. When mean annual precipitation is increased, sediment yield in arid or semi-arid areas with a mean anntal Peripitation of less than 400 mm will be increased, while sediment yield in sub-humid or humid areas with a mean annual precipitation of more than 400 mm will be decreased.Additionally, the complex response of fluvial erosion system in time series due to the lag of change in vegetation behind the changn in precipitation has also been qualitatively discussed in this paper.
基金We thank the financial support from the National Natural Science Foundation of China(Nos.21775107,21822108)the Sichuan Science and Technology Program(20CXTD0112)the Fundamental Research Funds for the Central Universities.
文摘Prediction of reaction yields using machine learning(ML)can help chemists select high-yielding reactions and provide prior experience before wet-lab experimenting to improve efficiency.However,the exploration of a multicomponent organic reaction features many complex variables and limited number of experimental data,which are challenging for the application of ML.Herein,we perform yield prediction for the synthesis of 2-oxazolidones via Cu-catalyzed radical-type oxy-alkylation of allylamines and herteroaryl-methylamines with CO_(2),which is a three-component reaction.Using physicochemical descriptors as features to launch ML modelling,we find that XGBoost shows significantly improved performance over linear models and these features are effective for the yield prediction.Moreover,out-of-sample prediction indicates the application potential of the model.This study demonstrates great potential of regression-modelling-based ML in organic synthesis even with complex factors and a general small size of reaction data,which are generated from the classical research pattern of method for the inquiry of multicomponent reactions.
基金Project supposed by the Natural Science Foundation of Jiangsu Province (Nos. BK20161072, BK20150785, and BK20130877) and the National Natural Science Foundation of China (Nos. 61502234 and 71301081)
文摘Due to continuous process scaling, process, voltage, and temperature (PVT) parameter variations have become one of the most problematic issues in circuit design. The resulting correlations among performance metrics lead to a significant parametric yield loss. Previous algorithms on parametric yield prediction are limited to predicting a single-parametric yield or performing balanced optimization for several single-parametric yields. Consequently, these methods fail to predict the multiparametric yield that optimizes multiple performance metrics simultaneously, which may result in significant accuracy loss. In this paper we suggest an efficient multi-parametric yield prediction framework, in which multiple performance metrics are considered as simultaneous constraint conditions for parametric yield prediction, to maintain the correlations among metrics. First, the framework models the performance metrics in terms of PVT parameter variations by using the adaptive elastic net (AEN) method. Then the parametric yield for a single performance metric can be predicted through the computation of the cumulative distribution function (CDF) based on the multiplication theorem and the Markov chain Monte Carlo (MCMC) method. Finally, a copula-based parametric yield prediction procedure has been developed to solve the multi-parametric yield prediction problem, and to generate an accurate yield estimate. Experimental results demonstrate that the proposed multi-parametric yield prediction framework is able to provide the designer with either an accurate value for parametric yield under specific performance limits, or a multi-parametric yield surface under all ranges of performance limits.
基金funded by Turkish Ministry of Development as a part of Agricultural Monitoring and Information Systems Project (2011A020100)the relevant joint project funded by Ministry of Food,Agriculture and Livestock,Turkey
文摘Variation in phenological stage is the major nonlinearity in monitoring, modeling and various estimations of agricultural systems. Indices are used as a common means of evaluating agricultural monitoring data from remote sensing and terrestrial observation systems, and many of these indices have linear characteristics. The analysis of and relationships between indices are dependent on the type of plant, but they are also highly variable with respect to its phenologicat stage. For this reason, variations in the phenologica! stage affect the performance of spatiotemporal crop status monitoring. We hereby propose an adaptive event-triggered model for monitoring crop status based on remote sensing data and terrestrial observations. In the proposed model, the estimation of phenological stage is a part of predicting crop status, and spatially distributed remote sensing parameters and temporal terrestrial monitoring data are used together as inputs in a state space system model. The temporal data are segmented with respect to the phenological stage-oriented timing of the spatial data, so instead of a generalized discrete state space model, we used logical states combined with analog inputs and adaptive trigger functions, as in the case of a Mealy machine model. This provides the necessary nonlinearity for the state transi- tions. The results showed that observation parameters have considerably greater significance in crop status monitoring with respect to conventional agricultural data fusion techniques.
文摘Estimation of yield performance for crop products is a topic of interest in agriculture.In breeding programs,we cannot test all possible hybrids created by crossing two parents(inbred and tester)since it would be too time consuming and costly.In this paper,we exploit different machine learning algorithms including decision tree,gradient boosting machine,random forest,adaptive boosting,XGBoost and neural network to predict the yield of corn hybrids using data provided in the 2020 Syngenta Crop Challenge.The participants were asked to predict the yield of missing hybrids which were not tested before.Our results show that the prediction obtained by XGBoost is more accurate than other models with a root mean square error equal to 0.0524.Therefore,we use XGBoost model to estimate the yield performance for untested combinations of inbreds and testers.Using this approach,we identify hybrids with high predicted yield that can be bred to increase corn production.
文摘As die size and complexity increase, accurate and efficient extraction of the critical area is essential for yield prediction. Aiming at eliminating the potential integration errors of the traditional shape shifting method, an improved shape shifting method is proposed for Manhattan layouts. By mathematical analyses of the relevance of critical areas to defect sizes, the critical area for all defect sizes is modeled as a piecewise quadratic polynomial function of defect size, which can be obtained by extracting critical area for some certain defect sizes. Because the improved method calculates critical areas for all defect sizes instead of several discrete values with traditional shape shifting method, it eliminates the integration error of the average critical area. Experiments on industrial layouts show that the improved shape shifting method can improve the accuracy of the average critical area calculation by 24.3% or reduce about 59.7% computational expense compared with the traditional method.