In regression, despite being both aimed at estimating the Mean Squared Prediction Error (MSPE), Akaike’s Final Prediction Error (FPE) and the Generalized Cross Validation (GCV) selection criteria are usually derived ...In regression, despite being both aimed at estimating the Mean Squared Prediction Error (MSPE), Akaike’s Final Prediction Error (FPE) and the Generalized Cross Validation (GCV) selection criteria are usually derived from two quite different perspectives. Here, settling on the most commonly accepted definition of the MSPE as the expectation of the squared prediction error loss, we provide theoretical expressions for it, valid for any linear model (LM) fitter, be it under random or non random designs. Specializing these MSPE expressions for each of them, we are able to derive closed formulas of the MSPE for some of the most popular LM fitters: Ordinary Least Squares (OLS), with or without a full column rank design matrix;Ordinary and Generalized Ridge regression, the latter embedding smoothing splines fitting. For each of these LM fitters, we then deduce a computable estimate of the MSPE which turns out to coincide with Akaike’s FPE. Using a slight variation, we similarly get a class of MSPE estimates coinciding with the classical GCV formula for those same LM fitters.展开更多
The research focuses on improving predictive accuracy in the financial sector through the exploration of machine learning algorithms for stock price prediction. The research follows an organized process combining Agil...The research focuses on improving predictive accuracy in the financial sector through the exploration of machine learning algorithms for stock price prediction. The research follows an organized process combining Agile Scrum and the Obtain, Scrub, Explore, Model, and iNterpret (OSEMN) methodology. Six machine learning models, namely Linear Forecast, Naive Forecast, Simple Moving Average with weekly window (SMA 5), Simple Moving Average with monthly window (SMA 20), Autoregressive Integrated Moving Average (ARIMA), and Long Short-Term Memory (LSTM), are compared and evaluated through Mean Absolute Error (MAE), with the LSTM model performing the best, showcasing its potential for practical financial applications. A Django web application “Predict It” is developed to implement the LSTM model. Ethical concerns related to predictive modeling in finance are addressed. Data quality, algorithm choice, feature engineering, and preprocessing techniques are emphasized for better model performance. The research acknowledges limitations and suggests future research directions, aiming to equip investors and financial professionals with reliable predictive models for dynamic markets.展开更多
This study assesses the predictive capabilities of the CMA-GD model for wind speed prediction in two wind farms located in Hubei Province,China.The observed wind speeds at the height of 70m in wind turbines of two win...This study assesses the predictive capabilities of the CMA-GD model for wind speed prediction in two wind farms located in Hubei Province,China.The observed wind speeds at the height of 70m in wind turbines of two wind farms in Suizhou serve as the actual observation data for comparison and testing.At the same time,the wind speed predicted by the EC model is also included for comparative analysis.The results indicate that the CMA-GD model performs better than the EC model in Wind Farm A.The CMA-GD model exhibits a monthly average correlation coefficient of 0.56,root mean square error of 2.72 m s^(-1),and average absolute error of 2.11 m s^(-1).In contrast,the EC model shows a monthly average correlation coefficient of 0.51,root mean square error of 2.83 m s^(-1),and average absolute error of 2.21 m s^(-1).Conversely,in Wind Farm B,the EC model outperforms the CMA-GD model.The CMA-GD model achieves a monthly average correlation coefficient of 0.55,root mean square error of 2.61 m s^(-1),and average absolute error of 2.13 m s^(-1).By contrast,the EC model displays a monthly average correlation coefficient of 0.63,root mean square error of 2.04 m s^(-1),and average absolute error of 1.67 m s^(-1).展开更多
The present study aims to analyze the shift in shoreline due to coastal processes and formulate available for best estimate of future shoreline positions based on precedent shorelines. Information on rates and trends ...The present study aims to analyze the shift in shoreline due to coastal processes and formulate available for best estimate of future shoreline positions based on precedent shorelines. Information on rates and trends of shoreline change can be used to improve the understanding of the underlying causes and potential effects of coastal erosion which can support informed coastal management decisions. In this paper, researchers go over the changes in the recent positions of the shoreline of the Balasore coast for the 38 years from 1975 through 2013. The study area includes the Balasore coastal region from Rasalpur to Udaypur together with Chandipur, Choumukh, Chandrabali as well as Bichitrapur. Transects wise shoreline data base were developed for approximately 67 kilometers of shoreline and erosional/accretional scenario has also been analysed by delineating the shoreline from Landsat imageries of 1975, 1980, 1990, 1995, 2000, 2005, 2010 and 2013. A simple Linear Regression Model and End Point Rate (EPR) have been adopted to take out the rate of change of shoreline and its future positions, based on empirical observations at 67 transects along the Balasore coast. It is found that the north eastern part of Balasore coast in the vicinity of Subarnarekha estuary and Chandrabali beach undergo high rates of shore line shift. The shoreline data were integrated for long- (about 17 years) and short-term (about 7 years) shift rates analysis to comprehend the shoreline change and prediction. For the prediction of future shoreline, the model has been validated with the present shoreline position (2013). The rate of shoreline movement calculated from the fixed base line to shoreline position of 1975, 1980, 1990, 1995, 2000, 2005 and 2010 and based on this, the estimated shoreline of 2013 was calculated. The estimated shoreline was compared with the actual shoreline delineated from satellite imagery of 2013. The model error or positional shift at each sample point is observed. The positional error varies from??4.82 m to 212.41 m. It has been found that model prediction error is higher in the left hand side of river Subarnarekha. The overall error for the entire predicted shoreline was found to be 41.88 m by Root Mean Square Error (RMSE). In addition, it was tested by means difference between actual and predicted shoreline positions using “t” test and it has been found that predicted shore line is not significantly different from actual shoreline position at (t132 = 0.278) p < 0.01.展开更多
It was found earlier that moisture content (MC) of intact kernels of grain and nuts could be determined by Near Infra Red (NIR) reflectance spectrometry. However, if the MC values can be determined while the nuts are ...It was found earlier that moisture content (MC) of intact kernels of grain and nuts could be determined by Near Infra Red (NIR) reflectance spectrometry. However, if the MC values can be determined while the nuts are in their shells, it would save lot of labor and money spent in shelling and cleaning the nuts. Grain and nuts absorb low levels of NIR, and when NIR radiation is incident on them, a substantial portion of the radiation is reflected back. Thus, studying the NIR reflectance spectra emanating from in-shell peanuts, an attempt is made for the first time to determine the MC of in-shell peanuts. In-shell peanuts of two different market types, Virginia and Valencia, were conditioned to different moisture levels between 6% and 26% (wet basis), and separated into calibration and validation groups. NIR absorption spectral data from 1000 nm to 2500 nm in 1 nm intervals were collected from both groups. Measurements were obtained on 30 replicates within each moisture level. Reference MC values for each moisture level in these groups were obtained using standard air-oven method. Partial Least Square (PLS) analysis was performed on the calibration data, and prediction models were developed. The Standard Error of Calibration (SEC), and R2 of the calibration models were computed to select the best calibration model. The selected models were used to predict the moisture content of peanuts in the validation sets. Predicted MC values of the validation samples were compared with their standard air-oven moisture values. Goodness of fit was determined based on the lowest Standard Error of Prediction (SEP) and highest R2 value obtained for the prediction models. The model, with reflectance plus normalization spectral data with an SEP of 0.74 for Valencia and 1.57 for Virginia type in-shell peanuts was selected as the best model. The corresponding R2 values were 0.98 for both peanut types. This work establishes the possibility of sensing MC of intact in-shell peanuts by NIR reflectance method, and would be useful for the peanut and allied industries.展开更多
文摘In regression, despite being both aimed at estimating the Mean Squared Prediction Error (MSPE), Akaike’s Final Prediction Error (FPE) and the Generalized Cross Validation (GCV) selection criteria are usually derived from two quite different perspectives. Here, settling on the most commonly accepted definition of the MSPE as the expectation of the squared prediction error loss, we provide theoretical expressions for it, valid for any linear model (LM) fitter, be it under random or non random designs. Specializing these MSPE expressions for each of them, we are able to derive closed formulas of the MSPE for some of the most popular LM fitters: Ordinary Least Squares (OLS), with or without a full column rank design matrix;Ordinary and Generalized Ridge regression, the latter embedding smoothing splines fitting. For each of these LM fitters, we then deduce a computable estimate of the MSPE which turns out to coincide with Akaike’s FPE. Using a slight variation, we similarly get a class of MSPE estimates coinciding with the classical GCV formula for those same LM fitters.
文摘The research focuses on improving predictive accuracy in the financial sector through the exploration of machine learning algorithms for stock price prediction. The research follows an organized process combining Agile Scrum and the Obtain, Scrub, Explore, Model, and iNterpret (OSEMN) methodology. Six machine learning models, namely Linear Forecast, Naive Forecast, Simple Moving Average with weekly window (SMA 5), Simple Moving Average with monthly window (SMA 20), Autoregressive Integrated Moving Average (ARIMA), and Long Short-Term Memory (LSTM), are compared and evaluated through Mean Absolute Error (MAE), with the LSTM model performing the best, showcasing its potential for practical financial applications. A Django web application “Predict It” is developed to implement the LSTM model. Ethical concerns related to predictive modeling in finance are addressed. Data quality, algorithm choice, feature engineering, and preprocessing techniques are emphasized for better model performance. The research acknowledges limitations and suggests future research directions, aiming to equip investors and financial professionals with reliable predictive models for dynamic markets.
基金National Key Research and Development Program of the Ministry of Science(2018YFB1502801)Hubei Provincial Natural Science Foundation(2022CFD017)Innovation and Development Project of China Meteorological Administration(CXFZ2023J044)。
文摘This study assesses the predictive capabilities of the CMA-GD model for wind speed prediction in two wind farms located in Hubei Province,China.The observed wind speeds at the height of 70m in wind turbines of two wind farms in Suizhou serve as the actual observation data for comparison and testing.At the same time,the wind speed predicted by the EC model is also included for comparative analysis.The results indicate that the CMA-GD model performs better than the EC model in Wind Farm A.The CMA-GD model exhibits a monthly average correlation coefficient of 0.56,root mean square error of 2.72 m s^(-1),and average absolute error of 2.11 m s^(-1).In contrast,the EC model shows a monthly average correlation coefficient of 0.51,root mean square error of 2.83 m s^(-1),and average absolute error of 2.21 m s^(-1).Conversely,in Wind Farm B,the EC model outperforms the CMA-GD model.The CMA-GD model achieves a monthly average correlation coefficient of 0.55,root mean square error of 2.61 m s^(-1),and average absolute error of 2.13 m s^(-1).By contrast,the EC model displays a monthly average correlation coefficient of 0.63,root mean square error of 2.04 m s^(-1),and average absolute error of 1.67 m s^(-1).
文摘The present study aims to analyze the shift in shoreline due to coastal processes and formulate available for best estimate of future shoreline positions based on precedent shorelines. Information on rates and trends of shoreline change can be used to improve the understanding of the underlying causes and potential effects of coastal erosion which can support informed coastal management decisions. In this paper, researchers go over the changes in the recent positions of the shoreline of the Balasore coast for the 38 years from 1975 through 2013. The study area includes the Balasore coastal region from Rasalpur to Udaypur together with Chandipur, Choumukh, Chandrabali as well as Bichitrapur. Transects wise shoreline data base were developed for approximately 67 kilometers of shoreline and erosional/accretional scenario has also been analysed by delineating the shoreline from Landsat imageries of 1975, 1980, 1990, 1995, 2000, 2005, 2010 and 2013. A simple Linear Regression Model and End Point Rate (EPR) have been adopted to take out the rate of change of shoreline and its future positions, based on empirical observations at 67 transects along the Balasore coast. It is found that the north eastern part of Balasore coast in the vicinity of Subarnarekha estuary and Chandrabali beach undergo high rates of shore line shift. The shoreline data were integrated for long- (about 17 years) and short-term (about 7 years) shift rates analysis to comprehend the shoreline change and prediction. For the prediction of future shoreline, the model has been validated with the present shoreline position (2013). The rate of shoreline movement calculated from the fixed base line to shoreline position of 1975, 1980, 1990, 1995, 2000, 2005 and 2010 and based on this, the estimated shoreline of 2013 was calculated. The estimated shoreline was compared with the actual shoreline delineated from satellite imagery of 2013. The model error or positional shift at each sample point is observed. The positional error varies from??4.82 m to 212.41 m. It has been found that model prediction error is higher in the left hand side of river Subarnarekha. The overall error for the entire predicted shoreline was found to be 41.88 m by Root Mean Square Error (RMSE). In addition, it was tested by means difference between actual and predicted shoreline positions using “t” test and it has been found that predicted shore line is not significantly different from actual shoreline position at (t132 = 0.278) p < 0.01.
文摘It was found earlier that moisture content (MC) of intact kernels of grain and nuts could be determined by Near Infra Red (NIR) reflectance spectrometry. However, if the MC values can be determined while the nuts are in their shells, it would save lot of labor and money spent in shelling and cleaning the nuts. Grain and nuts absorb low levels of NIR, and when NIR radiation is incident on them, a substantial portion of the radiation is reflected back. Thus, studying the NIR reflectance spectra emanating from in-shell peanuts, an attempt is made for the first time to determine the MC of in-shell peanuts. In-shell peanuts of two different market types, Virginia and Valencia, were conditioned to different moisture levels between 6% and 26% (wet basis), and separated into calibration and validation groups. NIR absorption spectral data from 1000 nm to 2500 nm in 1 nm intervals were collected from both groups. Measurements were obtained on 30 replicates within each moisture level. Reference MC values for each moisture level in these groups were obtained using standard air-oven method. Partial Least Square (PLS) analysis was performed on the calibration data, and prediction models were developed. The Standard Error of Calibration (SEC), and R2 of the calibration models were computed to select the best calibration model. The selected models were used to predict the moisture content of peanuts in the validation sets. Predicted MC values of the validation samples were compared with their standard air-oven moisture values. Goodness of fit was determined based on the lowest Standard Error of Prediction (SEP) and highest R2 value obtained for the prediction models. The model, with reflectance plus normalization spectral data with an SEP of 0.74 for Valencia and 1.57 for Virginia type in-shell peanuts was selected as the best model. The corresponding R2 values were 0.98 for both peanut types. This work establishes the possibility of sensing MC of intact in-shell peanuts by NIR reflectance method, and would be useful for the peanut and allied industries.