The structural equation model (SEM) concept is generally influenced by the presence of outliers and controlling variables. To a very large extent, this could have consequential effects on the parameters and the model ...The structural equation model (SEM) concept is generally influenced by the presence of outliers and controlling variables. To a very large extent, this could have consequential effects on the parameters and the model fitness. Though previous researches have studied outliers and controlling observations from various perspectives including the use of box plots, normal probability plots, among others, the use of uniform horizontal QQ plot is yet to be explored. This study is, therefore, aimed at applying uniform QQ plots to identifying outliers and possible controlling observations in SEM. The results showed that all the three methods of estimators manifest the ability to identify outliers and possible controlling observations in SEM. It was noted that the Anderson-Rubin estimator of QQ plot showed a more efficient or visual display of spotting outliers and possible controlling observations as compared to the other methods of estimators. Therefore, this paper provides an efficient way identifying outliers as it fragments the data set.展开更多
Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two est...Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.展开更多
Background: The minimum set of sub-models for simulating stand dynamics on an individual-tree basis consists of tree-level models for diameter increment and survival. Ingrowth model is a necessary third component in ...Background: The minimum set of sub-models for simulating stand dynamics on an individual-tree basis consists of tree-level models for diameter increment and survival. Ingrowth model is a necessary third component in uneven-aged management. The development of this type of model set needs data from permanent plots, in which all trees have been numbered and measured at regular intervals for diameter and survival. New trees passing the ingrowth limit should also be numbered and measured. Unfortunately, few datasets meet all these requirements. The trees may not have numbers or the length of the measurement interval varies. Ingrowth trees may not have been measured, or the number tags may have disappeared causing errors in tree identification. Methods: This article discussed and demonstrated the use of an optimization-based approach to individual-tree growth modelling, which makes it possible to utilize data sets having one or several of the above deficiencies. The idea is to estimate all parameters of the sub-models of a growth simulator simultaneously in such a way that, when simulation begins from the diameter distribution at the first measurement occasion, it yields a similar ending diameter distribution as measured in the second measurement occasion. The method was applied to Pinus patula permanent sample plot data from Kenya. In this dataset, trees were correctly numbered and identified but measurement interval varied from 1 to 13 years. Two simple regression approaches were used and compared to the optimization-based model recovery approach. Results: The optimization-based approach resulted in far more accurate simulations of stand basal area and number of surviving trees than the equations fitted through regression analysis. Conclusions: The optimization-based modelling approach can be recommended for growth modelling when the modelling data have been collected at irregular measurement intervals.展开更多
The two-parameter lognormal distribution is a variant of the normal distribution and the three-parameter lognormal distribution is an extension of the two-parameter lognormal distribution by introducing a location par...The two-parameter lognormal distribution is a variant of the normal distribution and the three-parameter lognormal distribution is an extension of the two-parameter lognormal distribution by introducing a location parameter. The Q-Q plot of the three-parameter lognormal distribution is widely used. To obtain the Q-Q plot one needs to iteratively try different values of the shape parameter and subjectively judge the linearity of the Q-Q plot. In this paper,a mathematical method was proposed to determine the value of the shape parameter so as to simplify the generation of the Q-Q plot. Then a new probability plot was proposed,which was more easily obtained and provided more accurate parameter estimates than the Q-Q plot. These are illustrated by three realworld examples.展开更多
The use of Recurrence plots have been extensively used in various fields. In this work, Recurrence Plots (RPs) investigates the changes in the non-linear behaviour of urban air pollution using large datasets of raw da...The use of Recurrence plots have been extensively used in various fields. In this work, Recurrence Plots (RPs) investigates the changes in the non-linear behaviour of urban air pollution using large datasets of raw data (hourly). This analysis has not been used before to extract information from large datasets for this type non-linear problem. Two different approaches have been used to tackle this problem. The first approach is to show results according to monitoring network. The second approach is to show the results by particle type. This analysis shows the feasibility of using Recurrence Analysis for pollution monitoring and control.展开更多
Historical evidence indicates that dust storms of considerable ferocity often wreak havoc, posing a genuine threat to the climatic and societal equilibrium of a place. A systematic study, with emphasis on the modeling...Historical evidence indicates that dust storms of considerable ferocity often wreak havoc, posing a genuine threat to the climatic and societal equilibrium of a place. A systematic study, with emphasis on the modeling and forecasting aspects, thus, becomes imperative, so that efficient measures can be promptly undertaken to cushion the effect of such an unforeseen calamity. The present work intends to discover a suitable ARIMA model using dust storm data from northern China from March 1954 to April 2002, provided by Zhou and Zhang (2003), thereby extending the idea of empirical recurrence rate (ERR) developed by Ho (2008), to model the temporal trend of such sand dust storms. In particular we show that the ERR time series is endowed with the following characteristics: 1) it is a potent surrogate for a point process, 2) it is capable of taking advantage of the well developed and powerful time series modeling tools and 3) it can generate reliable forecasts, with which we can retrieve the corresponding mean number of strong sand dust storms. A simulation study is conducted prior to the actual fitting, to justify the applicability of the proposed technique.展开更多
The Varzaghan district at the northwestern margin of the Urumieh–Dokhtar magmatic arc, is considered a promising area for the exploration of porphyry Cu deposits in Iran. In this study we identified mono-and multi-el...The Varzaghan district at the northwestern margin of the Urumieh–Dokhtar magmatic arc, is considered a promising area for the exploration of porphyry Cu deposits in Iran. In this study we identified mono-and multi-element geochemical anomalies associated with Cu–Au–Mo–Bi mineralization in the central parts of the Varzaghan district by applying the concentration–area fractal method. After mono-element geochemical investigations, principal component analysis was applied to ten selected elements in order to acquire a multi-element geochemical signature based on the mineralization-related component. Quantitative comparisons of the obtained fractal-based populations were carried out in accordance with known Cu occurrences using Student's t-values. Then,significant mono-and multi-element geochemical layers were separately combined with related geologic and structural layers to generate prospectivity models, using the fuzzy GAMMA approach. For quantitative evaluation of the effectiveness of different geochemical signatures in final prospectivity models, a prediction-area plot was adapted. The results show that the multi-element geochemical signature of principal component one(PC1) is more effective than mono-element layers in delimiting exploration targets related to porphyry Cu deposits.展开更多
文摘The structural equation model (SEM) concept is generally influenced by the presence of outliers and controlling variables. To a very large extent, this could have consequential effects on the parameters and the model fitness. Though previous researches have studied outliers and controlling observations from various perspectives including the use of box plots, normal probability plots, among others, the use of uniform horizontal QQ plot is yet to be explored. This study is, therefore, aimed at applying uniform QQ plots to identifying outliers and possible controlling observations in SEM. The results showed that all the three methods of estimators manifest the ability to identify outliers and possible controlling observations in SEM. It was noted that the Anderson-Rubin estimator of QQ plot showed a more efficient or visual display of spotting outliers and possible controlling observations as compared to the other methods of estimators. Therefore, this paper provides an efficient way identifying outliers as it fragments the data set.
文摘Given a sample of regression data from (Y, Z), a new diagnostic plotting method is proposed for checking the hypothesis H0: the data are from a given Cox model with the time-dependent covariates Z. It compares two estimates of the marginal distribution FY of Y. One is an estimate of the modified expression of FY under H0, based on a consistent estimate of the parameter under H0, and based on the baseline distribution of the data. The other is the Kaplan-Meier-estimator of FY, together with its confidence band. The new plot, called the marginal distribution plot, can be viewed as a test for testing H0. The main advantage of the test over the existing residual tests is in the case that the data do not satisfy any Cox model or the Cox model is mis-specified. Then the new test is still valid, but not the residual tests and the residual tests often make type II error with a very large probability.
文摘Background: The minimum set of sub-models for simulating stand dynamics on an individual-tree basis consists of tree-level models for diameter increment and survival. Ingrowth model is a necessary third component in uneven-aged management. The development of this type of model set needs data from permanent plots, in which all trees have been numbered and measured at regular intervals for diameter and survival. New trees passing the ingrowth limit should also be numbered and measured. Unfortunately, few datasets meet all these requirements. The trees may not have numbers or the length of the measurement interval varies. Ingrowth trees may not have been measured, or the number tags may have disappeared causing errors in tree identification. Methods: This article discussed and demonstrated the use of an optimization-based approach to individual-tree growth modelling, which makes it possible to utilize data sets having one or several of the above deficiencies. The idea is to estimate all parameters of the sub-models of a growth simulator simultaneously in such a way that, when simulation begins from the diameter distribution at the first measurement occasion, it yields a similar ending diameter distribution as measured in the second measurement occasion. The method was applied to Pinus patula permanent sample plot data from Kenya. In this dataset, trees were correctly numbered and identified but measurement interval varied from 1 to 13 years. Two simple regression approaches were used and compared to the optimization-based model recovery approach. Results: The optimization-based approach resulted in far more accurate simulations of stand basal area and number of surviving trees than the equations fitted through regression analysis. Conclusions: The optimization-based modelling approach can be recommended for growth modelling when the modelling data have been collected at irregular measurement intervals.
基金National Natural Science Foundation of China(No.71371035)
文摘The two-parameter lognormal distribution is a variant of the normal distribution and the three-parameter lognormal distribution is an extension of the two-parameter lognormal distribution by introducing a location parameter. The Q-Q plot of the three-parameter lognormal distribution is widely used. To obtain the Q-Q plot one needs to iteratively try different values of the shape parameter and subjectively judge the linearity of the Q-Q plot. In this paper,a mathematical method was proposed to determine the value of the shape parameter so as to simplify the generation of the Q-Q plot. Then a new probability plot was proposed,which was more easily obtained and provided more accurate parameter estimates than the Q-Q plot. These are illustrated by three realworld examples.
文摘The use of Recurrence plots have been extensively used in various fields. In this work, Recurrence Plots (RPs) investigates the changes in the non-linear behaviour of urban air pollution using large datasets of raw data (hourly). This analysis has not been used before to extract information from large datasets for this type non-linear problem. Two different approaches have been used to tackle this problem. The first approach is to show results according to monitoring network. The second approach is to show the results by particle type. This analysis shows the feasibility of using Recurrence Analysis for pollution monitoring and control.
文摘Historical evidence indicates that dust storms of considerable ferocity often wreak havoc, posing a genuine threat to the climatic and societal equilibrium of a place. A systematic study, with emphasis on the modeling and forecasting aspects, thus, becomes imperative, so that efficient measures can be promptly undertaken to cushion the effect of such an unforeseen calamity. The present work intends to discover a suitable ARIMA model using dust storm data from northern China from March 1954 to April 2002, provided by Zhou and Zhang (2003), thereby extending the idea of empirical recurrence rate (ERR) developed by Ho (2008), to model the temporal trend of such sand dust storms. In particular we show that the ERR time series is endowed with the following characteristics: 1) it is a potent surrogate for a point process, 2) it is capable of taking advantage of the well developed and powerful time series modeling tools and 3) it can generate reliable forecasts, with which we can retrieve the corresponding mean number of strong sand dust storms. A simulation study is conducted prior to the actual fitting, to justify the applicability of the proposed technique.
文摘The Varzaghan district at the northwestern margin of the Urumieh–Dokhtar magmatic arc, is considered a promising area for the exploration of porphyry Cu deposits in Iran. In this study we identified mono-and multi-element geochemical anomalies associated with Cu–Au–Mo–Bi mineralization in the central parts of the Varzaghan district by applying the concentration–area fractal method. After mono-element geochemical investigations, principal component analysis was applied to ten selected elements in order to acquire a multi-element geochemical signature based on the mineralization-related component. Quantitative comparisons of the obtained fractal-based populations were carried out in accordance with known Cu occurrences using Student's t-values. Then,significant mono-and multi-element geochemical layers were separately combined with related geologic and structural layers to generate prospectivity models, using the fuzzy GAMMA approach. For quantitative evaluation of the effectiveness of different geochemical signatures in final prospectivity models, a prediction-area plot was adapted. The results show that the multi-element geochemical signature of principal component one(PC1) is more effective than mono-element layers in delimiting exploration targets related to porphyry Cu deposits.