Classical survival analysis assumes all subjects will experience the event of interest, but in some cases, a portion of the population may never encounter the event. These survival methods further assume independent s...Classical survival analysis assumes all subjects will experience the event of interest, but in some cases, a portion of the population may never encounter the event. These survival methods further assume independent survival times, which is not valid for honey bees, which live in nests. The study introduces a semi-parametric marginal proportional hazards mixture cure (PHMC) model with exchangeable correlation structure, using generalized estimating equations for survival data analysis. The model was tested on clustered right-censored bees survival data with a cured fraction, where two bee species were subjected to different entomopathogens to test the effect of the entomopathogens on the survival of the bee species. The Expectation-Solution algorithm is used to estimate the parameters. The study notes a weak positive association between cure statuses (ρ1=0.0007) and survival times for uncured bees (ρ2=0.0890), emphasizing their importance. The odds of being uncured for A. mellifera is higher than the odds for species M. ferruginea. The bee species, A. mellifera are more susceptible to entomopathogens icipe 7, icipe 20, and icipe 69. The Cox-Snell residuals show that the proposed semiparametric PH model generally fits the data well as compared to model that assume independent correlation structure. Thus, the semi parametric marginal proportional hazards mixture cure is parsimonious model for correlated bees survival data.展开更多
The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effectiv...The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effective algorithm to estimate the finite mixture model parameters. However, EM algorithm can not guarantee to find the global optimal solution, and often easy to fall into local optimal solution, so it is sensitive to the determination of initial value to iteration. Traditional EM algorithm select the initial value at random, we propose an improved method of selection of initial value. First, we use the k-nearest-neighbor method to delete outliers. Second, use the k-means to initialize the EM algorithm. Compare this method with the original random initial value method, numerical experiments show that the parameter estimation effect of the initialization of the EM algorithm is significantly better than the effect of the original EM algorithm.展开更多
This paper presents the modeling of 2D CSAMT responses generated by horizontal electric dipole using the separation of primary and secondary field technique. The primary field is calculated using 1D analytical solutio...This paper presents the modeling of 2D CSAMT responses generated by horizontal electric dipole using the separation of primary and secondary field technique. The primary field is calculated using 1D analytical solution for homogeneous earth and it is used to calculate the secondary electric field in the inhomogeneous Helmholtz Equation. Calculation of Helmholtz Equation is carried out using the finite element method. Validation of this modeling is conducted by comparison of numerical results with 1D analytical response for the case of homogeneous and layered earth. The comparison of CSAMT responses are also provided for 2D cases of vertical contact and anomalous conductive body with the 2D magnetotelluric (MT) responses. The results of this study are expected to provide better interpretation of the 2D CSAMT data.展开更多
The contribution of this work is twofold: (1) a multimodality prediction method of chaotic time series with the Gaussian process mixture (GPM) model is proposed, which employs a divide and conquer strategy. It au...The contribution of this work is twofold: (1) a multimodality prediction method of chaotic time series with the Gaussian process mixture (GPM) model is proposed, which employs a divide and conquer strategy. It automatically divides the chaotic time series into multiple modalities with different extrinsic patterns and intrinsic characteristics, and thus can more precisely fit the chaotic time series. (2) An effective sparse hard-cut expec- tation maximization (SHC-EM) learning algorithm for the GPM model is proposed to improve the prediction performance. SHO-EM replaces a large learning sample set with fewer pseudo inputs, accelerating model learning based on these pseudo inputs. Experiments on Lorenz and Chua time series demonstrate that the proposed method yields not only accurate multimodality prediction, but also the prediction confidence interval SHC-EM outperforms the traditional variational 1earning in terms of both prediction accuracy and speed. In addition, SHC-EM is more robust and insusceptible to noise than variational learning.展开更多
The global economic crisis that blew up at the end of 2006 in the United States has had extremely negative impacts on the social, political, and economic fields. The countries operating in the most affected macro area...The global economic crisis that blew up at the end of 2006 in the United States has had extremely negative impacts on the social, political, and economic fields. The countries operating in the most affected macro areas---the United States and Europe---have put through the wringer the domestic trade relationships as well as the international ones, by injecting a chain reaction into the global economic scenario. However, there are countries that seem to be free from the economic and financial contagion overflowing over the past years, as they are moved by an "invincible projection toward the growth". The present study aims to analyze how much the main emerging market of China has been effectively involved in this vicious circle. More specifically, the study intends to propose an empirical analysis on the real connection between the macroecnnomic data and the strong structure of the Chinese publicly listed companies. This paper investigates the prediction of failure among 3,220 Chinese publicly traded companies (listed companies) during the global crisis period. By analysing the financial accounting data over the past seven years (2008 to 2014), the emerging market score (EMS) has been adopted in order to investigate the impact of the crisis on financial distress in the main emerging market of China. The results confirm the following hypotheses: On one hand, the great majority of companies have not been suffering the downturn, since 71.93% of the entire samples present no risk of financial distress during the global crisis; on the other hand, only 6.18% have a reasonable risk of financial distress.展开更多
文摘Classical survival analysis assumes all subjects will experience the event of interest, but in some cases, a portion of the population may never encounter the event. These survival methods further assume independent survival times, which is not valid for honey bees, which live in nests. The study introduces a semi-parametric marginal proportional hazards mixture cure (PHMC) model with exchangeable correlation structure, using generalized estimating equations for survival data analysis. The model was tested on clustered right-censored bees survival data with a cured fraction, where two bee species were subjected to different entomopathogens to test the effect of the entomopathogens on the survival of the bee species. The Expectation-Solution algorithm is used to estimate the parameters. The study notes a weak positive association between cure statuses (ρ1=0.0007) and survival times for uncured bees (ρ2=0.0890), emphasizing their importance. The odds of being uncured for A. mellifera is higher than the odds for species M. ferruginea. The bee species, A. mellifera are more susceptible to entomopathogens icipe 7, icipe 20, and icipe 69. The Cox-Snell residuals show that the proposed semiparametric PH model generally fits the data well as compared to model that assume independent correlation structure. Thus, the semi parametric marginal proportional hazards mixture cure is parsimonious model for correlated bees survival data.
文摘The EM algorithm is a very popular maximum likelihood estimation method, the iterative algorithm for solving the maximum likelihood estimator when the observation data is the incomplete data, but also is very effective algorithm to estimate the finite mixture model parameters. However, EM algorithm can not guarantee to find the global optimal solution, and often easy to fall into local optimal solution, so it is sensitive to the determination of initial value to iteration. Traditional EM algorithm select the initial value at random, we propose an improved method of selection of initial value. First, we use the k-nearest-neighbor method to delete outliers. Second, use the k-means to initialize the EM algorithm. Compare this method with the original random initial value method, numerical experiments show that the parameter estimation effect of the initialization of the EM algorithm is significantly better than the effect of the original EM algorithm.
文摘This paper presents the modeling of 2D CSAMT responses generated by horizontal electric dipole using the separation of primary and secondary field technique. The primary field is calculated using 1D analytical solution for homogeneous earth and it is used to calculate the secondary electric field in the inhomogeneous Helmholtz Equation. Calculation of Helmholtz Equation is carried out using the finite element method. Validation of this modeling is conducted by comparison of numerical results with 1D analytical response for the case of homogeneous and layered earth. The comparison of CSAMT responses are also provided for 2D cases of vertical contact and anomalous conductive body with the 2D magnetotelluric (MT) responses. The results of this study are expected to provide better interpretation of the 2D CSAMT data.
基金Supported by the National Natural Science Foundation of China under Grant No 60972106the China Postdoctoral Science Foundation under Grant No 2014M561053+1 种基金the Humanity and Social Science Foundation of Ministry of Education of China under Grant No 15YJA630108the Hebei Province Natural Science Foundation under Grant No E2016202341
文摘The contribution of this work is twofold: (1) a multimodality prediction method of chaotic time series with the Gaussian process mixture (GPM) model is proposed, which employs a divide and conquer strategy. It automatically divides the chaotic time series into multiple modalities with different extrinsic patterns and intrinsic characteristics, and thus can more precisely fit the chaotic time series. (2) An effective sparse hard-cut expec- tation maximization (SHC-EM) learning algorithm for the GPM model is proposed to improve the prediction performance. SHO-EM replaces a large learning sample set with fewer pseudo inputs, accelerating model learning based on these pseudo inputs. Experiments on Lorenz and Chua time series demonstrate that the proposed method yields not only accurate multimodality prediction, but also the prediction confidence interval SHC-EM outperforms the traditional variational 1earning in terms of both prediction accuracy and speed. In addition, SHC-EM is more robust and insusceptible to noise than variational learning.
文摘The global economic crisis that blew up at the end of 2006 in the United States has had extremely negative impacts on the social, political, and economic fields. The countries operating in the most affected macro areas---the United States and Europe---have put through the wringer the domestic trade relationships as well as the international ones, by injecting a chain reaction into the global economic scenario. However, there are countries that seem to be free from the economic and financial contagion overflowing over the past years, as they are moved by an "invincible projection toward the growth". The present study aims to analyze how much the main emerging market of China has been effectively involved in this vicious circle. More specifically, the study intends to propose an empirical analysis on the real connection between the macroecnnomic data and the strong structure of the Chinese publicly listed companies. This paper investigates the prediction of failure among 3,220 Chinese publicly traded companies (listed companies) during the global crisis period. By analysing the financial accounting data over the past seven years (2008 to 2014), the emerging market score (EMS) has been adopted in order to investigate the impact of the crisis on financial distress in the main emerging market of China. The results confirm the following hypotheses: On one hand, the great majority of companies have not been suffering the downturn, since 71.93% of the entire samples present no risk of financial distress during the global crisis; on the other hand, only 6.18% have a reasonable risk of financial distress.
基金This work was supported by the National Natural Science Foundation of China (51507015, 61773402, 61540037, 71271215, 61233008, 51425701, 70921001, 51577014), the Natural Science Foundation of Hunan Province (2015JJ3008), the Key Laboratory of Renewable Energy Electric-Technology of Hunan Province (2014ZNDL002), and Hunan Province Science and Technology Program(2015NK3035).