The decomposition-based vector autoregressive model (DVAR) provides a new framework for scrutinizing the efficiency of technical analysis in forecasting stock returns. However, its relation- ships with other technic...The decomposition-based vector autoregressive model (DVAR) provides a new framework for scrutinizing the efficiency of technical analysis in forecasting stock returns. However, its relation- ships with other technical indicators still remain unknown. This paper investigates the relationships of DVAR model with the Japanese Candlestick indicators using simulations, theoretical explanations and empirical studies. The main finding of this paper is that both lower and upper shadows in Japanese Candlestick Granger contribute to the DVAR model explanation power, and thus, providing useful information for improving the DVAR forecasts. This finding makes sense as it means that the infor- mation contained in the lower and upper shadows should be used when modeling the stock returns with DVAR. Empirical studies performed on China SSEC stock index demonstrate that DVAR model with upper and lower shadows as exogenous variables does have informative and valuable out-of-sample forecasts.展开更多
Let {X(t), t ≥ 0} be a centered stationary Gaussian process with correlation r(t)such that 1-r(t) is asymptotic to a regularly varying function. With T being a nonnegative random variable and independent of X(t), the...Let {X(t), t ≥ 0} be a centered stationary Gaussian process with correlation r(t)such that 1-r(t) is asymptotic to a regularly varying function. With T being a nonnegative random variable and independent of X(t), the exact asymptotics of P(sup_(t∈[0,T])X(t) > x) is considered, as x → ∞.展开更多
Dear Editor, The main components of multi-view geometry and computer vision are robust pose estimation and feature matching. This letter discusses how to recover two-view geometry and match features between a pair of ...Dear Editor, The main components of multi-view geometry and computer vision are robust pose estimation and feature matching. This letter discusses how to recover two-view geometry and match features between a pair of images, and presents MCNet(a multiscale clustering network) as an algorithm for extracting multiscale features. It can identify the true inliers from the established putative correspondences, where outliers may degenerate the geometry estimation. In particular, the proposed MCNet is based on graph clustering.展开更多
In this paper, a class of non-autonomous functional integro-differential stochastic equations in a real separable Hilbert space is studied. When the operators A(t) satisfy Acquistapace-Terreni conditions, and with s...In this paper, a class of non-autonomous functional integro-differential stochastic equations in a real separable Hilbert space is studied. When the operators A(t) satisfy Acquistapace-Terreni conditions, and with some suitable assumptions, the existence and uniqueness of a square-mean almost periodic mild solution to the equations are obtained.展开更多
Aims In ecology and conservation biology,the number of species counted in a biodiversity study is a key metric but is usually a biased underestimate of total species richness because many rare species are not detected...Aims In ecology and conservation biology,the number of species counted in a biodiversity study is a key metric but is usually a biased underestimate of total species richness because many rare species are not detected.Moreover,comparing species richness among sites or samples is a statistical challenge because the observed number of species is sensitive to the number of individuals counted or the area sampled.For individual-based data,we treat a single,empirical sample of species abundances from an investigator-defined species assemblage or community as a reference point for two estimation objectives under two sampling models:estimating the expected number of species(and its unconditional variance)in a random sample of(i)a smaller number of individuals(multinomial model)or a smaller area sampled(Poisson model)and(ii)a larger number of individuals or a larger area sampled.For sample-based incidence(presence–absence)data,under a Bernoulli product model,we treat a single set of species incidence frequencies as the reference point to estimate richness for smaller and larger numbers of sampling units.Methods The first objective is a problem in interpolation that we address with classical rarefaction(multinomial model)and Coleman rarefaction(Poisson model)for individual-based data and with sample-based rarefaction(Bernoulli product model)for incidence frequencies.The second is a problem in extrapolation that we address with sampling-theoretic predictors for the number of species in a larger sample(multinomial model),a larger area(Poisson model)or a larger number of sampling units(Bernoulli product model),based on an estimate of asymptotic species richness.Although published methods exist for many of these objectives,we bring them together here with some new estimators under a unified statistical and notational framework.This novel integration of mathematically distinct approaches allowed us to link interpolated(rarefaction)curves and extrapolated curves to plot a unified species accumulation curve for empirical examples.We provide new,unconditional variance estimators for classical,individual-based rarefaction and for Coleman rarefaction,long missing from the toolkit of biodiversity measurement.We illustrate these methods with datasets for tropical beetles,tropical trees and tropical ants.Important Findings Surprisingly,for all datasets we examined,the interpolation(rarefaction)curve and the extrapolation curve meet smoothly at the reference sample,yielding a single curve.Moreover,curves representing 95%confidence intervals for interpolated and extrapolated richness estimates also meet smoothly,allowing rigorous statistical comparison of samples not only for rarefaction but also for extrapolated richness values.The confidence intervals widen as the extrapolation moves further beyond the reference sample,but the method gives reasonable results for extrapolations up to about double or triple the original abundance or area of the reference sample.We found that the multinomial and Poisson models produced indistinguishable results,in units of estimated species,for all estimators and datasets.For sample-based abundance data,which allows the comparison of all three models,the Bernoulli product model generally yields lower richness estimates for rarefied data than either the multinomial or the Poisson models because of the ubiquity of non-random spatial distributions in nature.展开更多
This paper considers the monotonic transformation model with an unspecified transformation function and an unknown error function, and gives its monotone rank estimation with length-biased and rightcensored data. The ...This paper considers the monotonic transformation model with an unspecified transformation function and an unknown error function, and gives its monotone rank estimation with length-biased and rightcensored data. The estimator is shown to be√n-consistent and asymptotically normal. Numerical simulation studies reveal good finite sample performance and the estimator is illustrated with the Oscar data set. The variance can be estimated by a resampling method via perturbing the U-statistics objective function repeatedly.展开更多
This paper provides an estimation procedure for average treatment effect through a random coefficient dummy endogenous variable model. A leading example of the model is estimating the effect of a training program on e...This paper provides an estimation procedure for average treatment effect through a random coefficient dummy endogenous variable model. A leading example of the model is estimating the effect of a training program on earnings. The model is composed of two equations:an outcome equation and a decision equation.Given the linear restriction in outcome and decision equations,Chen(1999) provided a distribution-free estimation procedure under conditional symmetric error distributions. In this paper we extend Chen's estimator by relaxing the linear index into a nonparametric function,which greatly reduces the risk of model misspecification. A two-step approach is proposed:the first step uses a nonparametric regression estimator for the decision variable,and the second step uses an instrumental variables approach to estimate average treatment effect in the outcome equation. The proposed estimator is shown to be consistent and asymptotically normally distributed. Furthermore,we investigate the finite performance of our estimator by a Monte Carlo study and also use our estimator to study the return of college education in different periods of China. The estimates seem more reasonable than those of other commonly used estimators.展开更多
In this article, we study estimation of a partially specified spatial panel data linear regression with random-effects. Under the conditions of exogenous spatial weighting matrix and exogenous regressors, we give an i...In this article, we study estimation of a partially specified spatial panel data linear regression with random-effects. Under the conditions of exogenous spatial weighting matrix and exogenous regressors, we give an instrumental variable estimation. Under certain sufficient assumptions, we show that the proposed estimator for the finite dimensional parameter is root-N consistent and asymptotically normally distributed and the proposed estimator for the unknown function is consistent and asymptotically distributed. Consistent estimators for the asymptotic variance-covariance matrices of both the parametric and unknown components are provided. The Monte Carlo simulation results verify our theory and suggest that the approach has some practical value.展开更多
The principal contradiction facing the Chinese society has evolved to be that between imbalanced and inadequate development and the people’s ever-growing needs for a better life.Given China’s vision for achieving mo...The principal contradiction facing the Chinese society has evolved to be that between imbalanced and inadequate development and the people’s ever-growing needs for a better life.Given China’s vision for achieving moderate prosperity,it is relevant to conduct theoretical and empirical studies on the nation’s development imbalances.As a quantitative index,the Tsinghua China Balanced Development Index measures the extent to which development is uneven and insuf ficient across regions,re flecting the progress and shortfalls in China’s efforts to promote balanced development.Our findings provide implications for how policymakers may help people’s expectations for a better life materialize by spurring balanced economic,social,environmental and livelihood development across regions.展开更多
It is important to understand the geometry of genome space in biology.After transforming genome sequences into frequency matrices of the chaos game representation(FCGR),we regard a genome sequence as a point in a suit...It is important to understand the geometry of genome space in biology.After transforming genome sequences into frequency matrices of the chaos game representation(FCGR),we regard a genome sequence as a point in a suitable Grassmann manifold by analyzing the column space of the corresponding FCGR.To assess the sequence similarity,we employ the generalized Grassmannian distance,an intrinsic geometric distance that differs from the traditional Euclidean distance used in the classical k-mer frequency-based methods.With this method,we constructed phylogenetic trees for various genome datasets,including influenza A virus hemagglutinin gene,Orthocoronavirinae genome,and SARS-CoV-2 complete genome sequences.Our comparative analysis with multiple sequence alignment and alignment-free methods for large-scale sequences revealed that our method,which employs the subspace distance between the column spaces of different FCGRs(FCGR-SD),outperformed its competitors in terms of both speed and accuracy.In addition,we used low-dimensional visualization of the SARS-CoV-2 genome sequences and spike protein nucleotide sequences with our methods,resulting in some intriguing findings.We not only propose a novel and efficient algorithm for comparing genome sequences but also demonstrate that genome data have some intrinsic manifold structures,providing a new geometric perspective for molecular biology studies.展开更多
In this paper, the authors generalize the concept of asymptotically almost negatively associated random variables from the classic probability space to the upper expectation space. Within the framework, the authors pr...In this paper, the authors generalize the concept of asymptotically almost negatively associated random variables from the classic probability space to the upper expectation space. Within the framework, the authors prove some different types of Rosenthal's inequalities for sub-additive expectations. Finally, the authors prove a strong law of large numbers as the application of Rosenthal's inequalities.展开更多
Length-biased data arise in many important fields, including epidemiological cohort studies, cancer screening trials and labor economics. Analysis of such data has attracted much attention in the literature. In this p...Length-biased data arise in many important fields, including epidemiological cohort studies, cancer screening trials and labor economics. Analysis of such data has attracted much attention in the literature. In this paper we propose a quantile regression approach for analyzing right-censored and length-biased data. We derive an inverse probability weighted estimating equation corresponding to the quantile regression to correct the bias due to length-bias sampling and informative censoring. This method can easily handle informative censoring induced by length-biased sampling. This is an appealing feature of our proposed method since it is generally difficult to obtain unbiased estimates of risk factors in the presence of length-bias and informative censoring. We establish the consistency and asymptotic distribution of the proposed estimator using empirical process techniques. A resampling method is adopted to estimate the variance of the estimator. We conduct simulation studies to evaluate its finite sample performance and use a real data set to illustrate the application of the proposed method.展开更多
This paper considers the estimation of a Box-Cox transformation model with varying coefficient. A two-step approach is proposed in which the first step estimates the varying coefficients nonparametrically for any give...This paper considers the estimation of a Box-Cox transformation model with varying coefficient. A two-step approach is proposed in which the first step estimates the varying coefficients nonparametrically for any given parameter a in the transformation function. Then a one-dimensional search of a has been employed based on some least absolute deviation criterion function. The validity of our estimator does not require independence assumption thus is robust to the conditional heteroscedasticity. A simulation study shows a reasonably well finite sample performance. Additionally, a comprehensive empirical study has been carefully examined.展开更多
Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiase...Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiased sampling, and propose a new semiparametric-model-based approach to estimate quantile differences of failure time. We establish the asymptotic properties of our new estimators theoretically under mild technical conditions, and propose a resampling method for estimating their asymptotic variance. We then conduct simulations to evaluate the empirical performance and efficiency of the proposed estimators, and demonstrate their application by a real data analysis.展开更多
This article proposes a simple nonparametric estimator of quantile residual lifetime function under left-truncated and right-censored data. The asymptotic consistency and normality of this estimator are proved and the...This article proposes a simple nonparametric estimator of quantile residual lifetime function under left-truncated and right-censored data. The asymptotic consistency and normality of this estimator are proved and the variance expression is calculated. Two bootstrap procedures are employed in the simulation study,where the latter bootstrap from Zeng and Lin(2008) is 4000 times faster than the former naive one, and the numerical results in both methods show that our estimating approach works well. A real data example is used to illustrate its application.展开更多
The literature generally agrees that longer-horizon(over a month) predictions make more sense than short-horizon ones. However, it's an especially challenging task due to the lack of data(in unit of long horizon)a...The literature generally agrees that longer-horizon(over a month) predictions make more sense than short-horizon ones. However, it's an especially challenging task due to the lack of data(in unit of long horizon)and economic data have a low S/N ratio. We hypothesize that the stock trend is largely dictated by driving factors which are filtered by psychological factors and work on behavioral factors: representative indicators from these three aspects would be adequate in trend prediction. We then extend the Stepwise Regression Analysis(SRA)algorithm to constrained SRA(c SRA) to carry out a further feature selection and lag optimization. During modeling stage, we introduce the Deep Neural Network(DNN) model in stock prediction under the suspicion that economic interactions are too complex for shallow networks to capture. Our experiments indeed show that deep structures generally perform better than shallow ones. Instead of comparing to a kitchen sink model, where over-fitting can easily happen with a shortage of data, we turn around and use a model ensemble approach which indirectly demonstrates our proposed method is adequate.展开更多
In this paper, the multivariate linear model Y = XB+e, e ~ Nm×k(0, ImΣ) is considered from the Bayes perspective. Under the normal-inverse Wishart prior for (BΣ), the Bayes estimators are derived. The sup...In this paper, the multivariate linear model Y = XB+e, e ~ Nm×k(0, ImΣ) is considered from the Bayes perspective. Under the normal-inverse Wishart prior for (BΣ), the Bayes estimators are derived. The superiority of the Bayes estimators of B and Σ over the least squares estimators under the criteria of Bayes mean squared error (BMSE) and Bayes mean squared error matrix (BMSEM) is shown. In addition, the Pitman Closeness (PC) criterion is also included to investigate the superiority of the Bayes estimator of B.展开更多
This paper is concerned with tile proOlenl or improving hue ~lma^u~ u~ under Stein's loss. By the partial Iwasawa coordinates of covariance matrix, the corresponding risk can be split into three parts. One can use th...This paper is concerned with tile proOlenl or improving hue ~lma^u~ u~ under Stein's loss. By the partial Iwasawa coordinates of covariance matrix, the corresponding risk can be split into three parts. One can use the information in the weighted matrix of weighted quadratic loss to improve one part of risk. However, this paper indirectly takes advantage of the information in the sample mean and reuses Iwasawa coordinates to improve the rest of risk. It is worth mentioning that the process above can be repeated. Finally, a Monte Carlo simulation study is carried out to verify the theoretical results.展开更多
The varying-coefficient model is flexible and powerful for modeling the dynamic changes of regression coefficients. We study the problem of variable selection and estimation in this model in the sparse, high- dimensio...The varying-coefficient model is flexible and powerful for modeling the dynamic changes of regression coefficients. We study the problem of variable selection and estimation in this model in the sparse, high- dimensional case. We develop a concave group selection approach for this problem using basis function expansion and study its theoretical and empirical properties. We also apply the group Lasso for variable selection and estimation in this model and study its properties. Under appropriate conditions, we show that the group least absolute shrinkage and selection operator (Lasso) selects a model whose dimension is comparable to the underlying mode], regardless of the large number of unimportant variables. In order to improve the selection results, we show that the group minimax concave penalty (MCP) has the oracle selection property in the sense that it correctly selects important variables with probability converging to one under suitable conditions. By comparison, the group Lasso does not have the oracle selection property. In the simulation parts, we apply the group Lasso and the group MCP. At the same time, the two approaches are evaluated using simulation and demonstrated on a data example.展开更多
Receiver operating characteristic (ROC) curve is often used to study and compare two- sample problems in medicine. When more information may be available on one treatment than the other, one can improve estimator of...Receiver operating characteristic (ROC) curve is often used to study and compare two- sample problems in medicine. When more information may be available on one treatment than the other, one can improve estimator of ROC curve if the auxiliary population information is taken into account. The authors show that the empirical likelihood method can be naturally adapted to make efficient use of the auxiliary information to such problems. The authors propose a smoothed empirical likelihood estimator for ROC curve with some auxiliary information in medical studies. The proposed estimates are more efficient than those ROC estimators without any auxiliary information, in the sense of comparing asymptotic variances and mean squared error (MSE). Some asymptotic properties for the empirical likelihood estimation of ROC curve are established. A simulation study is presented to demonstrate the performance of the proposed estimators.展开更多
基金supported by the National Natural Science Foundation of China under Grant No.71401033
文摘The decomposition-based vector autoregressive model (DVAR) provides a new framework for scrutinizing the efficiency of technical analysis in forecasting stock returns. However, its relation- ships with other technical indicators still remain unknown. This paper investigates the relationships of DVAR model with the Japanese Candlestick indicators using simulations, theoretical explanations and empirical studies. The main finding of this paper is that both lower and upper shadows in Japanese Candlestick Granger contribute to the DVAR model explanation power, and thus, providing useful information for improving the DVAR forecasts. This finding makes sense as it means that the infor- mation contained in the lower and upper shadows should be used when modeling the stock returns with DVAR. Empirical studies performed on China SSEC stock index demonstrate that DVAR model with upper and lower shadows as exogenous variables does have informative and valuable out-of-sample forecasts.
基金Supported by the Scientific Research Fund of Sichuan Provincial Education Department(12ZB082)the Scientific research cultivation project of Sichuan University of Science&Engineering(2013PY07)+1 种基金the Scientific Research Fund of Shanghai University of Finance and Economics(2017110080)the Opening Project of Sichuan Province University Key Laboratory of Bridge Non-destruction Detecting and Engineering Computing(2018QZJ01)
文摘Let {X(t), t ≥ 0} be a centered stationary Gaussian process with correlation r(t)such that 1-r(t) is asymptotic to a regularly varying function. With T being a nonnegative random variable and independent of X(t), the exact asymptotics of P(sup_(t∈[0,T])X(t) > x) is considered, as x → ∞.
基金supported by the National Natural Science Foundation of China(61703260,62173252)。
文摘Dear Editor, The main components of multi-view geometry and computer vision are robust pose estimation and feature matching. This letter discusses how to recover two-view geometry and match features between a pair of images, and presents MCNet(a multiscale clustering network) as an algorithm for extracting multiscale features. It can identify the true inliers from the established putative correspondences, where outliers may degenerate the geometry estimation. In particular, the proposed MCNet is based on graph clustering.
基金Acknowledgement This article is funded by the National Natural Science Foundation of China (11161052), Guangxi Natural Science Foundation of China (201 ljjA10044) and Guangxi Education Hall Project (201012MS183)
文摘In this paper, a class of non-autonomous functional integro-differential stochastic equations in a real separable Hilbert space is studied. When the operators A(t) satisfy Acquistapace-Terreni conditions, and with some suitable assumptions, the existence and uniqueness of a square-mean almost periodic mild solution to the equations are obtained.
基金US National Science Foundation(DEB 0639979 and DBI 0851245 to R.K.C.DEB-0541936 to N.J.G.+4 种基金DEB-0424767 and DEB-0639393 to R.L.C.DEB-0640015 to J.T.L.)the US Department of Energy(022821 to N.J.G.)the Taiwan National Science Council(97-2118-M007-MY3 to A.C.)and the University of Connecticut Research Foundation(to R.L.C.).
文摘Aims In ecology and conservation biology,the number of species counted in a biodiversity study is a key metric but is usually a biased underestimate of total species richness because many rare species are not detected.Moreover,comparing species richness among sites or samples is a statistical challenge because the observed number of species is sensitive to the number of individuals counted or the area sampled.For individual-based data,we treat a single,empirical sample of species abundances from an investigator-defined species assemblage or community as a reference point for two estimation objectives under two sampling models:estimating the expected number of species(and its unconditional variance)in a random sample of(i)a smaller number of individuals(multinomial model)or a smaller area sampled(Poisson model)and(ii)a larger number of individuals or a larger area sampled.For sample-based incidence(presence–absence)data,under a Bernoulli product model,we treat a single set of species incidence frequencies as the reference point to estimate richness for smaller and larger numbers of sampling units.Methods The first objective is a problem in interpolation that we address with classical rarefaction(multinomial model)and Coleman rarefaction(Poisson model)for individual-based data and with sample-based rarefaction(Bernoulli product model)for incidence frequencies.The second is a problem in extrapolation that we address with sampling-theoretic predictors for the number of species in a larger sample(multinomial model),a larger area(Poisson model)or a larger number of sampling units(Bernoulli product model),based on an estimate of asymptotic species richness.Although published methods exist for many of these objectives,we bring them together here with some new estimators under a unified statistical and notational framework.This novel integration of mathematically distinct approaches allowed us to link interpolated(rarefaction)curves and extrapolated curves to plot a unified species accumulation curve for empirical examples.We provide new,unconditional variance estimators for classical,individual-based rarefaction and for Coleman rarefaction,long missing from the toolkit of biodiversity measurement.We illustrate these methods with datasets for tropical beetles,tropical trees and tropical ants.Important Findings Surprisingly,for all datasets we examined,the interpolation(rarefaction)curve and the extrapolation curve meet smoothly at the reference sample,yielding a single curve.Moreover,curves representing 95%confidence intervals for interpolated and extrapolated richness estimates also meet smoothly,allowing rigorous statistical comparison of samples not only for rarefaction but also for extrapolated richness values.The confidence intervals widen as the extrapolation moves further beyond the reference sample,but the method gives reasonable results for extrapolations up to about double or triple the original abundance or area of the reference sample.We found that the multinomial and Poisson models produced indistinguishable results,in units of estimated species,for all estimators and datasets.For sample-based abundance data,which allows the comparison of all three models,the Bernoulli product model generally yields lower richness estimates for rarefied data than either the multinomial or the Poisson models because of the ubiquity of non-random spatial distributions in nature.
基金supported by Graduate Innovation Foundation of Shanghai University of Finance and Economics(Grant No.CXJJ2013-451)Cultivation Foundation of Excellent Doctor Degree Dissertation of Shanghai University of Finance and Economics(Grant No.YBPY201504)+4 种基金Program of Educational Department of Fujian Province(Grant Nos.JA14079 and JA12060)Natural Science Foundation of Fujian Province(Grant Nos.2014J01001 and 2012J01028)National Natural Science Foundation of China(Grant No.71271128)the State Key Program of National Natural Science Foundation of China(Grant No.71331006)National Center for Mathematics and Interdisciplinary Sciences,Key Laboratory of Random Complex Structures and Data Science,Chinese Academy of Sciences and Shanghai First-class Discipline A and Innovative Research Team of Shanghai University of Finance and Economics,Program for Changjiang Scholars Innovative Research Team of Ministry of Education(Grant No.IRT13077)
文摘This paper considers the monotonic transformation model with an unspecified transformation function and an unknown error function, and gives its monotone rank estimation with length-biased and rightcensored data. The estimator is shown to be√n-consistent and asymptotically normal. Numerical simulation studies reveal good finite sample performance and the estimator is illustrated with the Oscar data set. The variance can be estimated by a resampling method via perturbing the U-statistics objective function repeatedly.
基金supported by National Natural Science Foundation of China(GrantNo.71171127)the Construction Program of Elaborate Course for Advanced Econometrics Ⅱ of ShanghaiUniversity of Finance and Economics
文摘This paper provides an estimation procedure for average treatment effect through a random coefficient dummy endogenous variable model. A leading example of the model is estimating the effect of a training program on earnings. The model is composed of two equations:an outcome equation and a decision equation.Given the linear restriction in outcome and decision equations,Chen(1999) provided a distribution-free estimation procedure under conditional symmetric error distributions. In this paper we extend Chen's estimator by relaxing the linear index into a nonparametric function,which greatly reduces the risk of model misspecification. A two-step approach is proposed:the first step uses a nonparametric regression estimator for the decision variable,and the second step uses an instrumental variables approach to estimate average treatment effect in the outcome equation. The proposed estimator is shown to be consistent and asymptotically normally distributed. Furthermore,we investigate the finite performance of our estimator by a Monte Carlo study and also use our estimator to study the return of college education in different periods of China. The estimates seem more reasonable than those of other commonly used estimators.
基金supported by National Natural Science Foundation of China(Grant Nos.71371118,71471117)Plateau and Peak Disciplines of Shanghai-Business Management Research Team+3 种基金National Social Science Fund of China(Grant No.14BJY012)Program for Changjiang Scholars and Innovative Research Team in University(Grant No.PCSIRTIRT13077)the State Key Program of National Natural Science of China(Grant No.71331006)supported by National Nature Science Foundation of China(Grant Nos.11101442,11471086)
文摘In this article, we study estimation of a partially specified spatial panel data linear regression with random-effects. Under the conditions of exogenous spatial weighting matrix and exogenous regressors, we give an instrumental variable estimation. Under certain sufficient assumptions, we show that the proposed estimator for the finite dimensional parameter is root-N consistent and asymptotically normally distributed and the proposed estimator for the unknown function is consistent and asymptotically distributed. Consistent estimators for the asymptotic variance-covariance matrices of both the parametric and unknown components are provided. The Monte Carlo simulation results verify our theory and suggest that the approach has some practical value.
基金the final result of the “Tsinghua China Balanced Development Index” Project of the China Data CenterTsinghua University+1 种基金Sponsored by the Minshan Public-Interest Fund of the China Siyuan Foundation for Poverty Alleviation (CSFPA) with special sponsorship from the China Post-Doctoral Science Foundation (2018T110079)general sponsorship from the China Post-Doctoral Science Foundation (2017M620719)。
文摘The principal contradiction facing the Chinese society has evolved to be that between imbalanced and inadequate development and the people’s ever-growing needs for a better life.Given China’s vision for achieving moderate prosperity,it is relevant to conduct theoretical and empirical studies on the nation’s development imbalances.As a quantitative index,the Tsinghua China Balanced Development Index measures the extent to which development is uneven and insuf ficient across regions,re flecting the progress and shortfalls in China’s efforts to promote balanced development.Our findings provide implications for how policymakers may help people’s expectations for a better life materialize by spurring balanced economic,social,environmental and livelihood development across regions.
基金supported by the National Natural Science Foundation of China(12171275 and 12371270)the Shanghai Science and Technology Development Funds(23JC1402100)the Tsinghua University Education Foundation fund(042202008).
文摘It is important to understand the geometry of genome space in biology.After transforming genome sequences into frequency matrices of the chaos game representation(FCGR),we regard a genome sequence as a point in a suitable Grassmann manifold by analyzing the column space of the corresponding FCGR.To assess the sequence similarity,we employ the generalized Grassmannian distance,an intrinsic geometric distance that differs from the traditional Euclidean distance used in the classical k-mer frequency-based methods.With this method,we constructed phylogenetic trees for various genome datasets,including influenza A virus hemagglutinin gene,Orthocoronavirinae genome,and SARS-CoV-2 complete genome sequences.Our comparative analysis with multiple sequence alignment and alignment-free methods for large-scale sequences revealed that our method,which employs the subspace distance between the column spaces of different FCGRs(FCGR-SD),outperformed its competitors in terms of both speed and accuracy.In addition,we used low-dimensional visualization of the SARS-CoV-2 genome sequences and spike protein nucleotide sequences with our methods,resulting in some intriguing findings.We not only propose a novel and efficient algorithm for comparing genome sequences but also demonstrate that genome data have some intrinsic manifold structures,providing a new geometric perspective for molecular biology studies.
基金supported by the National Natural Science Foundation of China(No.11601280)the Innovative Research Team of Shanghai University of Finance and Economics(No.13122402)
文摘In this paper, the authors generalize the concept of asymptotically almost negatively associated random variables from the classic probability space to the upper expectation space. Within the framework, the authors prove some different types of Rosenthal's inequalities for sub-additive expectations. Finally, the authors prove a strong law of large numbers as the application of Rosenthal's inequalities.
基金National Natural Science Funds for Distinguished Young Scholar (No. 70825004)Creative Research Groups of China (No. 10721101)+1 种基金Shanghai University of Finance and Economics Project 211 Phase ⅢShanghai Leading Academic Discipline Project (No. B803)
文摘Length-biased data arise in many important fields, including epidemiological cohort studies, cancer screening trials and labor economics. Analysis of such data has attracted much attention in the literature. In this paper we propose a quantile regression approach for analyzing right-censored and length-biased data. We derive an inverse probability weighted estimating equation corresponding to the quantile regression to correct the bias due to length-bias sampling and informative censoring. This method can easily handle informative censoring induced by length-biased sampling. This is an appealing feature of our proposed method since it is generally difficult to obtain unbiased estimates of risk factors in the presence of length-bias and informative censoring. We establish the consistency and asymptotic distribution of the proposed estimator using empirical process techniques. A resampling method is adopted to estimate the variance of the estimator. We conduct simulation studies to evaluate its finite sample performance and use a real data set to illustrate the application of the proposed method.
基金supported by National Natural Science Foundation of China(Grant Nos.71171127,71471108 and 71601105)the Open Project Program in the Key Laboratory of Mathematical Economics(SUFE)(Grant No.201309KF02)+2 种基金Ministry of Education of the People’s Republic of Chinathe Program for Changjiang Scholars and Innovative Research Team in Shanghai University of Finance and Economicsthe Innovative Research Team of Econometrics in Shanghai Academy of Social Sciences
文摘This paper considers the estimation of a Box-Cox transformation model with varying coefficient. A two-step approach is proposed in which the first step estimates the varying coefficients nonparametrically for any given parameter a in the transformation function. Then a one-dimensional search of a has been employed based on some least absolute deviation criterion function. The validity of our estimator does not require independence assumption thus is robust to the conditional heteroscedasticity. A simulation study shows a reasonably well finite sample performance. Additionally, a comprehensive empirical study has been carefully examined.
基金supported by National Natural Science Foundation of China(Grant No.11401603)the Fundamental Research Funds for the Central Universities(Grant No.QL 18009)+2 种基金Discipline Foundation of Central University of Finance and Economics(Grant No.CUFESAM201811)supported by the State Key Program of National Natural Science Foundation of China(Grant No.71331006)the State Key Program in the Major Research Plan of National Natural Science Foundation of China(Grant No.91546202)
文摘Prevalent cohort studies frequently involve length-biased and right-censored data, a fact that has drawn considerable attention in survival analysis. In this article, we consider survival data arising from lengthbiased sampling, and propose a new semiparametric-model-based approach to estimate quantile differences of failure time. We establish the asymptotic properties of our new estimators theoretically under mild technical conditions, and propose a resampling method for estimating their asymptotic variance. We then conduct simulations to evaluate the empirical performance and efficiency of the proposed estimators, and demonstrate their application by a real data analysis.
基金supported by National Natural Science Foundation of China(Grant No.71271128)the State Key Program of National Natural Science Foundation of China(Grant No.71331006)+2 种基金NCMIS and Shanghai University of Finance and Economics through Project 211 Phase IVShanghai Firstclass Discipline A,Outstanding Ph D Dissertation Cultivation Funds of Shanghai University of Finance and EconomicsGraduate Education Innovation Funds of Shanghai University of Finance and Economics(Grant No.CXJJ-2011-438)
文摘This article proposes a simple nonparametric estimator of quantile residual lifetime function under left-truncated and right-censored data. The asymptotic consistency and normality of this estimator are proved and the variance expression is calculated. Two bootstrap procedures are employed in the simulation study,where the latter bootstrap from Zeng and Lin(2008) is 4000 times faster than the former naive one, and the numerical results in both methods show that our estimating approach works well. A real data example is used to illustrate its application.
基金the National Natural Science Foundation of China(Nos.11501355 and 71571116)the Project of Knowledge Innovation Program of Shanghai Municipal Education Commission(No.15ZZ090)+2 种基金the 59th China Postdoctoral Sciences Foundation Funded Project(No.2016M591640)the Humanities and Social Sciences Research Project of Ministry of Education(No.15YJA790039)the National Social Science Foundation of China(No.15ZDA058)
文摘The literature generally agrees that longer-horizon(over a month) predictions make more sense than short-horizon ones. However, it's an especially challenging task due to the lack of data(in unit of long horizon)and economic data have a low S/N ratio. We hypothesize that the stock trend is largely dictated by driving factors which are filtered by psychological factors and work on behavioral factors: representative indicators from these three aspects would be adequate in trend prediction. We then extend the Stepwise Regression Analysis(SRA)algorithm to constrained SRA(c SRA) to carry out a further feature selection and lag optimization. During modeling stage, we introduce the Deep Neural Network(DNN) model in stock prediction under the suspicion that economic interactions are too complex for shallow networks to capture. Our experiments indeed show that deep structures generally perform better than shallow ones. Instead of comparing to a kitchen sink model, where over-fitting can easily happen with a shortage of data, we turn around and use a model ensemble approach which indirectly demonstrates our proposed method is adequate.
基金Supported by National Natural Science Foundation of China(Grant Nos.11201005,11071015)the Foundation of National Bureau of Statistics(Grant No.2013LZ17)the Natural Science Foundation of Anhui Province(Grant No.1308085QA13)
文摘In this paper, the multivariate linear model Y = XB+e, e ~ Nm×k(0, ImΣ) is considered from the Bayes perspective. Under the normal-inverse Wishart prior for (BΣ), the Bayes estimators are derived. The superiority of the Bayes estimators of B and Σ over the least squares estimators under the criteria of Bayes mean squared error (BMSE) and Bayes mean squared error matrix (BMSEM) is shown. In addition, the Pitman Closeness (PC) criterion is also included to investigate the superiority of the Bayes estimator of B.
基金supported by the National Natural Science Foundation of China under Grant No.11371236the Graduate Student Innovation Foundation of Shanghai University of Finance and Economics(CXJJ-2015-440)
文摘This paper is concerned with tile proOlenl or improving hue ~lma^u~ u~ under Stein's loss. By the partial Iwasawa coordinates of covariance matrix, the corresponding risk can be split into three parts. One can use the information in the weighted matrix of weighted quadratic loss to improve one part of risk. However, this paper indirectly takes advantage of the information in the sample mean and reuses Iwasawa coordinates to improve the rest of risk. It is worth mentioning that the process above can be repeated. Finally, a Monte Carlo simulation study is carried out to verify the theoretical results.
基金supported by National Natural Science Foundation of China(GrantNos.71271128 and 11101442)the State Key Program of National Natural Science Foundation of China(GrantNo.71331006)+2 种基金National Center for Mathematics and Interdisciplinary Sciences(NCMIS)Shanghai Leading Academic Discipline Project A,in Ranking Top of Shanghai University of Finance and Economics(IRTSHUFE)Scientific Research Innovation Fund for PhD Studies(Grant No.CXJJ-2011-434)
文摘The varying-coefficient model is flexible and powerful for modeling the dynamic changes of regression coefficients. We study the problem of variable selection and estimation in this model in the sparse, high- dimensional case. We develop a concave group selection approach for this problem using basis function expansion and study its theoretical and empirical properties. We also apply the group Lasso for variable selection and estimation in this model and study its properties. Under appropriate conditions, we show that the group least absolute shrinkage and selection operator (Lasso) selects a model whose dimension is comparable to the underlying mode], regardless of the large number of unimportant variables. In order to improve the selection results, we show that the group minimax concave penalty (MCP) has the oracle selection property in the sense that it correctly selects important variables with probability converging to one under suitable conditions. By comparison, the group Lasso does not have the oracle selection property. In the simulation parts, we apply the group Lasso and the group MCP. At the same time, the two approaches are evaluated using simulation and demonstrated on a data example.
基金This research was partially supported by National Natural Science Funds for Distinguished Young Scholar under Grant No. 70825004 and National Natural Science Foundation of China (NSFC) under Grant No. 10731010, the National Basic Research Program under Grant No. 2007CB814902, Creative Research Groups of China under Grant No.10721101 and Shanghai University of Finance and Economics through Project 211 Phase III and Shanghai Leading Academic Discipline Project under Grant No. B803.
文摘Receiver operating characteristic (ROC) curve is often used to study and compare two- sample problems in medicine. When more information may be available on one treatment than the other, one can improve estimator of ROC curve if the auxiliary population information is taken into account. The authors show that the empirical likelihood method can be naturally adapted to make efficient use of the auxiliary information to such problems. The authors propose a smoothed empirical likelihood estimator for ROC curve with some auxiliary information in medical studies. The proposed estimates are more efficient than those ROC estimators without any auxiliary information, in the sense of comparing asymptotic variances and mean squared error (MSE). Some asymptotic properties for the empirical likelihood estimation of ROC curve are established. A simulation study is presented to demonstrate the performance of the proposed estimators.