We forecast realized volatilities by developing a time-varying heterogeneous autoregressive(HAR)latent factor model with dynamic model average(DMA)and dynamic model selection(DMS)approaches.The number of latent factor...We forecast realized volatilities by developing a time-varying heterogeneous autoregressive(HAR)latent factor model with dynamic model average(DMA)and dynamic model selection(DMS)approaches.The number of latent factors is determined using Chan and Grant's(2016)deviation information criteria.The predictors in our model include lagged daily,weekly,and monthly volatility variables,the corresponding volatility factors,and a speculation variable.In addition,the time-varying properties of the best-performing DMA(DMS)-HAR-2FX models,including size,inclusion probabilities,and coefficients,are examined.We find that the proposed DMA(DMS)-HAR-2FX model outperforms the competing models for both in-sample and out-of-sample forecasts.Furthermore,the speculation variable displays strong predictability for forecasting the realized volatility of financial futures in China.展开更多
A weed is a plant that thrives in areas of human disturbance, such as gardens, fields, pastures, waysides, and waste places where it is not intentionally cultivated. Dispersal affects community dynamics and vegetation...A weed is a plant that thrives in areas of human disturbance, such as gardens, fields, pastures, waysides, and waste places where it is not intentionally cultivated. Dispersal affects community dynamics and vegetation response to global change. The process of seed disposal is influenced by wind, which plays a crucial role in determining the distance and probability of seed dispersal. Existing models of seed dispersal consider wind direction but fail to incorporate wind intensity. In this paper, a novel seed disposal model was proposed in this paper, incorporating wind intensity based on relevant references. According to various climatic conditions, including temperate, arid, and tropical regions, three specific regions were selected to establish a wind dispersal model that accurately reflects the density function distribution of dispersal distance. Additionally, dandelions growth is influenced by a multitude of factors, encompassing temperature, humidity, climate, and various environmental variables that necessitate meticulous consideration. Based on Factor Analysis model, which completely considers temperature, precipitation, solar radiation, wind, and land carrying capacity, a conclusion is presented, indicating that the growth of seeds is primarily influenced by plant attributes and climate conditions, with the former exerting a relatively stronger impact. Subsequently, the remaining two plants were chosen based on seed weight, yielding consistent conclusion.展开更多
In this editorial,we comment on the article by Chen et al.We specifically focus on the risk factors,prognostic factors,and management of brain metastasis(BM)in breast cancer(BC).BC is the second most common cancer to ...In this editorial,we comment on the article by Chen et al.We specifically focus on the risk factors,prognostic factors,and management of brain metastasis(BM)in breast cancer(BC).BC is the second most common cancer to have BM after lung cancer.Independent risk factors for BM in BC are:HER-2 positive BC,triplenegative BC,and germline BRCA mutation.Other factors associated with BM are lung metastasis,age less than 40 years,and African and American ancestry.Even though risk factors associated with BM in BC are elucidated,there is a lack of data on predictive models for BM in BC.Few studies have been made to formulate predictive models or nomograms to address this issue,where age,grade of tumor,HER-2 receptor status,and number of metastatic sites(1 vs>1)were predictive of BM in metastatic BC.However,none have been used in clinical practice.National Comprehensive Cancer Network recommends screening of BM in advanced BC only when the patient is symptomatic or suspicious of central nervous system symptoms;routine screening for BM in BC is not recommended in the guidelines.BM decreases the quality of life and will have a significant psychological impact.Further studies are required for designing validated nomograms or predictive models for BM in BC;these models can be used in the future to develop treatment approaches to prevent BM,which improves the quality of life and overall survival.展开更多
Latent factor models have become a workhorse for a large number of recommender systems. While these sys- tems are built using ratings data, which is typically assumed static, the ability to incorporate different kinds...Latent factor models have become a workhorse for a large number of recommender systems. While these sys- tems are built using ratings data, which is typically assumed static, the ability to incorporate different kinds of subsequent user feedback is an important asset. For instance, the user might want to provide additional information to the system in order to improve his personal recommendations. To this end, we examine a novel scheme for efficiently learning (or refining) user parameters from such feedback. We propose a scheme where users are presented with a sequence of pair- wise preference questions: "Do you prefer item A over B?" User parameters are updated based on their response, and subsequent questions are chosen adaptively after incorporat- ing the feedback. We operate in a Bayesian framework and the choice of questions is based on an information gain cri- terion. We validate the scheme on the Netflix movie ratings data set and a proprietary television viewership data set. A user study and automated experiments validate our findings.展开更多
This paper provides a survey on recent developments in structural changes for high dimensional factor models. Compared with conventional low-dimensional time series, structural changes in factor models are more compli...This paper provides a survey on recent developments in structural changes for high dimensional factor models. Compared with conventional low-dimensional time series, structural changes in factor models are more complicated due to the unobservability of factors and factor loadings. The following topics are covered in this survey: the identification conditions for the structural changes in the factor loadings, different impacts of big and small breaks in factor models, tests for structural changes in the factor loadings of a specific variable, tests for structural changes in the factor loading matrix, joint tests for structural changes in the factor loadings and coefficients in factor-augmented regressions, tests for smooth changes in the factor loadings, estimation of break dates, and model selection in factor models with structural changes via the shrinkage method.展开更多
Linear factor models are familiar tools used in many fields.Several pioneering literatures established foundational theoretical results of the quasi-maximum likelihood estimator for high-dimensional linear factor mode...Linear factor models are familiar tools used in many fields.Several pioneering literatures established foundational theoretical results of the quasi-maximum likelihood estimator for high-dimensional linear factor models.Their results are based on a critical assumption:The error variance estimators are uniformly bounded in probability.Instead of making such an assumption,we provide a rigorous proof of this result under some mild conditions.展开更多
For the class of(partially specified)internal risk factor models we establish strongly simplified supermodular ordering results in comparison to the case of general risk factor models.This allows us to derive meaningf...For the class of(partially specified)internal risk factor models we establish strongly simplified supermodular ordering results in comparison to the case of general risk factor models.This allows us to derive meaningful and improved risk bounds for the joint portfolio in risk factor models with dependence information given by constrained specification sets for the copulas of the risk components and the systemic risk factor.The proof of our main comparison result is not standard.It is based on grid copula approximation of upper products of copulas and on the theory of mass transfers.An application to real market data shows considerable improvement over the standard method.展开更多
High-dimensional and sparse(HiDS)matrices commonly arise in various industrial applications,e.g.,recommender systems(RSs),social networks,and wireless sensor networks.Since they contain rich information,how to accurat...High-dimensional and sparse(HiDS)matrices commonly arise in various industrial applications,e.g.,recommender systems(RSs),social networks,and wireless sensor networks.Since they contain rich information,how to accurately represent them is of great significance.A latent factor(LF)model is one of the most popular and successful ways to address this issue.Current LF models mostly adopt L2-norm-oriented Loss to represent an HiDS matrix,i.e.,they sum the errors between observed data and predicted ones with L2-norm.Yet L2-norm is sensitive to outlier data.Unfortunately,outlier data usually exist in such matrices.For example,an HiDS matrix from RSs commonly contains many outlier ratings due to some heedless/malicious users.To address this issue,this work proposes a smooth L1-norm-oriented latent factor(SL-LF)model.Its main idea is to adopt smooth L1-norm rather than L2-norm to form its Loss,making it have both strong robustness and high accuracy in predicting the missing data of an HiDS matrix.Experimental results on eight HiDS matrices generated by industrial applications verify that the proposed SL-LF model not only is robust to the outlier data but also has significantly higher prediction accuracy than state-of-the-art models when they are used to predict the missing data of HiDS matrices.展开更多
Latent factor(LF) models are highly effective in extracting useful knowledge from High-Dimensional and Sparse(HiDS) matrices which are commonly seen in various industrial applications. An LF model usually adopts itera...Latent factor(LF) models are highly effective in extracting useful knowledge from High-Dimensional and Sparse(HiDS) matrices which are commonly seen in various industrial applications. An LF model usually adopts iterative optimizers,which may consume many iterations to achieve a local optima,resulting in considerable time cost. Hence, determining how to accelerate the training process for LF models has become a significant issue. To address this, this work proposes a randomized latent factor(RLF) model. It incorporates the principle of randomized learning techniques from neural networks into the LF analysis of HiDS matrices, thereby greatly alleviating computational burden. It also extends a standard learning process for randomized neural networks in context of LF analysis to make the resulting model represent an HiDS matrix correctly.Experimental results on three HiDS matrices from industrial applications demonstrate that compared with state-of-the-art LF models, RLF is able to achieve significantly higher computational efficiency and comparable prediction accuracy for missing data.I provides an important alternative approach to LF analysis of HiDS matrices, which is especially desired for industrial applications demanding highly efficient models.展开更多
BACKGROUND Colorectal cancer is a common digestive cancer worldwide.As a comprehensive treatment for locally advanced rectal cancer(LARC),neoadjuvant therapy(NT)has been increasingly used as the standard treatment for...BACKGROUND Colorectal cancer is a common digestive cancer worldwide.As a comprehensive treatment for locally advanced rectal cancer(LARC),neoadjuvant therapy(NT)has been increasingly used as the standard treatment for clinical stage II/III rectal cancer.However,few patients achieve a complete pathological response,and most patients require surgical resection and adjuvant therapy.Therefore,identifying risk factors and developing accurate models to predict the prognosis of LARC patients are of great clinical significance.AIM To establish effective prognostic nomograms and risk score prediction models to predict overall survival(OS)and disease-free survival(DFS)for LARC treated with NT.METHODS Nomograms and risk factor score prediction models were based on patients who received NT at the Cancer Hospital from 2015 to 2017.The least absolute shrinkage and selection operator regression model were utilized to screen for prognostic risk factors,which were validated by the Cox regression method.Assessment of the performance of the two prediction models was conducted using receiver operating characteristic curves,and that of the two nomograms was conducted by calculating the concordance index(C-index)and calibration curves.The results were validated in a cohort of 65 patients from 2015 to 2017.RESULTS Seven features were significantly associated with OS and were included in the OS prediction nomogram and prediction model:Vascular_tumors_bolt,cancer nodules,yN,body mass index,matchmouth distance from the edge,nerve aggression and postoperative carcinoembryonic antigen.The nomogram showed good predictive value for OS,with a C-index of 0.91(95%CI:0.85,0.97)and good calibration.In the validation cohort,the C-index was 0.69(95%CI:0.53,0.84).The risk factor prediction model showed good predictive value.The areas under the curve for 3-and 5-year survival were 0.811 and 0.782.The nomogram for predicting DFS included ypTNM and nerve aggression and showed good calibration and a C-index of 0.77(95%CI:0.69,0.85).In the validation cohort,the C-index was 0.71(95%CI:0.61,0.81).The prediction model for DFS also had good predictive value,with an AUC for 3-year survival of 0.784 and an AUC for 5-year survival of 0.754.CONCLUSION We established accurate nomograms and prediction models for predicting OS and DFS in patients with LARC after undergoing NT.展开更多
Several important equilibrium Si isotope fractionation factors among minerals,organic molecules and the H_4SiO_4 solution are complemented to facilitate the explanation of the distributions of Si isotopes in Earth'...Several important equilibrium Si isotope fractionation factors among minerals,organic molecules and the H_4SiO_4 solution are complemented to facilitate the explanation of the distributions of Si isotopes in Earth's surface environments.The results reveal that,in comparison to aqueous H_4SiO_4,heavy Si isotopes will be significantly enriched in secondary silicate minerals.On the contrary,quadra-coordinated organosilicon complexes are enriched in light silicon isotope relative to the solution.The extent of ^(28)Si-enrichment in hyper-coordinated organosilicon complexes was found to be the largest.In addition,the large kinetic isotope effect associated with the polymerization of monosilicic acid and dimer was calculated,and the results support the previous statement that highly ^(28)Sienrichment in the formation of amorphous quartz precursor contributes to the discrepancy between theoretical calculations and field observations.With the equilibrium Si isotope fractionation factors provided here,Si isotope distributions in many of Earth's surface systems can be explained.For example,the change of bulk soil δ^(30)Si can be predicted as a concave pattern with respect to the weathering degree,with the minimum value where allophane completely dissolves and the total amount of sesquioxides and poorly crystalline minerals reaches their maximum.When,under equilibrium conditions,the well-crystallized clays start to precipitate from the pore solutions,the bulk soil δ^(30)Si will increase again and reach a constant value.Similarly,the precipitation of crystalline smectite and the dissolution of poorly crystalline kaolinite may explain the δ^(30)Si variations in the ground water profile.The equilibrium Si isotope fractionations among the quadracoordinated organosilicon complexes and the H_4SiO_4solution may also shed light on the Si isotope distributions in the Si-accumulating plants.展开更多
Aiming at the shortage of sufficient continuous parameters for using models to estimate farmland soil organic carbon(SOC) content, an acquisition method of factors influencing farmland SOC and an estimation method of ...Aiming at the shortage of sufficient continuous parameters for using models to estimate farmland soil organic carbon(SOC) content, an acquisition method of factors influencing farmland SOC and an estimation method of farmland SOC content with Internet of Things(IOT) are proposed in this paper. The IOT sensing device and transmission network were established in a wheat demonstration base in Yanzhou Distict of Jining City, Shandong Province, China to acquire data in real time. Using real-time data and statistics data, the dynamic changes of SOC content between October 2012 and June 2015 was simulated in the experimental area with SOC dynamic simulation model. In order to verify the estimation results, potassium dichromate external heating method was applied for measuring the SOC content. The results show that: 1) The estimated value matches the measured value in the lab very well. So the method is feasible in this paper. 2) There is a clear dynamic variation in the SOC content at 0.2 m soil depth in different growing periods of wheat. The content reached the highest level during the sowing period, and is lowest in the flowering period. 3) The SOC content at 0.2 m soil depth varies in accordance with the amount of returned straw. The larger the amount of returned straw is, the higher the SOC content.展开更多
To describe the empirical data of collaboration networks, several evolving mechanisms have been proposed, which usually introduce different dynamics factors controlling the network growth. These models can reasonably ...To describe the empirical data of collaboration networks, several evolving mechanisms have been proposed, which usually introduce different dynamics factors controlling the network growth. These models can reasonably reproduce the empirical degree distributions for a number of we11-studied real-world collaboration networks. On the basis of the previous studies, in this work we propose a collaboration network model in which the network growth is simultaneously controlled by three factors, including partial preferential attachment, partial random attachment and network growth speed. By using a rate equation method, we obtain an analytical formula for the act degree distribution. We discuss the dependence of the act degree distribution on these different dynamics factors. By fitting to the empirical data of two typical collaboration networks, we can extract the respective contributions of these dynamics factors to the evolution of each networks.展开更多
In this paper, we build the Linear Programming (LP) model, factor analysis model and return on investment model to measure the investment amount and which year to invest of each selected schools. We firstly analyze th...In this paper, we build the Linear Programming (LP) model, factor analysis model and return on investment model to measure the investment amount and which year to invest of each selected schools. We firstly analyze the indicators from attached files, and select effective indexes to choose schools donated. Then we select 17 indexes out after preprocessing all the indices. Secondly, we extract 1064 schools by MATLAB which is the Potential Candidate Schools from the table of attached files;we extract 10 common factors of these schools by factor analysis. After calculation, we rank the universities and select the top 100. We calculate the Return on Investment (ROI) based on these 17 indexes. Thirdly, we figure out the investment amount by conducting LP model through MATLAB. According to the property of schools, we calculate the annual limit investment and the mount of investment of each school. Fourthly, we determine which year to invest by ROI model which is operated by LINGO. In order to achieve optimal investment strategy and not duplication of investment, for five years, starting July 2016, we assume that the time duration that the organization’s money should be provided is one year, and the school return to the Good grant Foundation only one year. Then we can get the investment amount per school, the return on that investment, and which years to invest. Fifthly, by changing parameter, the sensitivity analysis is conducted for our models. The result indicates that our models are feasible and robust. Finally, we evaluate our models, and point out the strengths and weakness. Through previous analysis, we can find that our models can be applied to many fields, which have a relatively high generalization.展开更多
High costs are connected with upgrading railway embankments throughout Denmark using the partial factors for geotechnical design calibrated for general application. One way to reduce the costs is reliability-based cal...High costs are connected with upgrading railway embankments throughout Denmark using the partial factors for geotechnical design calibrated for general application. One way to reduce the costs is reliability-based calibration of the partial factors to a reasonable safety level taking into account the specific design situations and uncertainties relevant to railway embankments. A reliability-based design has been investigated, resulting in an optimal partial factor for the considered subsoil. With a stochastic soil model to simulate the undrained shear strength of soft soil deposits, the partial factor is calibrated using asymptotic sampling for the reliability assessment. The calibration shows that the partial factor can be reduced significantly compared to the value specified in the Danish National Annex to DS/EN 1997-1 (2007), Eurocode 7.展开更多
Diffusion has been systematically described as the main mechanism of chloride transport in reinforced concrete(RC) structure, especially when the concrete is in a saturated state. However, the single mechanism of di...Diffusion has been systematically described as the main mechanism of chloride transport in reinforced concrete(RC) structure, especially when the concrete is in a saturated state. However, the single mechanism of diffusion is not able to describe the actual chloride ingress in the nonsaturated concrete. Instead, it is dominated by the interaction of diffusion and convection. With the synergetic effects of various factors taken into account, this study aimed to modify and develop an analytical convection- diffusion coupling model for chloride transport in nonsaturated concrete. The model was verified by simulation of laboratory tests and field measurement. The results of comparison study demonstrate that the analytical model developed in this study is efficient and accurate in predicting the chloride profiles in the nonsaturated concrete.展开更多
Non-parametric methods are treasured in data analysis,particularly in finance.ST-metric is a new concept,introduced by Tulunay(2017).It offers non-parametric methods and a new geometric view to data analysis.In that p...Non-parametric methods are treasured in data analysis,particularly in finance.ST-metric is a new concept,introduced by Tulunay(2017).It offers non-parametric methods and a new geometric view to data analysis.In that paper,ST-metric concept has been applied to performance measures of portfolios.In this current paper,we purpose another ST-metric method for finding factor exposures in the five-style-factors model.Here the style factors are value,size,minimum volatility,quality and momentum.The main idea is to find the factor exposures(weights)of the five-factors-model by minimizing the ST-metric between benchmark returns and the constructed factor model returns.We compare ST-metric method with Tracking Error method(TE-method)which is used for factor analysis of major indexes,decomposed into the style factors(tradable via Exchange Traded Funds(ETFs))by Ang et al.(2018).We show that ST-metric method gives better estimation of the factor exposures(weights)than tracking error method,in general,and further how ST-metric values vary with respect to fluctuations.This explains the reason behind the efficiency of the ST-metric method.We support this idea with empirical evidences.展开更多
The number of attacks is growing tremendously in tandem with the growth of internet technologies.As a result,protecting the private data from prying eyes has become a critical and tough undertaking.Many intrusion dete...The number of attacks is growing tremendously in tandem with the growth of internet technologies.As a result,protecting the private data from prying eyes has become a critical and tough undertaking.Many intrusion detection solutions have been offered by researchers in order to decrease the effect of these attacks.For attack detection,the prior system has created an SMSRPF(Stacking Model Significant Rule Power Factor)classifier.To provide creative instance detection,the SMSRPF combines the detection of trained classifiers such as DT(Decision Tree)and RF(Random Forest).Nevertheless,it does not generate any accuratefindings that are adequate.The suggested system has built an EWF(Ensemble Wrapper Filter)feature selection with SMSRPF classifier for attack detection so as to overcome this problem.The UNSW-NB15 dataset is used as an input in this proposed research project.Specifically,min–max normalization approach is used to pre-process the incoming data.The feature selection is then carried out using EWF.Based on the selected features,SMSRPF classifiers are utilized to detect the attacks.The SMSRPF is integrated with the trained classi-fiers such as DT and RF to create creative instance detection.After that,the testing data is classified using MCAR(Multi-Class Classification based on Association Rules).The SRPF judges the rules correctly even when the confidence and the lift measures fail.Regarding accuracy,precision,recall,f-measure,computation time,and error,the experimental findings suggest that the new system outperforms the prior systems.展开更多
This paper estimates proxy specifications of a five-factor asset pricing model to produce stylized facts of the Saudi capital market and test an arbitrage pricing theory (APT) model. The data set is the panel of 20 ...This paper estimates proxy specifications of a five-factor asset pricing model to produce stylized facts of the Saudi capital market and test an arbitrage pricing theory (APT) model. The data set is the panel of 20 most actively traded firms, excluding firms with negative book value of equity. The contribution to the extant literature is three-fold: (l) organizing Saudi market data based on beta and firm-specific fundamentals, namely, growth, value, accounting earnings, and equity investments; (2) conducting a parsimony analysis within the theoretical framework of APT; and (3) quantifying the information risk facing the marginal investor by decomposing earnings into cash flows and accruals and investigating respective loadings in an unrestricted version of the parsimonious specification. Proxy asset pricing specifications, though intuitively appealing, are scant due to lack of theoretical frameworks and misguided significance tests of factor loadings. Throughout, this issue is addressed by keeping the empirical analysis under describing market facts and testing an APT model. The study concludes with a significant empirical explanation that specifies average returns in terms of the covariance risk and accounting accruals.展开更多
In recent years,the telecommunications sector is no longer limited to traditional communications,but has become the backbone for the use of data,content and digital applications by individuals,governments and companie...In recent years,the telecommunications sector is no longer limited to traditional communications,but has become the backbone for the use of data,content and digital applications by individuals,governments and companies to ensure the continuation of economic and social activity in light of social distancing and total closure inmost countries in the world.Therefore,electronic government(e-Government)andmobile government(m-Government)are the results of technological evolution and innovation.Hence,it is important to investigate the factors that influence the intention to use m-Government services among Jordan’s society.This paper proposed a new m-Government acceptance model in Jordan(AMGS);this model combines the Information System(IS)Success Factor Model and Hofstede Cultural Dimensions Theory.The study was conducted by surveying different groups of the Jordanian community.Astructured questionnaire was used to collect data from203 respondents.Multiple regression analysis has been conducted to analyze the data.The results indicate that the significant predictors of citizen intention to use m-Government services in Jordan are Information Quality,Service Quality,Uncertainty Avoidance,and Indulgence vs.restraint.While,the results also suggest that Power Distance is not a significant predictor of citizen intention to use m-Government services.展开更多
基金supported by grants from the National Natural Science Foundation of China(72171088,71803049,72003205)the Ministry of Education of the People's Republic of China of Humanities and Social Sciences Youth Fundation(20YJC790142)the General Project of Social Science Planning in Guangdong Province,China(GD22CYJ12).
文摘We forecast realized volatilities by developing a time-varying heterogeneous autoregressive(HAR)latent factor model with dynamic model average(DMA)and dynamic model selection(DMS)approaches.The number of latent factors is determined using Chan and Grant's(2016)deviation information criteria.The predictors in our model include lagged daily,weekly,and monthly volatility variables,the corresponding volatility factors,and a speculation variable.In addition,the time-varying properties of the best-performing DMA(DMS)-HAR-2FX models,including size,inclusion probabilities,and coefficients,are examined.We find that the proposed DMA(DMS)-HAR-2FX model outperforms the competing models for both in-sample and out-of-sample forecasts.Furthermore,the speculation variable displays strong predictability for forecasting the realized volatility of financial futures in China.
文摘A weed is a plant that thrives in areas of human disturbance, such as gardens, fields, pastures, waysides, and waste places where it is not intentionally cultivated. Dispersal affects community dynamics and vegetation response to global change. The process of seed disposal is influenced by wind, which plays a crucial role in determining the distance and probability of seed dispersal. Existing models of seed dispersal consider wind direction but fail to incorporate wind intensity. In this paper, a novel seed disposal model was proposed in this paper, incorporating wind intensity based on relevant references. According to various climatic conditions, including temperate, arid, and tropical regions, three specific regions were selected to establish a wind dispersal model that accurately reflects the density function distribution of dispersal distance. Additionally, dandelions growth is influenced by a multitude of factors, encompassing temperature, humidity, climate, and various environmental variables that necessitate meticulous consideration. Based on Factor Analysis model, which completely considers temperature, precipitation, solar radiation, wind, and land carrying capacity, a conclusion is presented, indicating that the growth of seeds is primarily influenced by plant attributes and climate conditions, with the former exerting a relatively stronger impact. Subsequently, the remaining two plants were chosen based on seed weight, yielding consistent conclusion.
文摘In this editorial,we comment on the article by Chen et al.We specifically focus on the risk factors,prognostic factors,and management of brain metastasis(BM)in breast cancer(BC).BC is the second most common cancer to have BM after lung cancer.Independent risk factors for BM in BC are:HER-2 positive BC,triplenegative BC,and germline BRCA mutation.Other factors associated with BM are lung metastasis,age less than 40 years,and African and American ancestry.Even though risk factors associated with BM in BC are elucidated,there is a lack of data on predictive models for BM in BC.Few studies have been made to formulate predictive models or nomograms to address this issue,where age,grade of tumor,HER-2 receptor status,and number of metastatic sites(1 vs>1)were predictive of BM in metastatic BC.However,none have been used in clinical practice.National Comprehensive Cancer Network recommends screening of BM in advanced BC only when the patient is symptomatic or suspicious of central nervous system symptoms;routine screening for BM in BC is not recommended in the guidelines.BM decreases the quality of life and will have a significant psychological impact.Further studies are required for designing validated nomograms or predictive models for BM in BC;these models can be used in the future to develop treatment approaches to prevent BM,which improves the quality of life and overall survival.
文摘Latent factor models have become a workhorse for a large number of recommender systems. While these sys- tems are built using ratings data, which is typically assumed static, the ability to incorporate different kinds of subsequent user feedback is an important asset. For instance, the user might want to provide additional information to the system in order to improve his personal recommendations. To this end, we examine a novel scheme for efficiently learning (or refining) user parameters from such feedback. We propose a scheme where users are presented with a sequence of pair- wise preference questions: "Do you prefer item A over B?" User parameters are updated based on their response, and subsequent questions are chosen adaptively after incorporat- ing the feedback. We operate in a Bayesian framework and the choice of questions is based on an information gain cri- terion. We validate the scheme on the Netflix movie ratings data set and a proprietary television viewership data set. A user study and automated experiments validate our findings.
文摘This paper provides a survey on recent developments in structural changes for high dimensional factor models. Compared with conventional low-dimensional time series, structural changes in factor models are more complicated due to the unobservability of factors and factor loadings. The following topics are covered in this survey: the identification conditions for the structural changes in the factor loadings, different impacts of big and small breaks in factor models, tests for structural changes in the factor loadings of a specific variable, tests for structural changes in the factor loading matrix, joint tests for structural changes in the factor loadings and coefficients in factor-augmented regressions, tests for smooth changes in the factor loadings, estimation of break dates, and model selection in factor models with structural changes via the shrinkage method.
基金supported by National Natural Science Foundation of China(Grant Nos.11631003,11690012 and 11571068)the Fundamental Research Funds for the Central Universities(Grant No.2412019FZ030)+1 种基金Jilin Provincial Science and Technology Development Plan Funded Project(Grant No.20180520026JH)the National Institute of Health。
文摘Linear factor models are familiar tools used in many fields.Several pioneering literatures established foundational theoretical results of the quasi-maximum likelihood estimator for high-dimensional linear factor models.Their results are based on a critical assumption:The error variance estimators are uniformly bounded in probability.Instead of making such an assumption,we provide a rigorous proof of this result under some mild conditions.
文摘For the class of(partially specified)internal risk factor models we establish strongly simplified supermodular ordering results in comparison to the case of general risk factor models.This allows us to derive meaningful and improved risk bounds for the joint portfolio in risk factor models with dependence information given by constrained specification sets for the copulas of the risk components and the systemic risk factor.The proof of our main comparison result is not standard.It is based on grid copula approximation of upper products of copulas and on the theory of mass transfers.An application to real market data shows considerable improvement over the standard method.
基金supported in part by the National Natural Science Foundation of China(61702475,61772493,61902370,62002337)in part by the Natural Science Foundation of Chongqing,China(cstc2019jcyj-msxmX0578,cstc2019jcyjjqX0013)+1 种基金in part by the Chinese Academy of Sciences“Light of West China”Program,in part by the Pioneer Hundred Talents Program of Chinese Academy of Sciencesby Technology Innovation and Application Development Project of Chongqing,China(cstc2019jscx-fxydX0027)。
文摘High-dimensional and sparse(HiDS)matrices commonly arise in various industrial applications,e.g.,recommender systems(RSs),social networks,and wireless sensor networks.Since they contain rich information,how to accurately represent them is of great significance.A latent factor(LF)model is one of the most popular and successful ways to address this issue.Current LF models mostly adopt L2-norm-oriented Loss to represent an HiDS matrix,i.e.,they sum the errors between observed data and predicted ones with L2-norm.Yet L2-norm is sensitive to outlier data.Unfortunately,outlier data usually exist in such matrices.For example,an HiDS matrix from RSs commonly contains many outlier ratings due to some heedless/malicious users.To address this issue,this work proposes a smooth L1-norm-oriented latent factor(SL-LF)model.Its main idea is to adopt smooth L1-norm rather than L2-norm to form its Loss,making it have both strong robustness and high accuracy in predicting the missing data of an HiDS matrix.Experimental results on eight HiDS matrices generated by industrial applications verify that the proposed SL-LF model not only is robust to the outlier data but also has significantly higher prediction accuracy than state-of-the-art models when they are used to predict the missing data of HiDS matrices.
基金supported in part by the National Natural Science Foundation of China (6177249391646114)+1 种基金Chongqing research program of technology innovation and application (cstc2017rgzn-zdyfX0020)in part by the Pioneer Hundred Talents Program of Chinese Academy of Sciences
文摘Latent factor(LF) models are highly effective in extracting useful knowledge from High-Dimensional and Sparse(HiDS) matrices which are commonly seen in various industrial applications. An LF model usually adopts iterative optimizers,which may consume many iterations to achieve a local optima,resulting in considerable time cost. Hence, determining how to accelerate the training process for LF models has become a significant issue. To address this, this work proposes a randomized latent factor(RLF) model. It incorporates the principle of randomized learning techniques from neural networks into the LF analysis of HiDS matrices, thereby greatly alleviating computational burden. It also extends a standard learning process for randomized neural networks in context of LF analysis to make the resulting model represent an HiDS matrix correctly.Experimental results on three HiDS matrices from industrial applications demonstrate that compared with state-of-the-art LF models, RLF is able to achieve significantly higher computational efficiency and comparable prediction accuracy for missing data.I provides an important alternative approach to LF analysis of HiDS matrices, which is especially desired for industrial applications demanding highly efficient models.
文摘BACKGROUND Colorectal cancer is a common digestive cancer worldwide.As a comprehensive treatment for locally advanced rectal cancer(LARC),neoadjuvant therapy(NT)has been increasingly used as the standard treatment for clinical stage II/III rectal cancer.However,few patients achieve a complete pathological response,and most patients require surgical resection and adjuvant therapy.Therefore,identifying risk factors and developing accurate models to predict the prognosis of LARC patients are of great clinical significance.AIM To establish effective prognostic nomograms and risk score prediction models to predict overall survival(OS)and disease-free survival(DFS)for LARC treated with NT.METHODS Nomograms and risk factor score prediction models were based on patients who received NT at the Cancer Hospital from 2015 to 2017.The least absolute shrinkage and selection operator regression model were utilized to screen for prognostic risk factors,which were validated by the Cox regression method.Assessment of the performance of the two prediction models was conducted using receiver operating characteristic curves,and that of the two nomograms was conducted by calculating the concordance index(C-index)and calibration curves.The results were validated in a cohort of 65 patients from 2015 to 2017.RESULTS Seven features were significantly associated with OS and were included in the OS prediction nomogram and prediction model:Vascular_tumors_bolt,cancer nodules,yN,body mass index,matchmouth distance from the edge,nerve aggression and postoperative carcinoembryonic antigen.The nomogram showed good predictive value for OS,with a C-index of 0.91(95%CI:0.85,0.97)and good calibration.In the validation cohort,the C-index was 0.69(95%CI:0.53,0.84).The risk factor prediction model showed good predictive value.The areas under the curve for 3-and 5-year survival were 0.811 and 0.782.The nomogram for predicting DFS included ypTNM and nerve aggression and showed good calibration and a C-index of 0.77(95%CI:0.69,0.85).In the validation cohort,the C-index was 0.71(95%CI:0.61,0.81).The prediction model for DFS also had good predictive value,with an AUC for 3-year survival of 0.784 and an AUC for 5-year survival of 0.754.CONCLUSION We established accurate nomograms and prediction models for predicting OS and DFS in patients with LARC after undergoing NT.
基金the funding support from the 973 Program(2014CB440904)CAS/SAFEA International Partnership Program for Creative Research Teams(Intraplate Mineralization Research Team,KZZD-EW-TZ-20)Chinese NSF projects(41173023,41225012,41490635,41530210)
文摘Several important equilibrium Si isotope fractionation factors among minerals,organic molecules and the H_4SiO_4 solution are complemented to facilitate the explanation of the distributions of Si isotopes in Earth's surface environments.The results reveal that,in comparison to aqueous H_4SiO_4,heavy Si isotopes will be significantly enriched in secondary silicate minerals.On the contrary,quadra-coordinated organosilicon complexes are enriched in light silicon isotope relative to the solution.The extent of ^(28)Si-enrichment in hyper-coordinated organosilicon complexes was found to be the largest.In addition,the large kinetic isotope effect associated with the polymerization of monosilicic acid and dimer was calculated,and the results support the previous statement that highly ^(28)Sienrichment in the formation of amorphous quartz precursor contributes to the discrepancy between theoretical calculations and field observations.With the equilibrium Si isotope fractionation factors provided here,Si isotope distributions in many of Earth's surface systems can be explained.For example,the change of bulk soil δ^(30)Si can be predicted as a concave pattern with respect to the weathering degree,with the minimum value where allophane completely dissolves and the total amount of sesquioxides and poorly crystalline minerals reaches their maximum.When,under equilibrium conditions,the well-crystallized clays start to precipitate from the pore solutions,the bulk soil δ^(30)Si will increase again and reach a constant value.Similarly,the precipitation of crystalline smectite and the dissolution of poorly crystalline kaolinite may explain the δ^(30)Si variations in the ground water profile.The equilibrium Si isotope fractionations among the quadracoordinated organosilicon complexes and the H_4SiO_4solution may also shed light on the Si isotope distributions in the Si-accumulating plants.
基金Under the auspices of National High-tech R&D Program of China(No.2013AA102301)National Natural Science Foundation of China(No.71503148)
文摘Aiming at the shortage of sufficient continuous parameters for using models to estimate farmland soil organic carbon(SOC) content, an acquisition method of factors influencing farmland SOC and an estimation method of farmland SOC content with Internet of Things(IOT) are proposed in this paper. The IOT sensing device and transmission network were established in a wheat demonstration base in Yanzhou Distict of Jining City, Shandong Province, China to acquire data in real time. Using real-time data and statistics data, the dynamic changes of SOC content between October 2012 and June 2015 was simulated in the experimental area with SOC dynamic simulation model. In order to verify the estimation results, potassium dichromate external heating method was applied for measuring the SOC content. The results show that: 1) The estimated value matches the measured value in the lab very well. So the method is feasible in this paper. 2) There is a clear dynamic variation in the SOC content at 0.2 m soil depth in different growing periods of wheat. The content reached the highest level during the sowing period, and is lowest in the flowering period. 3) The SOC content at 0.2 m soil depth varies in accordance with the amount of returned straw. The larger the amount of returned straw is, the higher the SOC content.
基金Supported by the National Natural Science Foundation of China under Grant Nos 11305139 and 11147178
文摘To describe the empirical data of collaboration networks, several evolving mechanisms have been proposed, which usually introduce different dynamics factors controlling the network growth. These models can reasonably reproduce the empirical degree distributions for a number of we11-studied real-world collaboration networks. On the basis of the previous studies, in this work we propose a collaboration network model in which the network growth is simultaneously controlled by three factors, including partial preferential attachment, partial random attachment and network growth speed. By using a rate equation method, we obtain an analytical formula for the act degree distribution. We discuss the dependence of the act degree distribution on these different dynamics factors. By fitting to the empirical data of two typical collaboration networks, we can extract the respective contributions of these dynamics factors to the evolution of each networks.
文摘In this paper, we build the Linear Programming (LP) model, factor analysis model and return on investment model to measure the investment amount and which year to invest of each selected schools. We firstly analyze the indicators from attached files, and select effective indexes to choose schools donated. Then we select 17 indexes out after preprocessing all the indices. Secondly, we extract 1064 schools by MATLAB which is the Potential Candidate Schools from the table of attached files;we extract 10 common factors of these schools by factor analysis. After calculation, we rank the universities and select the top 100. We calculate the Return on Investment (ROI) based on these 17 indexes. Thirdly, we figure out the investment amount by conducting LP model through MATLAB. According to the property of schools, we calculate the annual limit investment and the mount of investment of each school. Fourthly, we determine which year to invest by ROI model which is operated by LINGO. In order to achieve optimal investment strategy and not duplication of investment, for five years, starting July 2016, we assume that the time duration that the organization’s money should be provided is one year, and the school return to the Good grant Foundation only one year. Then we can get the investment amount per school, the return on that investment, and which years to invest. Fifthly, by changing parameter, the sensitivity analysis is conducted for our models. The result indicates that our models are feasible and robust. Finally, we evaluate our models, and point out the strengths and weakness. Through previous analysis, we can find that our models can be applied to many fields, which have a relatively high generalization.
基金The funding initiating this work was provided by Banedanmark
文摘High costs are connected with upgrading railway embankments throughout Denmark using the partial factors for geotechnical design calibrated for general application. One way to reduce the costs is reliability-based calibration of the partial factors to a reasonable safety level taking into account the specific design situations and uncertainties relevant to railway embankments. A reliability-based design has been investigated, resulting in an optimal partial factor for the considered subsoil. With a stochastic soil model to simulate the undrained shear strength of soft soil deposits, the partial factor is calibrated using asymptotic sampling for the reliability assessment. The calibration shows that the partial factor can be reduced significantly compared to the value specified in the Danish National Annex to DS/EN 1997-1 (2007), Eurocode 7.
基金Funded by the National Natural Science Foundation of China(Nos.51278304,U1134209,U1434204&51422814)the National Basic Research Program(973 Program)of China(No.011-CB013604)the Technology Research and Development Program(Basic Research Project)of Shenzhen(Nos.JCYJ20120613174456685&JCYJ20130329143859418)
文摘Diffusion has been systematically described as the main mechanism of chloride transport in reinforced concrete(RC) structure, especially when the concrete is in a saturated state. However, the single mechanism of diffusion is not able to describe the actual chloride ingress in the nonsaturated concrete. Instead, it is dominated by the interaction of diffusion and convection. With the synergetic effects of various factors taken into account, this study aimed to modify and develop an analytical convection- diffusion coupling model for chloride transport in nonsaturated concrete. The model was verified by simulation of laboratory tests and field measurement. The results of comparison study demonstrate that the analytical model developed in this study is efficient and accurate in predicting the chloride profiles in the nonsaturated concrete.
文摘Non-parametric methods are treasured in data analysis,particularly in finance.ST-metric is a new concept,introduced by Tulunay(2017).It offers non-parametric methods and a new geometric view to data analysis.In that paper,ST-metric concept has been applied to performance measures of portfolios.In this current paper,we purpose another ST-metric method for finding factor exposures in the five-style-factors model.Here the style factors are value,size,minimum volatility,quality and momentum.The main idea is to find the factor exposures(weights)of the five-factors-model by minimizing the ST-metric between benchmark returns and the constructed factor model returns.We compare ST-metric method with Tracking Error method(TE-method)which is used for factor analysis of major indexes,decomposed into the style factors(tradable via Exchange Traded Funds(ETFs))by Ang et al.(2018).We show that ST-metric method gives better estimation of the factor exposures(weights)than tracking error method,in general,and further how ST-metric values vary with respect to fluctuations.This explains the reason behind the efficiency of the ST-metric method.We support this idea with empirical evidences.
文摘The number of attacks is growing tremendously in tandem with the growth of internet technologies.As a result,protecting the private data from prying eyes has become a critical and tough undertaking.Many intrusion detection solutions have been offered by researchers in order to decrease the effect of these attacks.For attack detection,the prior system has created an SMSRPF(Stacking Model Significant Rule Power Factor)classifier.To provide creative instance detection,the SMSRPF combines the detection of trained classifiers such as DT(Decision Tree)and RF(Random Forest).Nevertheless,it does not generate any accuratefindings that are adequate.The suggested system has built an EWF(Ensemble Wrapper Filter)feature selection with SMSRPF classifier for attack detection so as to overcome this problem.The UNSW-NB15 dataset is used as an input in this proposed research project.Specifically,min–max normalization approach is used to pre-process the incoming data.The feature selection is then carried out using EWF.Based on the selected features,SMSRPF classifiers are utilized to detect the attacks.The SMSRPF is integrated with the trained classi-fiers such as DT and RF to create creative instance detection.After that,the testing data is classified using MCAR(Multi-Class Classification based on Association Rules).The SRPF judges the rules correctly even when the confidence and the lift measures fail.Regarding accuracy,precision,recall,f-measure,computation time,and error,the experimental findings suggest that the new system outperforms the prior systems.
文摘This paper estimates proxy specifications of a five-factor asset pricing model to produce stylized facts of the Saudi capital market and test an arbitrage pricing theory (APT) model. The data set is the panel of 20 most actively traded firms, excluding firms with negative book value of equity. The contribution to the extant literature is three-fold: (l) organizing Saudi market data based on beta and firm-specific fundamentals, namely, growth, value, accounting earnings, and equity investments; (2) conducting a parsimony analysis within the theoretical framework of APT; and (3) quantifying the information risk facing the marginal investor by decomposing earnings into cash flows and accruals and investigating respective loadings in an unrestricted version of the parsimonious specification. Proxy asset pricing specifications, though intuitively appealing, are scant due to lack of theoretical frameworks and misguided significance tests of factor loadings. Throughout, this issue is addressed by keeping the empirical analysis under describing market facts and testing an APT model. The study concludes with a significant empirical explanation that specifies average returns in terms of the covariance risk and accounting accruals.
基金This research funded by Al-Zaytoonah University of Jordan.
文摘In recent years,the telecommunications sector is no longer limited to traditional communications,but has become the backbone for the use of data,content and digital applications by individuals,governments and companies to ensure the continuation of economic and social activity in light of social distancing and total closure inmost countries in the world.Therefore,electronic government(e-Government)andmobile government(m-Government)are the results of technological evolution and innovation.Hence,it is important to investigate the factors that influence the intention to use m-Government services among Jordan’s society.This paper proposed a new m-Government acceptance model in Jordan(AMGS);this model combines the Information System(IS)Success Factor Model and Hofstede Cultural Dimensions Theory.The study was conducted by surveying different groups of the Jordanian community.Astructured questionnaire was used to collect data from203 respondents.Multiple regression analysis has been conducted to analyze the data.The results indicate that the significant predictors of citizen intention to use m-Government services in Jordan are Information Quality,Service Quality,Uncertainty Avoidance,and Indulgence vs.restraint.While,the results also suggest that Power Distance is not a significant predictor of citizen intention to use m-Government services.