In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistica...With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.展开更多
[Objectives]The research aimed to explore the distribution characteristics of TCM constitution types of patients with hypertension and insomnia,and study the clinical characteristics of patients with different constit...[Objectives]The research aimed to explore the distribution characteristics of TCM constitution types of patients with hypertension and insomnia,and study the clinical characteristics of patients with different constitutions,in order to provide new ideas for the treatment of patients with hypertension and insomnia.[Methods]Cross sectional observation method was used,and 420 patients with hypertension and insomnia were selected.Required information was collected,and the constitution type of traditional Chinese medicine was determined,and relevant data were recorded.SPSS and Logistic regression analysis method were used to explore the correlation between the distribution of TCM constitution types and gender,age,24 h-SBP,24 h-DBP,24 h-BPV,PSQI score,etc.[Results]Among 420 patients,the proportion of gentleness constitution was the most,and others in turn were Qi deficiency constitution>Yang deficiency constitution>phlegm dampness constitution>Qi stagnation constitution>Yin deficiency constitution>blood stasis constitution>damp heat constitution>special constitution.Among male patients,the proportion of gentleness constitution was the most.Among female patients,the proportion of Qi deficiency constitution was the most.In each constitution,the proportion of men and women was different,and the difference in gentleness constitution,Qi deficiency constitution and Yin deficiency constitution had statistical significance(P<0.05).The proportion of gentleness constitution for young and middle-aged patients was the most,while elderly patients with Qi deficiency constitution was the most.There was difference in the distribution of TCM constitution in different age groups,and the difference had statistical significance(P<0.05).Compared with the patients with gentleness constitution,the patients with Qi deficiency constitution,Yang deficiency constitution,Yin deficiency constitution,damp heat constitution,blood stasis constitution and Qi stagnation constitution had different differences in terms of age,24 h-SBP,24 h-DBP,24 h-BPV and PSQI score,and there was statistical significance(P<0.05).Except damp heat constitution,blood stasis constitution and special constitution,other constitutions had certain correlation with age,24 h-SBP,24 h-DBP,24 h-BPV and PSQI score.[Conclusions]TCM constitution types of patients with hypertension and insomnia were dominant by gentleness constitution,Qi deficiency constitution and Yang deficiency constitution.The distribution of TCM constitution in different gender and age groups was different,and different TCM constitution was related to ABPM and PSQI.展开更多
This paper discusses some methodological aspects for the production of susceptibility maps of slope instability developed within the CARG Project (Geological Cartography of Italy at 1:50,000 scale). It describes an ex...This paper discusses some methodological aspects for the production of susceptibility maps of slope instability developed within the CARG Project (Geological Cartography of Italy at 1:50,000 scale). It describes an example of a susceptibility map in the presence of low susceptibility, using database having zero or negligible cost, with the aim to test some methodologies that can be easily reproducible to get a first estimate of the landslide susceptibility on a wide area. Two statistical approaches have been applied: the non-parametric conditional analysis and the logistic analysis for rare events. The predictive ability obtained from the two methodologies, was evaluated by the success-prediction curves for the conditional analysis, and by the Receiver Operating Characteristic curve (ROC), for the logistic model. The landslide susceptibility maps have been classified into four classes using both the Natural Breaks algorithm and the method proposed by Chung and Fabbri (2003). The paper considers the influence of these two classification methods on the quality of final results.展开更多
The aim of this study is to show complementary usage of logistic and correspondence analysis in a research subject to self-healing methodologies. Firstly, the number of the variables is reduced by logistic regression ...The aim of this study is to show complementary usage of logistic and correspondence analysis in a research subject to self-healing methodologies. Firstly, the number of the variables is reduced by logistic regression according to relationship between dependent and independent variables and then research carries on searching variables. The relationship among the behaviours of individuals and their demographic characteristics is modelled by logistic regression and shown graphically by correspondence analysis. In application, first of all, the effect of age, sex, marital status, education level, occupation and income level and present health condition, on appreciating self-health, is explained by a model. As a result of that model, it can be said that the effect of age, occupation and present health condition is reasonable. After analysing that model, the relationship between categorical variables (age, sex, occupation, preferred precautions, and worth of personal health) is shown graphically by multiple correspondence analysis.展开更多
Objective This study was undertaken to investigate the influencing factors on serum ALT level and hepatitis C virus(HCV)RNA titer in chronic hepatitis C(CHC)patients.Methods All patients enrolled into this study were ...Objective This study was undertaken to investigate the influencing factors on serum ALT level and hepatitis C virus(HCV)RNA titer in chronic hepatitis C(CHC)patients.Methods All patients enrolled into this study were anti-HCV positive.Retrospective tracing method was applied to detect serum ALT level and HCV RNA titer and to collect general information of the patients such as genders,age groups,interferon medication history,infection pathways,height and weight.Then the multi-factor analysis was adopted with the application of binominal logistic regression mode.Results The abnormal rate of ALT level was positively correlated to HCV RNA and gender while negatively correlated to interferon medication history and age group,with Wald value of the 4 factors as 39.604,11.823,18.991 and 7.389,respectively.The positive rate of HCV RNA was negatively correlated to interferon medication history and gender while positively correlated to ALT level,with corresponding Wald value of the 3 factors as81.394,7.618 and 27.562,respectively.Conclusions The normal ALT level in HCV infected patients was associated with viral load,age,gender and interferon medication history,while the normal rate of HCV RNA titer was closely associated with gender,interferon medication history and ALT level.展开更多
Objective: Our object is to study risk factors of tumor patients’ PICC catheter-related blood stream infection. Method: a retrospective analysis of data of 586 PICC catheterized patients was implemented, a univariate...Objective: Our object is to study risk factors of tumor patients’ PICC catheter-related blood stream infection. Method: a retrospective analysis of data of 586 PICC catheterized patients was implemented, a univariate analysis of general data and catheterizing data of tumor patients was then carried out, and data of single factors with statistical significance were incorporated into multi-factor Logistic regression model for analysis. Results: PICC catheter-related blood stream infection occurred to 16 patients, and occurrence rate was 2.73%. Multi-factor Logistic regression analysis results showed that number of puncturing times, positioning method and maintenance frequency were risk factors for tumor patients’ peripherally inserted central catheter catheter-related blood stream infection, and odds risk values were respectively 8.762, 9.253 and 10.324. Conclusion: for tumor patients implanted with peripherally inserted central catheters, using ECG positioning during strict sterile operation and catheterizing process to avoid repeated puncturing and increasing maintenance frequency could effectively reduce occurrence of PICC catheter-related blood stream infection.展开更多
There are a variety of classification techniques such as neural network, decision tree, support vector machine and logistic regression. The problem of dimensionality is pertinent to many learning algorithms, and it de...There are a variety of classification techniques such as neural network, decision tree, support vector machine and logistic regression. The problem of dimensionality is pertinent to many learning algorithms, and it denotes the drastic raise of computational complexity, however, we need to use dimensionality reduction methods. These methods include principal component analysis (PCA) and locality preserving projection (LPP). In many real-world classification problems, the local structure is more important than the global structure and dimensionality reduction techniques ignore the local structure and preserve the global structure. The objectives is to compare PCA and LPP in terms of accuracy, to develop appropriate representations of complex data by reducing the dimensions of the data and to explain the importance of using LPP with logistic regression. The results of this paper find that the proposed LPP approach provides a better representation and high accuracy than the PCA approach.展开更多
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep...Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.展开更多
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.
基金founded by the National Natural Science Foundation of China(81202283,81473070,81373102 and81202267)Key Grant of Natural Science Foundation of the Jiangsu Higher Education Institutions of China(10KJA330034 and11KJA330001)+1 种基金the Research Fund for the Doctoral Program of Higher Education of China(20113234110002)the Priority Academic Program for the Development of Jiangsu Higher Education Institutions(Public Health and Preventive Medicine)
文摘With recent advances in biotechnology, genome-wide association study (GWAS) has been widely used to identify genetic variants that underlie human complex diseases and traits. In case-control GWAS, typical statistical strategy is traditional logistical regression (LR) based on single-locus analysis. However, such a single-locus analysis leads to the well-known multiplicity problem, with a risk of inflating type I error and reducing power. Dimension reduction-based techniques, such as principal component-based logistic regression (PC-LR), partial least squares-based logistic regression (PLS-LR), have recently gained much attention in the analysis of high dimensional genomic data. However, the perfor- mance of these methods is still not clear, especially in GWAS. We conducted simulations and real data application to compare the type I error and power of PC-LR, PLS-LR and LR applicable to GWAS within a defined single nucleotide polymorphism (SNP) set region. We found that PC-LR and PLS can reasonably control type I error under null hypothesis. On contrast, LR, which is corrected by Bonferroni method, was more conserved in all simulation settings. In particular, we found that PC-LR and PLS-LR had comparable power and they both outperformed LR, especially when the causal SNP was in high linkage disequilibrium with genotyped ones and with a small effective size in simulation. Based on SNP set analysis, we applied all three methods to analyze non-small cell lung cancer GWAS data.
基金the National Key R&D Program Funded Project(2018 YFC17056009)Study on Insomnia and Its Relationship with Climacteric Syndrome,Hypertension,Mild Cognitive Impairment in the Elderly and Comprehensive Treatment Plan(2018YFC1705604)Pilot Project of Clinical Cooperation between Traditional Chinese and Western Medicine for Major and Difficult Diseases by the State Administration of Traditional Chinese Medicine:"Refractory Hypertension"(GZYYBYZF[2018]3).
文摘[Objectives]The research aimed to explore the distribution characteristics of TCM constitution types of patients with hypertension and insomnia,and study the clinical characteristics of patients with different constitutions,in order to provide new ideas for the treatment of patients with hypertension and insomnia.[Methods]Cross sectional observation method was used,and 420 patients with hypertension and insomnia were selected.Required information was collected,and the constitution type of traditional Chinese medicine was determined,and relevant data were recorded.SPSS and Logistic regression analysis method were used to explore the correlation between the distribution of TCM constitution types and gender,age,24 h-SBP,24 h-DBP,24 h-BPV,PSQI score,etc.[Results]Among 420 patients,the proportion of gentleness constitution was the most,and others in turn were Qi deficiency constitution>Yang deficiency constitution>phlegm dampness constitution>Qi stagnation constitution>Yin deficiency constitution>blood stasis constitution>damp heat constitution>special constitution.Among male patients,the proportion of gentleness constitution was the most.Among female patients,the proportion of Qi deficiency constitution was the most.In each constitution,the proportion of men and women was different,and the difference in gentleness constitution,Qi deficiency constitution and Yin deficiency constitution had statistical significance(P<0.05).The proportion of gentleness constitution for young and middle-aged patients was the most,while elderly patients with Qi deficiency constitution was the most.There was difference in the distribution of TCM constitution in different age groups,and the difference had statistical significance(P<0.05).Compared with the patients with gentleness constitution,the patients with Qi deficiency constitution,Yang deficiency constitution,Yin deficiency constitution,damp heat constitution,blood stasis constitution and Qi stagnation constitution had different differences in terms of age,24 h-SBP,24 h-DBP,24 h-BPV and PSQI score,and there was statistical significance(P<0.05).Except damp heat constitution,blood stasis constitution and special constitution,other constitutions had certain correlation with age,24 h-SBP,24 h-DBP,24 h-BPV and PSQI score.[Conclusions]TCM constitution types of patients with hypertension and insomnia were dominant by gentleness constitution,Qi deficiency constitution and Yang deficiency constitution.The distribution of TCM constitution in different gender and age groups was different,and different TCM constitution was related to ABPM and PSQI.
文摘This paper discusses some methodological aspects for the production of susceptibility maps of slope instability developed within the CARG Project (Geological Cartography of Italy at 1:50,000 scale). It describes an example of a susceptibility map in the presence of low susceptibility, using database having zero or negligible cost, with the aim to test some methodologies that can be easily reproducible to get a first estimate of the landslide susceptibility on a wide area. Two statistical approaches have been applied: the non-parametric conditional analysis and the logistic analysis for rare events. The predictive ability obtained from the two methodologies, was evaluated by the success-prediction curves for the conditional analysis, and by the Receiver Operating Characteristic curve (ROC), for the logistic model. The landslide susceptibility maps have been classified into four classes using both the Natural Breaks algorithm and the method proposed by Chung and Fabbri (2003). The paper considers the influence of these two classification methods on the quality of final results.
文摘The aim of this study is to show complementary usage of logistic and correspondence analysis in a research subject to self-healing methodologies. Firstly, the number of the variables is reduced by logistic regression according to relationship between dependent and independent variables and then research carries on searching variables. The relationship among the behaviours of individuals and their demographic characteristics is modelled by logistic regression and shown graphically by correspondence analysis. In application, first of all, the effect of age, sex, marital status, education level, occupation and income level and present health condition, on appreciating self-health, is explained by a model. As a result of that model, it can be said that the effect of age, occupation and present health condition is reasonable. After analysing that model, the relationship between categorical variables (age, sex, occupation, preferred precautions, and worth of personal health) is shown graphically by multiple correspondence analysis.
基金supported by a grant from National Health Department of China(2008ZX10005-009)Roche company
文摘Objective This study was undertaken to investigate the influencing factors on serum ALT level and hepatitis C virus(HCV)RNA titer in chronic hepatitis C(CHC)patients.Methods All patients enrolled into this study were anti-HCV positive.Retrospective tracing method was applied to detect serum ALT level and HCV RNA titer and to collect general information of the patients such as genders,age groups,interferon medication history,infection pathways,height and weight.Then the multi-factor analysis was adopted with the application of binominal logistic regression mode.Results The abnormal rate of ALT level was positively correlated to HCV RNA and gender while negatively correlated to interferon medication history and age group,with Wald value of the 4 factors as 39.604,11.823,18.991 and 7.389,respectively.The positive rate of HCV RNA was negatively correlated to interferon medication history and gender while positively correlated to ALT level,with corresponding Wald value of the 3 factors as81.394,7.618 and 27.562,respectively.Conclusions The normal ALT level in HCV infected patients was associated with viral load,age,gender and interferon medication history,while the normal rate of HCV RNA titer was closely associated with gender,interferon medication history and ALT level.
文摘Objective: Our object is to study risk factors of tumor patients’ PICC catheter-related blood stream infection. Method: a retrospective analysis of data of 586 PICC catheterized patients was implemented, a univariate analysis of general data and catheterizing data of tumor patients was then carried out, and data of single factors with statistical significance were incorporated into multi-factor Logistic regression model for analysis. Results: PICC catheter-related blood stream infection occurred to 16 patients, and occurrence rate was 2.73%. Multi-factor Logistic regression analysis results showed that number of puncturing times, positioning method and maintenance frequency were risk factors for tumor patients’ peripherally inserted central catheter catheter-related blood stream infection, and odds risk values were respectively 8.762, 9.253 and 10.324. Conclusion: for tumor patients implanted with peripherally inserted central catheters, using ECG positioning during strict sterile operation and catheterizing process to avoid repeated puncturing and increasing maintenance frequency could effectively reduce occurrence of PICC catheter-related blood stream infection.
文摘There are a variety of classification techniques such as neural network, decision tree, support vector machine and logistic regression. The problem of dimensionality is pertinent to many learning algorithms, and it denotes the drastic raise of computational complexity, however, we need to use dimensionality reduction methods. These methods include principal component analysis (PCA) and locality preserving projection (LPP). In many real-world classification problems, the local structure is more important than the global structure and dimensionality reduction techniques ignore the local structure and preserve the global structure. The objectives is to compare PCA and LPP in terms of accuracy, to develop appropriate representations of complex data by reducing the dimensions of the data and to explain the importance of using LPP with logistic regression. The results of this paper find that the proposed LPP approach provides a better representation and high accuracy than the PCA approach.
文摘Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers.