Cross-Project Defect Prediction(CPDP)is a method that utilizes historical data from other source projects to train predictive models for defect prediction in the target project.However,existing CPDP methods only consi...Cross-Project Defect Prediction(CPDP)is a method that utilizes historical data from other source projects to train predictive models for defect prediction in the target project.However,existing CPDP methods only consider linear correlations between features(indicators)of the source and target projects.These models are not capable of evaluating non-linear correlations between features when they exist,for example,when there are differences in data distributions between the source and target projects.As a result,the performance of such CPDP models is compromised.In this paper,this paper proposes a novel CPDP method based on Synthetic Minority Oversampling Technique(SMOTE)and Deep Canonical Correlation Analysis(DCCA),referred to as S-DCCA.Canonical Correlation Analysis(CCA)is employed to address the issue of non-linear correlations between features of the source and target projects.S-DCCA extends CCA by incorporating the MlpNet model for feature extraction from the dataset.The redundant features are then eliminated by maximizing the correlated feature subset using the CCA loss function.Finally,cross-project defect prediction is achieved through the application of the SMOTE data sampling technique.Area Under Curve(AUC)and F1 scores(F1)are used as evaluation metrics.This paper conducted experiments on 27 projects from four public datasets to validate the proposed method.The results demonstrate that,on average,our method outperforms all baseline approaches by at least 1.2%in AUC and 5.5%in F1 score.This indicates that the proposed method exhibits favorable performance characteristics.展开更多
[Objective] The study aimed at exploring the relationship among the agronomic characters of B. juncea in western China, in order to provide scientific basis for the breeding of B. juncea in western China. [Method] 39 ...[Objective] The study aimed at exploring the relationship among the agronomic characters of B. juncea in western China, in order to provide scientific basis for the breeding of B. juncea in western China. [Method] 39 B. juncea materials from western China were used for the canonical correlation analysis, and canonical correlations between each pair of the four ecological character (containing 18 variables) were verified, including yield characters (5 variables), caulis characters (6 variables), branch characters (3 variables) and pod characters (3 variables). [Result] Yield per plant of B. juncea in western China suffered a tremendous influence from effective pod number per plant while was not significantly affected by the total pod number per plant, seed number per pod and 1 000-seed weight; the most important character related with the yield character of B. juncea in western China was caulis character, followed by the branch character and pod character; yield characters, caulis characters, branch characters and pod characters of B. juncea in western China were closely correlated. [Conclusion] In order to improve the yield characters of B. juncea in western China, caulis characters should be focused on, followed by branch characters and pod characters; rapeseed varieties with high performance in total pod number per plant and effective pod number per plant should be chosen through the perspectives of effective branch number, plant height, pod number of main inflorescence, fruit stalk number of main inflorescence and other traits, while rapeseed varieties with high performance in seed number per pod and 1 000-seed weight should be chosen through the perspectives of beak length and other traits.展开更多
Quick, simple to perform, and cheap biomarkers were combined in a rapid assessment approach to measure the effects of metal pollutants, Cu, Cd, Pb and Zn in meadow burozem on wheat. Analysis of orthogonal design showe...Quick, simple to perform, and cheap biomarkers were combined in a rapid assessment approach to measure the effects of metal pollutants, Cu, Cd, Pb and Zn in meadow burozem on wheat. Analysis of orthogonal design showed that the significant zinc factor indicated both the inhibition rate of shoot mass and that of root elongation were affected by zinc(P<0.05 and P<0.01, respectively). The first toxicity canonical variable (TOXI), formed from the toxicity data set, explained 49% of the total variance in the toxicity data set; the first biological canonical variable(BIOL) explained 42% of the total variation in the biological data set. The correlation between the first canonical variables TOXI and BIOL (canonical correlation) was 0.94 (P<0.0001). Therefore, it is reliable and feasible to use the achievement to assess toxicity of heavy metal combined polluted soil using canonical analysis. Toxicity of soil combined polluted by heavy metals to plant community was estimated by comparing the IC 50 values describing the concentration needed to cause 50% decrease with grow rate compared to no metal addition. Environmental quality standard for soils prescribe that all these tested concentration of heavy metals in soil should not cause hazard and pollution ultimately, whereas it indicated that the soils in second grade cause more or less than 50% inhibition rates of wheat growth. So environmental quality standard for soils can be modified to include other features.展开更多
In order to identify the principal factors influencing soil water characteristics (SWC) and evaluate SWC effectively, the multivariate-statistical canonical correlation analysis (CCA) method was used to study and ...In order to identify the principal factors influencing soil water characteristics (SWC) and evaluate SWC effectively, the multivariate-statistical canonical correlation analysis (CCA) method was used to study and analyze the correlation between SWC and soil physical and chemical properties. Twenty-two soil samples were taken from 11 main tobacco-growing areas in Guizhou Province in China and the soil water characteristic curves (SWCC) and basic physical and chemical properties of the soil samples were determined. The results show that: (1) The soil bulk density, soil total porosity and soil capillary porosity have significant effects on SWC of tobacco fiels. Bulk density and total porosity are positively correlated with soil water retention characteristics (SWRC), and soil capillary porosity is positively correlated with soil water supply characteristics (SWSC). (2) Soil samples from different soil layers at the same soil sampling point show similarity or consistency in SWC. Inadequate soil water supply capability and imbalance between SWRC and SWSC are problems of tobacco soil. (3) The SWC of loamy clay are generally superior to those of silty clay loam.展开更多
Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information mor...Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information more comprehensively than traditional methods using a single-view.How to use hashing to combine multi-view data for image retrieval is still a challenge.In this paper,a multi-view fusion hashing method based on RKCCA(Random Kernel Canonical Correlation Analysis)is proposed.In order to describe image content more accurately,we use deep learning dense convolutional network feature DenseNet to construct multi-view by combining GIST feature or BoW_SIFT(Bag-of-Words model+SIFT feature)feature.This algorithm uses RKCCA method to fuse multi-view features to construct association features and apply them to image retrieval.The algorithm generates binary hash code with minimal distortion error by designing quantization regularization terms.A large number of experiments on benchmark datasets show that this method is superior to other multi-view hashing methods.展开更多
Distributed denial of service(DDoS)attacks launch more and more frequently and are more destructive.Feature representation as an important part of DDoS defense technology directly affects the efficiency of defense.Mos...Distributed denial of service(DDoS)attacks launch more and more frequently and are more destructive.Feature representation as an important part of DDoS defense technology directly affects the efficiency of defense.Most DDoS feature extraction methods cannot fully utilize the information of the original data,resulting in the extracted features losing useful features.In this paper,a DDoS feature representation method based on deep belief network(DBN)is proposed.We quantify the original data by the size of the network flows,the distribution of IP addresses and ports,and the diversity of packet sizes of different protocols and train the DBN in an unsupervised manner by these quantified values.Two feedforward neural networks(FFNN)are initialized by the trained deep belief network,and one of the feedforward neural networks continues to be trained in a supervised manner.The canonical correlation analysis(CCA)method is used to fuse the features extracted by two feedforward neural networks per layer.Experiments show that compared with other methods,the proposed method can extract better features.展开更多
By expanding the perturbation of covariance matrix in the powers of er-ror term,the influence functions for five canonical measurements in CCA are devel-oped and three sample versions are given.For generalized correla...By expanding the perturbation of covariance matrix in the powers of er-ror term,the influence functions for five canonical measurements in CCA are devel-oped and three sample versions are given.For generalized correlation coefficient p_z,the influence function is a quadratic form of r.v.z,and its distribution is considered.A practical example iUustrates the utility of the proposed influence functions.展开更多
In this paper, one of the most classical statistical methods, Canonical Correlation Analysis (CCA) is applied to identify quantitatively the driving forces of landuse structure in Yulin Prefecture. The main analysis i...In this paper, one of the most classical statistical methods, Canonical Correlation Analysis (CCA) is applied to identify quantitatively the driving forces of landuse structure in Yulin Prefecture. The main analysis is carried out through the software SPSS with the data on the level of towns and townships in 1992. The results indicate that landuse structure is determined by comprehensive action of different factors. Landuse structure with rural characteristics is mainly determined by geographical factors such as the elevation, temperature and precipitation, while the landuse structure with urban characteristics is mainly determined by demographic and socioeconomic conditions. At the same time, tests were carried out through the canonical correlation coefficient and redundancy analysis.展开更多
Canonical correlation analysis ( CCA ) based methods for low-resolution ( LR ) face recognition involve face images with different resolutions ( or multi-resolutions ), i.e.LR and high-resolution ( HR ) .For single-re...Canonical correlation analysis ( CCA ) based methods for low-resolution ( LR ) face recognition involve face images with different resolutions ( or multi-resolutions ), i.e.LR and high-resolution ( HR ) .For single-resolution face recognition , researchers have shown that utilizing spatial information is beneficial to improving the recognition accuracy , mainly because the pixels of each face are not independent but spatially correlated.However , for a multi-resolution scenario , there are no related works.Therefore , a method named spatial regularization of canonical correlation analysis ( SRCCA ) is developed for LR face recognition to improve the performance of CCA by the regularization utilizing spatial information of different resolution faces.Furthermore , the impact of LR and HR spatial regularization terms on LR face recognition is analyzed through experiments.展开更多
This paper uses canonical piecewise-linear analysis method to analyze nonlinear DC fault circuitsand solve for the values of the test port voltages which are selected beforehand .The method needs lessmemory storages,o...This paper uses canonical piecewise-linear analysis method to analyze nonlinear DC fault circuitsand solve for the values of the test port voltages which are selected beforehand .The method needs lessmemory storages,obtains the results in finite steps and has high efficiency in computation.It can be appliedto the circuits containing multiport nonlinear elements.It is a good method of pre-test analysis for fault cir-cuits in simulation-before-test aproach in analogue circuit diagnosis.展开更多
To understand the status of phytoplankton community of the Genhe River in the summer of 2015,we investigated the phytoplankton in Genhe River. We identified 5 phyla and 36 species,among which Bacillariophyta(23) were ...To understand the status of phytoplankton community of the Genhe River in the summer of 2015,we investigated the phytoplankton in Genhe River. We identified 5 phyla and 36 species,among which Bacillariophyta(23) were the most,followed by Chlorophyta(10),Cyanophyta(1),Chrysophyta(1),Pyrrophyta(1). The phytoplankton abundance was(15. 6-810) × 104 ind·L^(-1); the biomass was(0. 07-2. 876) mg·L-1; Shannon-wiener index was 1. 05-3. 24; Pielou evenness index was 0. 27-0. 96. Using Shannon-wiener index and Pielou index,the water quality of Genhe River was assessed,and the results showed that the water quality was the best at 5# sampling point,the water quality was good in 3#,4#,7#,8#,9# sampling points,and there was a state of pollution at other sampling points. Canonical correspondence analysis and Pearson correlation analysis showed that iron ion,transparency,p H value,water depth and water temperature were important environmental factors that affect the distribution of phytoplankton,and copper ion,nitrite ion and COD also significantly affected the distribution of phytoplankton.展开更多
Voice conversion algorithm aims to provide high level of similarity to the target voice with an acceptable level of quality.The main object of this paper was to build a nonlinear relationship between the parameters fo...Voice conversion algorithm aims to provide high level of similarity to the target voice with an acceptable level of quality.The main object of this paper was to build a nonlinear relationship between the parameters for the acoustical features of source and target speaker using Non-Linear Canonical Correlation Analysis(NLCCA) based on jointed Gaussian mixture model.Speaker indi-viduality transformation was achieved mainly by altering vocal tract characteristics represented by Line Spectral Frequencies(LSF).To obtain the transformed speech which sounded more like the target voices,prosody modification is involved through residual prediction.Both objective and subjective evaluations were conducted.The experimental results demonstrated that our proposed algorithm was effective and outperformed the conventional conversion method utilized by the Minimum Mean Square Error(MMSE) estimation.展开更多
Correlation analysis as used by meteorologists and oceanographers is a tool for the analysisof the spacial or temporal variability of physical fields. In his notes, Dr. Hasselmann pro-posed to combine correlation anal...Correlation analysis as used by meteorologists and oceanographers is a tool for the analysisof the spacial or temporal variability of physical fields. In his notes, Dr. Hasselmann pro-posed to combine correlation analysis and linear regression analysis in climate prediction re-search. The main idea is to decompose the physical field into its principal oscillation patterns.展开更多
A novel algorithm for voice conversion is proposed in this paper. The mapping function of spectral vectors of the source and target speakers is calculated by the Canonical Correlation Analysis (CCA) estimation based o...A novel algorithm for voice conversion is proposed in this paper. The mapping function of spectral vectors of the source and target speakers is calculated by the Canonical Correlation Analysis (CCA) estimation based on Gaussian mixture models. Since the spectral envelope feature remains a majority of second order statistical information contained in speech after Linear Prediction Coding (LPC) analysis, the CCA method is more suitable for spectral conversion than Minimum Mean Square Error (MMSE) because CCA explicitly considers the variance of each component of the spectral vectors during conversion procedure. Both objective evaluations and subjective listening tests are conducted. The experimental results demonstrate that the proposed scheme can achieve better per- formance than the previous method which uses MMSE estimation criterion.展开更多
To solve the problem of multiple moving sources passive location, a novel blind source separa- tion (BSS) algorithm based on the muhiset canonical correlation analysis (MCCA) is presented by exploiting the differe...To solve the problem of multiple moving sources passive location, a novel blind source separa- tion (BSS) algorithm based on the muhiset canonical correlation analysis (MCCA) is presented by exploiting the different temporal structure of uncorrelated source signals first, and then on the basis of this algorithm, a novel multiple moving sources passive location method is proposed using time difference of arrival (TDOA) and frequency difference of arrival (FDOA) measurements. The key technique of this location method is TDOA and FDOA joint estimation, which is based on BSS. By blindly separating mixed signals from multiple moving sources, the multiple sources location problem can be translated to each source location in turn, and the effect of interference and noise can also he removed. The simulation results illustrate that the performance of the MCCA algorithm is very good with relatively light computation burden, and the location algorithm is relatively simple and effective.展开更多
The teleconnection distribution characteristics of sea surface temperature (SST) over the India Ocean and the precipitation during rainy season in China were studied by using the methods of EOF and CCA. The results in...The teleconnection distribution characteristics of sea surface temperature (SST) over the India Ocean and the precipitation during rainy season in China were studied by using the methods of EOF and CCA. The results indicate that the change of SST field will affect the change of rain belt during rainy seasons in China, and greatly affect the precipitation in northwest and southwest China, the Yangzi and Yellow River downstream basins. Strong signal phenomena of SSTA over India Ocean were revealed that showed the anoma-lous distribution of drought and flood in China. It shows that the precipitation during rainy seasons in China may be forecast by analyzing SST distribution characteristics over the India Ocean.展开更多
Amorphous and crystalline poly (chloro-p-xytylene) (PPX C) membranes are constructed by using a novel com- putational technique, that is, a combined method of NVT+NPT-molecular dynamics (MD) and gradually reduc...Amorphous and crystalline poly (chloro-p-xytylene) (PPX C) membranes are constructed by using a novel com- putational technique, that is, a combined method of NVT+NPT-molecular dynamics (MD) and gradually reducing the size (GRS) methods. The related free volumes are defined as homology clusters. Then the sorption and the permeation of gases in PPX C polymers are studied using grand canonical Monte Carlo (GCMC) and NVT-MD methods. The results show that the crystalline PPX C membranes provide smaller free volumes for absorbing or transferring gases relative to the amorphous PPX C area. The gas sorption in PPX C membranes mainly belongs to the physical one, and H bonds can appear obviously in the amorphous area. By cluster analyzing on the mean square displacement of gases, we find that gases walk along the x axis in the crystalline area and walk randomly in the amorphous area. The calculated permeability coefficients are close to the experimental data.展开更多
<strong>Objective:</strong> To understand the influencing factors of job burnout among nurses in Haikou 3A hospital and explore its direct and indirect effects, so as to provide a scientific basis for the ...<strong>Objective:</strong> To understand the influencing factors of job burnout among nurses in Haikou 3A hospital and explore its direct and indirect effects, so as to provide a scientific basis for the work efficiency of nursing staff. <strong>Methods:</strong> Between November 2, 2015 and November 2015, using multi stage random sampling, self-administered questionnaire survey was conducted among 1049 nursing staff, using the path analysis method to study the effect of direct and indirect factors effect. <strong>Results:</strong> The total score of job burnout of nurses was 38.44 ± 7.55, high occupational burnout was 0.9%, moderate occupational burnout was 66.5%, and low occupational burnout was 32.6%. The scores of job burnout were compared among the nurses with different titles, and less achievement (F = 8.342, P < 0.001) and depersonalization (F = 3.12, P = 0.025) were statistically significant. Nurses’ Job Burnout and job stressors were the first, and the canonical correlation coefficient was 0.4397 (F = 20.54, P < 0.0001), indicating that the more problems existed in patient care, the greater the degree of emotional exhaustion. The first canonical correlation coefficient of job burnout and job satisfaction of nurses was 0.3791 (F = 12.8, P < 0.0001), indicating that the better the family and work balance, the less individualized nurses were. The path analysis results showed that the 4 dimensions of job stressors (management and interpersonal problems) is positive, the direct effect of the strongest (0.219), the total effect of sort of work pressure source of 4 dimensions (0.245) > 5 dimensions of work pressure source (0.125) > title (<span style="white-space:nowrap;"><span style="white-space:nowrap;">−</span></span>0.112) job satisfaction scores (<span style="white-space:nowrap;"><span style="white-space:nowrap;">−</span></span>0.097). <strong>Conclusion:</strong> Job stress, job satisfaction and job title are the factors that affect job burnout. The 4 and the direct and indirect effects of job stressors are the strongest, and measures should be taken to solve these problems.展开更多
Biometric gait recognition is a lesser-known but emerging and effective biometric recognition method which enables subjects’walking patterns to be recognized.Existing research in this area has primarily focused on fe...Biometric gait recognition is a lesser-known but emerging and effective biometric recognition method which enables subjects’walking patterns to be recognized.Existing research in this area has primarily focused on feature analysis through the extraction of individual features,which captures most of the information but fails to capture subtle variations in gait dynamics.Therefore,a novel feature taxonomy and an approach for deriving a relationship between a function of one set of gait features with another set are introduced.The gait features extracted from body halves divided by anatomical planes on vertical,horizontal,and diagonal axes are grouped to form canonical gait covariates.Canonical Correlation Analysis is utilized to measure the strength of association between the canonical covariates of gait.Thus,gait assessment and identification are enhancedwhenmore semantic information is available through CCA-basedmulti-feature fusion.Hence,CarnegieMellon University’s 3D gait database,which contains 32 gait samples taken at different paces,is utilized in analyzing gait characteristics.The performance of Linear Discriminant Analysis,K-Nearest Neighbors,Naive Bayes,Artificial Neural Networks,and Support Vector Machines was improved by a 4%average when the CCA-utilized gait identification approachwas used.Asignificant maximumaccuracy rate of 97.8%was achieved throughCCA-based gait identification.Beyond that,the rate of false identifications and unrecognized gaits went down to half,demonstrating state-of-the-art for gait identification.展开更多
Inferential models are widely used in the chemical industry to infer key process variables, which are challenging or expensive to measure, from other more easily measured variables. The aim of this paper is three-fold...Inferential models are widely used in the chemical industry to infer key process variables, which are challenging or expensive to measure, from other more easily measured variables. The aim of this paper is three-fold: to present a theoretical review of some of the well known linear inferential modeling techniques, to enhance the predictive ability of the regularized canonical correlation analysis (RCCA) method, and finally to compare the performances of these techniques and highlight some of the practical issues that can affect their predictive abilities. The inferential modeling techniques considered in this study include full rank modeling techniques, such as ordinary least square (OLS) regression and ridge regression (RR), and latent variable regression (LVR) techniques, such as principal component regression (PCR), partial least squares (PLS) regression, and regularized canonical correlation analysis (RCCA). The theoretical analysis shows that the loading vectors used in LVR modeling can be computed by solving eigenvalue problems. Also, for the RCCA method, we show that by optimizing the regularization parameter, an improvement in prediction accuracy can be achieved over other modeling techniques. To illustrate the performances of all inferential modeling techniques, a comparative analysis was performed through two simulated examples, one using synthetic data and the other using simulated distillation column data. All techniques are optimized and compared by computing the cross validation mean square error using unseen testing data. The results of this comparative analysis show that scaling the data helps improve the performances of all modeling techniques, and that the LVR techniques outperform the full rank ones. One reason for this advantage is that the LVR techniques improve the conditioning of the model by discarding the latent variables (or principal components) with small eigenvalues, which also reduce the effect of the noise on the model prediction. The results also show that PCR and PLS have comparable performances, and that RCCA can provide an advantage by optimizing its regularization parameter.展开更多
基金NationalNatural Science Foundation of China,Grant/AwardNumber:61867004National Natural Science Foundation of China Youth Fund,Grant/Award Number:41801288.
文摘Cross-Project Defect Prediction(CPDP)is a method that utilizes historical data from other source projects to train predictive models for defect prediction in the target project.However,existing CPDP methods only consider linear correlations between features(indicators)of the source and target projects.These models are not capable of evaluating non-linear correlations between features when they exist,for example,when there are differences in data distributions between the source and target projects.As a result,the performance of such CPDP models is compromised.In this paper,this paper proposes a novel CPDP method based on Synthetic Minority Oversampling Technique(SMOTE)and Deep Canonical Correlation Analysis(DCCA),referred to as S-DCCA.Canonical Correlation Analysis(CCA)is employed to address the issue of non-linear correlations between features of the source and target projects.S-DCCA extends CCA by incorporating the MlpNet model for feature extraction from the dataset.The redundant features are then eliminated by maximizing the correlated feature subset using the CCA loss function.Finally,cross-project defect prediction is achieved through the application of the SMOTE data sampling technique.Area Under Curve(AUC)and F1 scores(F1)are used as evaluation metrics.This paper conducted experiments on 27 projects from four public datasets to validate the proposed method.The results demonstrate that,on average,our method outperforms all baseline approaches by at least 1.2%in AUC and 5.5%in F1 score.This indicates that the proposed method exhibits favorable performance characteristics.
基金Supported by National Natural Science Foundation(30760122)National High-Tech Research and Development Program(863Program)(2009AA101105)+1 种基金Faculty Construction of 211 Project(SZTD-211-02)Project of Introducing Advanced Agricultural Science and Technology of Ministry of Agriculture(948Program)(2010-Z54)~~
文摘[Objective] The study aimed at exploring the relationship among the agronomic characters of B. juncea in western China, in order to provide scientific basis for the breeding of B. juncea in western China. [Method] 39 B. juncea materials from western China were used for the canonical correlation analysis, and canonical correlations between each pair of the four ecological character (containing 18 variables) were verified, including yield characters (5 variables), caulis characters (6 variables), branch characters (3 variables) and pod characters (3 variables). [Result] Yield per plant of B. juncea in western China suffered a tremendous influence from effective pod number per plant while was not significantly affected by the total pod number per plant, seed number per pod and 1 000-seed weight; the most important character related with the yield character of B. juncea in western China was caulis character, followed by the branch character and pod character; yield characters, caulis characters, branch characters and pod characters of B. juncea in western China were closely correlated. [Conclusion] In order to improve the yield characters of B. juncea in western China, caulis characters should be focused on, followed by branch characters and pod characters; rapeseed varieties with high performance in total pod number per plant and effective pod number per plant should be chosen through the perspectives of effective branch number, plant height, pod number of main inflorescence, fruit stalk number of main inflorescence and other traits, while rapeseed varieties with high performance in seed number per pod and 1 000-seed weight should be chosen through the perspectives of beak length and other traits.
文摘Quick, simple to perform, and cheap biomarkers were combined in a rapid assessment approach to measure the effects of metal pollutants, Cu, Cd, Pb and Zn in meadow burozem on wheat. Analysis of orthogonal design showed that the significant zinc factor indicated both the inhibition rate of shoot mass and that of root elongation were affected by zinc(P<0.05 and P<0.01, respectively). The first toxicity canonical variable (TOXI), formed from the toxicity data set, explained 49% of the total variance in the toxicity data set; the first biological canonical variable(BIOL) explained 42% of the total variation in the biological data set. The correlation between the first canonical variables TOXI and BIOL (canonical correlation) was 0.94 (P<0.0001). Therefore, it is reliable and feasible to use the achievement to assess toxicity of heavy metal combined polluted soil using canonical analysis. Toxicity of soil combined polluted by heavy metals to plant community was estimated by comparing the IC 50 values describing the concentration needed to cause 50% decrease with grow rate compared to no metal addition. Environmental quality standard for soils prescribe that all these tested concentration of heavy metals in soil should not cause hazard and pollution ultimately, whereas it indicated that the soils in second grade cause more or less than 50% inhibition rates of wheat growth. So environmental quality standard for soils can be modified to include other features.
基金supported by the National Key High-Tech Program (863) of China (Grant No. 2006AA10Z271)the Key Project of the Guizhou Tobacco Monopoly Administration (2007-7)
文摘In order to identify the principal factors influencing soil water characteristics (SWC) and evaluate SWC effectively, the multivariate-statistical canonical correlation analysis (CCA) method was used to study and analyze the correlation between SWC and soil physical and chemical properties. Twenty-two soil samples were taken from 11 main tobacco-growing areas in Guizhou Province in China and the soil water characteristic curves (SWCC) and basic physical and chemical properties of the soil samples were determined. The results show that: (1) The soil bulk density, soil total porosity and soil capillary porosity have significant effects on SWC of tobacco fiels. Bulk density and total porosity are positively correlated with soil water retention characteristics (SWRC), and soil capillary porosity is positively correlated with soil water supply characteristics (SWSC). (2) Soil samples from different soil layers at the same soil sampling point show similarity or consistency in SWC. Inadequate soil water supply capability and imbalance between SWRC and SWSC are problems of tobacco soil. (3) The SWC of loamy clay are generally superior to those of silty clay loam.
基金This work is supported by the National Natural Science Foundation of China(No.61772561)the Key Research&Development Plan of Hunan Province(No.2018NK2012)+1 种基金the Science Research Projects of Hunan Provincial Education Department(Nos.18A174,18C0262)the Science&Technology Innovation Platform and Talent Plan of Hunan Province(2017TP1022).
文摘Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information more comprehensively than traditional methods using a single-view.How to use hashing to combine multi-view data for image retrieval is still a challenge.In this paper,a multi-view fusion hashing method based on RKCCA(Random Kernel Canonical Correlation Analysis)is proposed.In order to describe image content more accurately,we use deep learning dense convolutional network feature DenseNet to construct multi-view by combining GIST feature or BoW_SIFT(Bag-of-Words model+SIFT feature)feature.This algorithm uses RKCCA method to fuse multi-view features to construct association features and apply them to image retrieval.The algorithm generates binary hash code with minimal distortion error by designing quantization regularization terms.A large number of experiments on benchmark datasets show that this method is superior to other multi-view hashing methods.
基金supported by the National Natural Science Foundation of Hainan(2018CXTD333,617048)National Natural Science Foundation of China(61762033,61702539)+4 种基金The National Natural Science Foundation of Hunan(2018JJ3611)Social Development Project of Public Welfare Technology Application of Zhejiang Province(LGF18F020019)Hainan University Doctor Start Fund Project(kyqd1328)Hainan University Youth Fund Project(qnjj1444)State Key Laboratory of Marine Resource Utilization in South China Sea Funding.
文摘Distributed denial of service(DDoS)attacks launch more and more frequently and are more destructive.Feature representation as an important part of DDoS defense technology directly affects the efficiency of defense.Most DDoS feature extraction methods cannot fully utilize the information of the original data,resulting in the extracted features losing useful features.In this paper,a DDoS feature representation method based on deep belief network(DBN)is proposed.We quantify the original data by the size of the network flows,the distribution of IP addresses and ports,and the diversity of packet sizes of different protocols and train the DBN in an unsupervised manner by these quantified values.Two feedforward neural networks(FFNN)are initialized by the trained deep belief network,and one of the feedforward neural networks continues to be trained in a supervised manner.The canonical correlation analysis(CCA)method is used to fuse the features extracted by two feedforward neural networks per layer.Experiments show that compared with other methods,the proposed method can extract better features.
文摘By expanding the perturbation of covariance matrix in the powers of er-ror term,the influence functions for five canonical measurements in CCA are devel-oped and three sample versions are given.For generalized correlation coefficient p_z,the influence function is a quadratic form of r.v.z,and its distribution is considered.A practical example iUustrates the utility of the proposed influence functions.
文摘In this paper, one of the most classical statistical methods, Canonical Correlation Analysis (CCA) is applied to identify quantitatively the driving forces of landuse structure in Yulin Prefecture. The main analysis is carried out through the software SPSS with the data on the level of towns and townships in 1992. The results indicate that landuse structure is determined by comprehensive action of different factors. Landuse structure with rural characteristics is mainly determined by geographical factors such as the elevation, temperature and precipitation, while the landuse structure with urban characteristics is mainly determined by demographic and socioeconomic conditions. At the same time, tests were carried out through the canonical correlation coefficient and redundancy analysis.
基金Supported by the National Natural Science Foundation of China(6117015161070133+2 种基金60903130)the Natural Science Research Project of Higher Education of Jiangsu Province(12KJB520018)the Research Foundation of Nanjing University of Aeronautics and Astronautics(NP2011030)
文摘Canonical correlation analysis ( CCA ) based methods for low-resolution ( LR ) face recognition involve face images with different resolutions ( or multi-resolutions ), i.e.LR and high-resolution ( HR ) .For single-resolution face recognition , researchers have shown that utilizing spatial information is beneficial to improving the recognition accuracy , mainly because the pixels of each face are not independent but spatially correlated.However , for a multi-resolution scenario , there are no related works.Therefore , a method named spatial regularization of canonical correlation analysis ( SRCCA ) is developed for LR face recognition to improve the performance of CCA by the regularization utilizing spatial information of different resolution faces.Furthermore , the impact of LR and HR spatial regularization terms on LR face recognition is analyzed through experiments.
文摘This paper uses canonical piecewise-linear analysis method to analyze nonlinear DC fault circuitsand solve for the values of the test port voltages which are selected beforehand .The method needs lessmemory storages,obtains the results in finite steps and has high efficiency in computation.It can be appliedto the circuits containing multiport nonlinear elements.It is a good method of pre-test analysis for fault cir-cuits in simulation-before-test aproach in analogue circuit diagnosis.
基金Supported by Supported by the United Nations Environment Program(DXAL-2014-002)
文摘To understand the status of phytoplankton community of the Genhe River in the summer of 2015,we investigated the phytoplankton in Genhe River. We identified 5 phyla and 36 species,among which Bacillariophyta(23) were the most,followed by Chlorophyta(10),Cyanophyta(1),Chrysophyta(1),Pyrrophyta(1). The phytoplankton abundance was(15. 6-810) × 104 ind·L^(-1); the biomass was(0. 07-2. 876) mg·L-1; Shannon-wiener index was 1. 05-3. 24; Pielou evenness index was 0. 27-0. 96. Using Shannon-wiener index and Pielou index,the water quality of Genhe River was assessed,and the results showed that the water quality was the best at 5# sampling point,the water quality was good in 3#,4#,7#,8#,9# sampling points,and there was a state of pollution at other sampling points. Canonical correspondence analysis and Pearson correlation analysis showed that iron ion,transparency,p H value,water depth and water temperature were important environmental factors that affect the distribution of phytoplankton,and copper ion,nitrite ion and COD also significantly affected the distribution of phytoplankton.
基金Supported by the National High Technology Research and Development Program of China (863 Program,No.2006AA010102)
文摘Voice conversion algorithm aims to provide high level of similarity to the target voice with an acceptable level of quality.The main object of this paper was to build a nonlinear relationship between the parameters for the acoustical features of source and target speaker using Non-Linear Canonical Correlation Analysis(NLCCA) based on jointed Gaussian mixture model.Speaker indi-viduality transformation was achieved mainly by altering vocal tract characteristics represented by Line Spectral Frequencies(LSF).To obtain the transformed speech which sounded more like the target voices,prosody modification is involved through residual prediction.Both objective and subjective evaluations were conducted.The experimental results demonstrated that our proposed algorithm was effective and outperformed the conventional conversion method utilized by the Minimum Mean Square Error(MMSE) estimation.
文摘Correlation analysis as used by meteorologists and oceanographers is a tool for the analysisof the spacial or temporal variability of physical fields. In his notes, Dr. Hasselmann pro-posed to combine correlation analysis and linear regression analysis in climate prediction re-search. The main idea is to decompose the physical field into its principal oscillation patterns.
文摘A novel algorithm for voice conversion is proposed in this paper. The mapping function of spectral vectors of the source and target speakers is calculated by the Canonical Correlation Analysis (CCA) estimation based on Gaussian mixture models. Since the spectral envelope feature remains a majority of second order statistical information contained in speech after Linear Prediction Coding (LPC) analysis, the CCA method is more suitable for spectral conversion than Minimum Mean Square Error (MMSE) because CCA explicitly considers the variance of each component of the spectral vectors during conversion procedure. Both objective evaluations and subjective listening tests are conducted. The experimental results demonstrate that the proposed scheme can achieve better per- formance than the previous method which uses MMSE estimation criterion.
基金Supported by the National High Technology Research and Development Program of China(No.2009AAJ116,2009AAJ208,2010AA7010422)the National Science Foundation for Post-Doctoral Scientists of China(No.20080431379,200902671)the Hubei Natural Science Foundation(No.2009CDB031)
文摘To solve the problem of multiple moving sources passive location, a novel blind source separa- tion (BSS) algorithm based on the muhiset canonical correlation analysis (MCCA) is presented by exploiting the different temporal structure of uncorrelated source signals first, and then on the basis of this algorithm, a novel multiple moving sources passive location method is proposed using time difference of arrival (TDOA) and frequency difference of arrival (FDOA) measurements. The key technique of this location method is TDOA and FDOA joint estimation, which is based on BSS. By blindly separating mixed signals from multiple moving sources, the multiple sources location problem can be translated to each source location in turn, and the effect of interference and noise can also he removed. The simulation results illustrate that the performance of the MCCA algorithm is very good with relatively light computation burden, and the location algorithm is relatively simple and effective.
基金Mechanisms for important climatic catastrophes in China and theoretic study of the predic-tion" a project first set off in the "Plan for developing key national fundamental research" Project 97D033Q of Application Fund by the Science and Technology F
文摘The teleconnection distribution characteristics of sea surface temperature (SST) over the India Ocean and the precipitation during rainy season in China were studied by using the methods of EOF and CCA. The results indicate that the change of SST field will affect the change of rain belt during rainy seasons in China, and greatly affect the precipitation in northwest and southwest China, the Yangzi and Yellow River downstream basins. Strong signal phenomena of SSTA over India Ocean were revealed that showed the anoma-lous distribution of drought and flood in China. It shows that the precipitation during rainy seasons in China may be forecast by analyzing SST distribution characteristics over the India Ocean.
基金Project supported by the National Natural Science Foundation (Grant No. 11011120241 and 11076002)the China Academy of Engineering Physics "Double Hundred Talents Project" Candidates Optional Subjects (Grant Nos. 2008Rc01 and ZX03010)the China Academy of Engineering Physics Science and Technology Development Fund (Grant No. 2010A0302012)
文摘Amorphous and crystalline poly (chloro-p-xytylene) (PPX C) membranes are constructed by using a novel com- putational technique, that is, a combined method of NVT+NPT-molecular dynamics (MD) and gradually reducing the size (GRS) methods. The related free volumes are defined as homology clusters. Then the sorption and the permeation of gases in PPX C polymers are studied using grand canonical Monte Carlo (GCMC) and NVT-MD methods. The results show that the crystalline PPX C membranes provide smaller free volumes for absorbing or transferring gases relative to the amorphous PPX C area. The gas sorption in PPX C membranes mainly belongs to the physical one, and H bonds can appear obviously in the amorphous area. By cluster analyzing on the mean square displacement of gases, we find that gases walk along the x axis in the crystalline area and walk randomly in the amorphous area. The calculated permeability coefficients are close to the experimental data.
文摘<strong>Objective:</strong> To understand the influencing factors of job burnout among nurses in Haikou 3A hospital and explore its direct and indirect effects, so as to provide a scientific basis for the work efficiency of nursing staff. <strong>Methods:</strong> Between November 2, 2015 and November 2015, using multi stage random sampling, self-administered questionnaire survey was conducted among 1049 nursing staff, using the path analysis method to study the effect of direct and indirect factors effect. <strong>Results:</strong> The total score of job burnout of nurses was 38.44 ± 7.55, high occupational burnout was 0.9%, moderate occupational burnout was 66.5%, and low occupational burnout was 32.6%. The scores of job burnout were compared among the nurses with different titles, and less achievement (F = 8.342, P < 0.001) and depersonalization (F = 3.12, P = 0.025) were statistically significant. Nurses’ Job Burnout and job stressors were the first, and the canonical correlation coefficient was 0.4397 (F = 20.54, P < 0.0001), indicating that the more problems existed in patient care, the greater the degree of emotional exhaustion. The first canonical correlation coefficient of job burnout and job satisfaction of nurses was 0.3791 (F = 12.8, P < 0.0001), indicating that the better the family and work balance, the less individualized nurses were. The path analysis results showed that the 4 dimensions of job stressors (management and interpersonal problems) is positive, the direct effect of the strongest (0.219), the total effect of sort of work pressure source of 4 dimensions (0.245) > 5 dimensions of work pressure source (0.125) > title (<span style="white-space:nowrap;"><span style="white-space:nowrap;">−</span></span>0.112) job satisfaction scores (<span style="white-space:nowrap;"><span style="white-space:nowrap;">−</span></span>0.097). <strong>Conclusion:</strong> Job stress, job satisfaction and job title are the factors that affect job burnout. The 4 and the direct and indirect effects of job stressors are the strongest, and measures should be taken to solve these problems.
基金supported by Istanbul University Scientific Research Project Department with IRP-51706 Project Number.
文摘Biometric gait recognition is a lesser-known but emerging and effective biometric recognition method which enables subjects’walking patterns to be recognized.Existing research in this area has primarily focused on feature analysis through the extraction of individual features,which captures most of the information but fails to capture subtle variations in gait dynamics.Therefore,a novel feature taxonomy and an approach for deriving a relationship between a function of one set of gait features with another set are introduced.The gait features extracted from body halves divided by anatomical planes on vertical,horizontal,and diagonal axes are grouped to form canonical gait covariates.Canonical Correlation Analysis is utilized to measure the strength of association between the canonical covariates of gait.Thus,gait assessment and identification are enhancedwhenmore semantic information is available through CCA-basedmulti-feature fusion.Hence,CarnegieMellon University’s 3D gait database,which contains 32 gait samples taken at different paces,is utilized in analyzing gait characteristics.The performance of Linear Discriminant Analysis,K-Nearest Neighbors,Naive Bayes,Artificial Neural Networks,and Support Vector Machines was improved by a 4%average when the CCA-utilized gait identification approachwas used.Asignificant maximumaccuracy rate of 97.8%was achieved throughCCA-based gait identification.Beyond that,the rate of false identifications and unrecognized gaits went down to half,demonstrating state-of-the-art for gait identification.
文摘Inferential models are widely used in the chemical industry to infer key process variables, which are challenging or expensive to measure, from other more easily measured variables. The aim of this paper is three-fold: to present a theoretical review of some of the well known linear inferential modeling techniques, to enhance the predictive ability of the regularized canonical correlation analysis (RCCA) method, and finally to compare the performances of these techniques and highlight some of the practical issues that can affect their predictive abilities. The inferential modeling techniques considered in this study include full rank modeling techniques, such as ordinary least square (OLS) regression and ridge regression (RR), and latent variable regression (LVR) techniques, such as principal component regression (PCR), partial least squares (PLS) regression, and regularized canonical correlation analysis (RCCA). The theoretical analysis shows that the loading vectors used in LVR modeling can be computed by solving eigenvalue problems. Also, for the RCCA method, we show that by optimizing the regularization parameter, an improvement in prediction accuracy can be achieved over other modeling techniques. To illustrate the performances of all inferential modeling techniques, a comparative analysis was performed through two simulated examples, one using synthetic data and the other using simulated distillation column data. All techniques are optimized and compared by computing the cross validation mean square error using unseen testing data. The results of this comparative analysis show that scaling the data helps improve the performances of all modeling techniques, and that the LVR techniques outperform the full rank ones. One reason for this advantage is that the LVR techniques improve the conditioning of the model by discarding the latent variables (or principal components) with small eigenvalues, which also reduce the effect of the noise on the model prediction. The results also show that PCR and PLS have comparable performances, and that RCCA can provide an advantage by optimizing its regularization parameter.