The application of support vector machines to forecasting problems is becoming popular, lately. Several comparisons between neural networks trained with error backpropagation and support vector machines have shown adv...The application of support vector machines to forecasting problems is becoming popular, lately. Several comparisons between neural networks trained with error backpropagation and support vector machines have shown advantage for the latter in different domains of application. However, some difficulties still deteriorate the performance of the support vector machines. The main one is related to the setting of the hyperparameters involved in their training. Techniques based on meta-heuristics have been employed to determine appropriate values for those hyperparameters. However, because of the high noneonvexity of this estimation problem, which makes the search for a good solution very hard, an approach based on Bayesian inference, called relevance vector machine, has been proposed more recently. The present paper aims at investigating the suitability of this new approach to the short-term load forecasting problem.展开更多
A novel diversity-sampling based nonparametric multi-modal background model is proposed. Using the samples having more popular and various intensity values in the training sequence, a nonparametric model is built for ...A novel diversity-sampling based nonparametric multi-modal background model is proposed. Using the samples having more popular and various intensity values in the training sequence, a nonparametric model is built for background subtraction. According to the related intensifies, different weights are given to the distinct samples in kernel density estimation. This avoids repeated computation using all samples, and makes computation more efficient in the evaluation phase. Experimental results show the validity of the diversity- sampling scheme and robustness of the proposed model in moving objects segmentation. The proposed algorithm can be used in outdoor surveillance systems.展开更多
Panel data combine cross-section data and time series data. If the cross-section is locations, there is a need to check the correlation among locations. ρ and λ are parameters in generalized spatial model to cover e...Panel data combine cross-section data and time series data. If the cross-section is locations, there is a need to check the correlation among locations. ρ and λ are parameters in generalized spatial model to cover effect of correlation between locations. Value of ρ or λ will influence the goodness of fit model, so it is important to make parameter estimation. The effect of another location is covered by making contiguity matrix until it gets spatial weighted matrix (W). There are some types of W—uniform W, binary W, kernel Gaussian W and some W from real case of economics condition or transportation condition from locations. This study is aimed to compare uniform W and kernel Gaussian W in spatial panel data model using RMSE value. The result of analysis showed that uniform weight had RMSE value less than kernel Gaussian model. Uniform W had stabil value for all the combinations.展开更多
A recursive Kernel eigenspace updating algorithm was proposed to build the soft sensor for end-product quality.The updating procedure was composed of two sub-stages,i.e.firstly performing forward increasing updating a...A recursive Kernel eigenspace updating algorithm was proposed to build the soft sensor for end-product quality.The updating procedure was composed of two sub-stages,i.e.firstly performing forward increasing updating and then followed by backward decreasing updating,which drastically decreased the required computation workload.Further,the whole Kernel matrix did not need to be stored.Simulation study on the Tennessee Eastman process showed that the consequent impurity component model had satisfying precision under both normal and faulty operations,which was obviously superior to the offline batch model and meanwhile approximated the performance of model obtained by successively applying the time-consuming traditional eigenvalue numerical algorithm.展开更多
Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid m...Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid model,namely box counting dimension-based kernel logistic regression model,which uses fractal dimension calculated by box counting method as input data based on grid cells mapping unit and terrain mapping unit.The performance of this model was evaluated in the application in Zhidan County,Shaanxi Province,China.Firstly,a total of 221 landslides were identified and mapped,and 11 landslide predisposing factors were considered.Secondly,the landslide susceptibility maps(LSMs) of the study area were obtained by constructing the model on two different mapping units.Finally,the results were evaluated with five statistical indexes,sensitivity,specificity,positive predictive value(PPV),negative predictive value(NPV) and Accuracy.The statistical indexes of the model obtained on the terrain mapping unit were larger than those based on grid cells mapping unit.For training and validation datasets,the area under the receiver operating characteristic curve(AUC) of the model based on terrain mapping unit were 0.9374 and 0.9527,respectively,indicating that establishing this model on the terrain mapping unit was advantageous in the study area.The results show that the fractal dimension improves the prediction ability of the kernel logistic model.In addition,the terrain mapping unit is a more promising mapping unit in Loess areas.展开更多
QSPR models of PCDD/Fs were generated by means of kernel partial least squares. The molecular distance-edge vector method was used as descriptors to get model I for predicting PCDD/Fs retention behavior. The chlorinat...QSPR models of PCDD/Fs were generated by means of kernel partial least squares. The molecular distance-edge vector method was used as descriptors to get model I for predicting PCDD/Fs retention behavior. The chlorinated positions were also used and model II was obtained. In studied cases, the predictive ability of the KPLS model is comparable or superior to those of PLS and ANN. The results indicate that KPLS can be used as an alternative powerful modeling tool for QSPR studies.展开更多
The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. T...The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.展开更多
This work focuses on drop breakage for liquid-liquid system with an adoption of numerical simulation by using computational fluid dynamics and population balance model (PBM) coupled with two-fluid model (TFM). Two dif...This work focuses on drop breakage for liquid-liquid system with an adoption of numerical simulation by using computational fluid dynamics and population balance model (PBM) coupled with two-fluid model (TFM). Two different breakage kernels based on identical breakage mechanism but different descriptions of breaking time are take n into account in this work. Eight cases corresp on ding to distinct configurations of agitator are carried out to validate numerical predictions, namely agitators with different porosity and hole diameters, respectively implemented in Cases 1 to 5 and Cases 6 to 8. The results are compared with experimental data for testing the applicability of both kernels. Simulations are implemented, in this work, with an approach of class method for the solution of population balance model by the special-purpose computational fluid dynamics solver Fluent 16.1 based on finite volume method, and the grids used for meshing the solution domain are accomplished in a commercial software Gambit 2.4.6. The effects of configurations of agitator corresponding to different parameters mentioned above on final Sauter mean diameter are equally concentrated in this work. Analysis of both kernels and comparisons with experimental results reveal that, the second kernel has more decent agreement with experiments, and the results of investigations on effects of agitator configurations show that the in fluences of these parameters on Sauter mean diameter are marginal, but appropriate porosity and hole diameter are actually able to decrease Sauter mean diameter. These outcomes allow us to draw general conclusions and help investigate performances of liquid-liquid system.展开更多
Strong mechanical vibration and acoustical signals of grinding process contain useful information related to load parameters in ball mills. It is a challenge to extract latent features and construct soft sensor model ...Strong mechanical vibration and acoustical signals of grinding process contain useful information related to load parameters in ball mills. It is a challenge to extract latent features and construct soft sensor model with high dimensional frequency spectra of these signals. This paper aims to develop a selective ensemble modeling approach based on nonlinear latent frequency spectral feature extraction for accurate measurement of material to ball volume ratio. Latent features are first extracted from different vibrations and acoustic spectral segments by kernel partial least squares. Algorithms of bootstrap and least squares support vector machines are employed to produce candidate sub-models using these latent features as inputs. Ensemble sub-models are selected based on genetic algorithm optimization toolbox. Partial least squares regression is used to combine these sub-models to eliminate collinearity among their prediction outputs. Results indicate that the proposed modeling approach has better prediction performance than previous ones.展开更多
Consider tile partial linear model Y=Xβ+ g(T) + e. Wilers Y is at risk of being censored from the right, g is an unknown smoothing function on [0,1], β is a 1-dimensional parameter to be estimated and e is an unobse...Consider tile partial linear model Y=Xβ+ g(T) + e. Wilers Y is at risk of being censored from the right, g is an unknown smoothing function on [0,1], β is a 1-dimensional parameter to be estimated and e is an unobserved error. In Ref[1,2], it wes proved that the estimator for the asymptotic variance of βn(βn) is consistent. In this paper, we establish the limit distribution and the law of the iterated logarithm for,En, and obtain the convergest rates for En and the strong uniform convergent rates for gn(gn).展开更多
There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally b...There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally best among possible background models. For KDE, the selection of kernel functions and their bandwidths greatly influence the performance. There were few attempts to compare the adequacy of functions for KDE. In this paper, we evaluate the performance of various functions for KDE. Functions tested include almost everyone cited in the literature and a new function, Laplacian of Gaussian(LoG) is also introduced for comparison. All tests were done on real videos with vary-ing background dynamics and results were analyzed both qualitatively and quantitatively. Effect of different bandwidths was also investigated.展开更多
The development of forecasting models for pollution particles shows a nonlinear dynamic behavior;hence, implementation is a non-trivial process. In the literature, there have been multiple models of particulate pollut...The development of forecasting models for pollution particles shows a nonlinear dynamic behavior;hence, implementation is a non-trivial process. In the literature, there have been multiple models of particulate pollutants, which use softcomputing techniques and machine learning such as: multilayer perceptrons, neural networks, support vector machines, kernel algorithms, and so on. This paper presents a prediction pollution model using support vector machines and kernel functions, which are: Gaussian, Polynomial and Spline. Finally, the prediction results of ozone (O3), particulate matter (PM10) and nitrogen dioxide (NO2) at Mexico City are presented as a case study using these techniques.展开更多
文摘The application of support vector machines to forecasting problems is becoming popular, lately. Several comparisons between neural networks trained with error backpropagation and support vector machines have shown advantage for the latter in different domains of application. However, some difficulties still deteriorate the performance of the support vector machines. The main one is related to the setting of the hyperparameters involved in their training. Techniques based on meta-heuristics have been employed to determine appropriate values for those hyperparameters. However, because of the high noneonvexity of this estimation problem, which makes the search for a good solution very hard, an approach based on Bayesian inference, called relevance vector machine, has been proposed more recently. The present paper aims at investigating the suitability of this new approach to the short-term load forecasting problem.
基金Project supported by National Basic Research Program of Chinaon Urban Traffic Monitoring and Management System(Grant No .TG1998030408)
文摘A novel diversity-sampling based nonparametric multi-modal background model is proposed. Using the samples having more popular and various intensity values in the training sequence, a nonparametric model is built for background subtraction. According to the related intensifies, different weights are given to the distinct samples in kernel density estimation. This avoids repeated computation using all samples, and makes computation more efficient in the evaluation phase. Experimental results show the validity of the diversity- sampling scheme and robustness of the proposed model in moving objects segmentation. The proposed algorithm can be used in outdoor surveillance systems.
文摘Panel data combine cross-section data and time series data. If the cross-section is locations, there is a need to check the correlation among locations. ρ and λ are parameters in generalized spatial model to cover effect of correlation between locations. Value of ρ or λ will influence the goodness of fit model, so it is important to make parameter estimation. The effect of another location is covered by making contiguity matrix until it gets spatial weighted matrix (W). There are some types of W—uniform W, binary W, kernel Gaussian W and some W from real case of economics condition or transportation condition from locations. This study is aimed to compare uniform W and kernel Gaussian W in spatial panel data model using RMSE value. The result of analysis showed that uniform weight had RMSE value less than kernel Gaussian model. Uniform W had stabil value for all the combinations.
文摘A recursive Kernel eigenspace updating algorithm was proposed to build the soft sensor for end-product quality.The updating procedure was composed of two sub-stages,i.e.firstly performing forward increasing updating and then followed by backward decreasing updating,which drastically decreased the required computation workload.Further,the whole Kernel matrix did not need to be stored.Simulation study on the Tennessee Eastman process showed that the consequent impurity component model had satisfying precision under both normal and faulty operations,which was obviously superior to the offline batch model and meanwhile approximated the performance of model obtained by successively applying the time-consuming traditional eigenvalue numerical algorithm.
基金funded by National Key Research and Development Program of China, Ecological Safety Guarantee Technology and Demonstration Channel and Slope Treatment Project in Loess Hilly and Gully Area (Grant No. 2017YFC0504700)。
文摘Landslide susceptibility mapping is significant for landslide prevention.Many approaches have been used for landslide susceptibility prediction,however,their performances are unstable.This study constructed a hybrid model,namely box counting dimension-based kernel logistic regression model,which uses fractal dimension calculated by box counting method as input data based on grid cells mapping unit and terrain mapping unit.The performance of this model was evaluated in the application in Zhidan County,Shaanxi Province,China.Firstly,a total of 221 landslides were identified and mapped,and 11 landslide predisposing factors were considered.Secondly,the landslide susceptibility maps(LSMs) of the study area were obtained by constructing the model on two different mapping units.Finally,the results were evaluated with five statistical indexes,sensitivity,specificity,positive predictive value(PPV),negative predictive value(NPV) and Accuracy.The statistical indexes of the model obtained on the terrain mapping unit were larger than those based on grid cells mapping unit.For training and validation datasets,the area under the receiver operating characteristic curve(AUC) of the model based on terrain mapping unit were 0.9374 and 0.9527,respectively,indicating that establishing this model on the terrain mapping unit was advantageous in the study area.The results show that the fractal dimension improves the prediction ability of the kernel logistic model.In addition,the terrain mapping unit is a more promising mapping unit in Loess areas.
基金the National Natural Science Foundation of China(No.20275026).
文摘QSPR models of PCDD/Fs were generated by means of kernel partial least squares. The molecular distance-edge vector method was used as descriptors to get model I for predicting PCDD/Fs retention behavior. The chlorinated positions were also used and model II was obtained. In studied cases, the predictive ability of the KPLS model is comparable or superior to those of PLS and ANN. The results indicate that KPLS can be used as an alternative powerful modeling tool for QSPR studies.
文摘The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.
基金Supported by the National Natural Science Foundation of China(21776180,21306116)
文摘This work focuses on drop breakage for liquid-liquid system with an adoption of numerical simulation by using computational fluid dynamics and population balance model (PBM) coupled with two-fluid model (TFM). Two different breakage kernels based on identical breakage mechanism but different descriptions of breaking time are take n into account in this work. Eight cases corresp on ding to distinct configurations of agitator are carried out to validate numerical predictions, namely agitators with different porosity and hole diameters, respectively implemented in Cases 1 to 5 and Cases 6 to 8. The results are compared with experimental data for testing the applicability of both kernels. Simulations are implemented, in this work, with an approach of class method for the solution of population balance model by the special-purpose computational fluid dynamics solver Fluent 16.1 based on finite volume method, and the grids used for meshing the solution domain are accomplished in a commercial software Gambit 2.4.6. The effects of configurations of agitator corresponding to different parameters mentioned above on final Sauter mean diameter are equally concentrated in this work. Analysis of both kernels and comparisons with experimental results reveal that, the second kernel has more decent agreement with experiments, and the results of investigations on effects of agitator configurations show that the in fluences of these parameters on Sauter mean diameter are marginal, but appropriate porosity and hole diameter are actually able to decrease Sauter mean diameter. These outcomes allow us to draw general conclusions and help investigate performances of liquid-liquid system.
基金Supported partially by the Post Doctoral Natural Science Foundation of China(2013M532118,2015T81082)the National Natural Science Foundation of China(61573364,61273177,61503066)+2 种基金the State Key Laboratory of Synthetical Automation for Process Industriesthe National High Technology Research and Development Program of China(2015AA043802)the Scientific Research Fund of Liaoning Provincial Education Department(L2013272)
文摘Strong mechanical vibration and acoustical signals of grinding process contain useful information related to load parameters in ball mills. It is a challenge to extract latent features and construct soft sensor model with high dimensional frequency spectra of these signals. This paper aims to develop a selective ensemble modeling approach based on nonlinear latent frequency spectral feature extraction for accurate measurement of material to ball volume ratio. Latent features are first extracted from different vibrations and acoustic spectral segments by kernel partial least squares. Algorithms of bootstrap and least squares support vector machines are employed to produce candidate sub-models using these latent features as inputs. Ensemble sub-models are selected based on genetic algorithm optimization toolbox. Partial least squares regression is used to combine these sub-models to eliminate collinearity among their prediction outputs. Results indicate that the proposed modeling approach has better prediction performance than previous ones.
文摘Consider tile partial linear model Y=Xβ+ g(T) + e. Wilers Y is at risk of being censored from the right, g is an unknown smoothing function on [0,1], β is a 1-dimensional parameter to be estimated and e is an unobserved error. In Ref[1,2], it wes proved that the estimator for the asymptotic variance of βn(βn) is consistent. In this paper, we establish the limit distribution and the law of the iterated logarithm for,En, and obtain the convergest rates for En and the strong uniform convergent rates for gn(gn).
文摘There have been vast amount of studies on background modeling to detect moving objects. Two recent reviews[1,2] showed that kernel density estimation(KDE) method and Gaussian mixture model(GMM) perform about equally best among possible background models. For KDE, the selection of kernel functions and their bandwidths greatly influence the performance. There were few attempts to compare the adequacy of functions for KDE. In this paper, we evaluate the performance of various functions for KDE. Functions tested include almost everyone cited in the literature and a new function, Laplacian of Gaussian(LoG) is also introduced for comparison. All tests were done on real videos with vary-ing background dynamics and results were analyzed both qualitatively and quantitatively. Effect of different bandwidths was also investigated.
文摘The development of forecasting models for pollution particles shows a nonlinear dynamic behavior;hence, implementation is a non-trivial process. In the literature, there have been multiple models of particulate pollutants, which use softcomputing techniques and machine learning such as: multilayer perceptrons, neural networks, support vector machines, kernel algorithms, and so on. This paper presents a prediction pollution model using support vector machines and kernel functions, which are: Gaussian, Polynomial and Spline. Finally, the prediction results of ozone (O3), particulate matter (PM10) and nitrogen dioxide (NO2) at Mexico City are presented as a case study using these techniques.