期刊文献+
共找到11篇文章
< 1 >
每页显示 20 50 100
Efficient strategies for leave-one-out cross validation for genomic best linear unbiased prediction 被引量:3
1
作者 Hao Cheng Dorian J.Garrick Rohan L.Fernando 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2017年第3期733-737,共5页
Background: A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Predictio... Background: A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Prediction, using whole-genome data. Leave-one-out cross validation can be used to quantify the predictive ability of a statistical model.Methods: Naive application of Leave-one-out cross validation is computationally intensive because the training and validation analyses need to be repeated n times, once for each observation. Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.Results: Efficient Leave-one-out cross validation strategies is 786 times faster than the naive application for a simulated dataset with 1,000 observations and 10,000 markers and 99 times faster with 1,000 observations and 100 markers. These efficiencies relative to the naive approach using the same model will increase with increases in the number of observations.Conclusions: Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis. 展开更多
关键词 leave-one-out cross validation GBLUP
下载PDF
Efficient Leave-One-Out Strategy for Supervised Feature Selection 被引量:3
2
作者 Dingcheng Feng Feng Chen Wenli Xu 《Tsinghua Science and Technology》 SCIE EI CAS 2013年第6期629-635,共7页
Feature selection is a key task in statistical pattern recognition. Most feature selection algorithms have been proposed based on specific objective functions which are usually intuitively reasonable but can sometimes... Feature selection is a key task in statistical pattern recognition. Most feature selection algorithms have been proposed based on specific objective functions which are usually intuitively reasonable but can sometimes be far from the more basic objectives of the feature selection. This paper describes how to select features such that the basic objectives, e.g., classification or clustering accuracies, can be optimized in a more direct way. The analysis requires that the contribution of each feature to the evaluation metrics can be quantitatively described by some score function. Motivated by the conditional independence structure in probabilistic distributions, the analysis uses a leave-one-out feature selection algorithm which provides an approximate solution. The leave-one- out algorithm improves the conventional greedy backward elimination algorithm by preserving more interactions among features in the selection process, so that the various feature selection objectives can be optimized in a unified way. Experiments on six real-world datasets with different feature evaluation metrics have shown that this algorithm outperforms popular feature selection algorithms in most situations. 展开更多
关键词 leave-one-out feature selection objectives evaluation metrics
原文传递
基于遗传算法的多尺度支持向量机及其在机械故障诊断中的应用 被引量:7
3
作者 李良敏 温广瑞 《机械科学与技术》 CSCD 北大核心 2008年第8期1101-1106,共6页
通过对支持向量机核函数的分析发现,当对样本的各个特征赋予不同大小的尺度参数时,可以避免冗余特征干扰分类,增强关键特征在分类中的作用,提高支持向量机分类器的学习和泛化能力。在此基础上,提出一种具有不同特征尺度参数的支持向量机... 通过对支持向量机核函数的分析发现,当对样本的各个特征赋予不同大小的尺度参数时,可以避免冗余特征干扰分类,增强关键特征在分类中的作用,提高支持向量机分类器的学习和泛化能力。在此基础上,提出一种具有不同特征尺度参数的支持向量机(简称多尺度支持向量机),并通过遗传算法最小化LOO(leave-one-out)泛化错误上限估计,根据各个特征的识别能力赋予其不同大小的尺度参数。将多尺度支持向量机用于轴承故障诊断,实验结果表明,与传统的单尺度参数支持向量机相比,多尺度支持向量机具有更好的泛化能力。对压缩机气阀的故障识别表明,尺度参数的大小直接反映了对应特征识别能力的大小,因此可以依据尺度参数的大小进行特征选择,保留关键特征,剔除冗余特征。 展开更多
关键词 支持向量机 遗传算法 尺度参数 leave-one-out估计 泛化能力 特征选择
下载PDF
Improved adaptive pruning algorithm for least squares support vector regression 被引量:4
4
作者 Runpeng Gao Ye San 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2012年第3期438-444,共7页
As the solutions of the least squares support vector regression machine (LS-SVRM) are not sparse, it leads to slow prediction speed and limits its applications. The defects of the ex- isting adaptive pruning algorit... As the solutions of the least squares support vector regression machine (LS-SVRM) are not sparse, it leads to slow prediction speed and limits its applications. The defects of the ex- isting adaptive pruning algorithm for LS-SVRM are that the training speed is slow, and the generalization performance is not satis- factory, especially for large scale problems. Hence an improved algorithm is proposed. In order to accelerate the training speed, the pruned data point and fast leave-one-out error are employed to validate the temporary model obtained after decremental learning. The novel objective function in the termination condition which in- volves the whole constraints generated by all training data points and three pruning strategies are employed to improve the generali- zation performance. The effectiveness of the proposed algorithm is tested on six benchmark datasets. The sparse LS-SVRM model has a faster training speed and better generalization performance. 展开更多
关键词 least squares support vector regression machine (LS- SVRM) PRUNING leave-one-out (LOO) error incremental learning decremental learning.
下载PDF
Optimization of Shanghai marine environment monitoring sites by integrating spatial correlation and stratified heterogeneity 被引量:2
5
作者 FAN Haimei GAO Bingbo +1 位作者 XU Ren WANG Jinfeng 《Acta Oceanologica Sinica》 SCIE CAS CSCD 2017年第2期111-121,共11页
The water quality grades of phosphate(PO4-P) and dissolved inorganic nitrogen(DIN) are integrated by spatial partitioning to fit the global and local semi-variograms of these nutrients. Leave-one-out cross validat... The water quality grades of phosphate(PO4-P) and dissolved inorganic nitrogen(DIN) are integrated by spatial partitioning to fit the global and local semi-variograms of these nutrients. Leave-one-out cross validation is used to determine the statistical inference method. To minimize absolute average errors and error mean squares,stratified Kriging(SK) interpolation is applied to DIN and ordinary Kriging(OK) interpolation is applied to PO4-P.Ten percent of the sites is adjusted by considering their impact on the change in deviations in DIN and PO4-P interpolation and the resultant effect on areas with different water quality grades. Thus, seven redundant historical sites are removed. Seven historical sites are distributed in areas with water quality poorer than Grade IV at the north and south branches of the Changjiang(Yangtze River) Estuary and at the coastal region north of the Hangzhou Bay. Numerous sites are installed in these regions. The contents of various elements in the waters are not remarkably changed, and the waters are mixed well. Seven sites that have been optimized and removed are set to water with quality Grades III and IV. Optimization and adjustment of unrestricted areas show that the optimized and adjusted sites are mainly distributed in regions where the water quality grade undergoes transition.Therefore, key sites for adjustment and optimization are located at the boundaries of areas with different water quality grades and seawater. 展开更多
关键词 area of water quality grade stratified Kriging(SK) leave-one-out cross validation method spatial simulated annealing method monitoring sites optimization
下载PDF
Artificial intelligence model validation before its application in clinical diagnosis assistance
6
作者 Gustavo Jesus Vazquez-Zapien Monica Maribel Mata-Miranda +1 位作者 Francisco Garibay-Gonzalez Miguel Sanchez-Brito 《World Journal of Gastroenterology》 SCIE CAS 2022年第5期602-604,共3页
The process of selecting an artificial intelligence(AI)model to assist clinical diagnosis of a particular pathology and its validation tests is relevant since the values of accuracy,sensitivity and specificity may not... The process of selecting an artificial intelligence(AI)model to assist clinical diagnosis of a particular pathology and its validation tests is relevant since the values of accuracy,sensitivity and specificity may not reflect the behavior of the method in a real environment.Here,we provide helpful considerations to increase the success of using an AI model in clinical practice. 展开更多
关键词 Artificial intelligence Diagnostic assistance Validation tests leave-one-out cross-validation K-fold validation Hold-out validation
下载PDF
Discrimination between Dementia Groups and Healthy Elderlies Using Scalp-Recorded-EEG-Based Brain Functional Connectivity Networks
7
作者 Sakura Nishijima Tetsushi Yada +7 位作者 Toshimasa Yamazaki Yoshiyuki Kuroiwa Makoto Nakane Kimihiro Fujino Toshiaki Hirai Yasuhisa Baba Shoko Yamada Sho Tsukiyama 《Journal of Biomedical Science and Engineering》 2020年第7期153-167,共15页
<em>Objective:</em> To establish a practical method for discriminating dementia groups and healthy elderlies, by using scalp-recorded electroencephalograms (EEGs). <em>Methods:</em> 16-ch EEGs ... <em>Objective:</em> To establish a practical method for discriminating dementia groups and healthy elderlies, by using scalp-recorded electroencephalograms (EEGs). <em>Methods:</em> 16-ch EEGs were recorded during resting state for 39 dementia groups and 11 healthy elderlies. The connectivity between any two electrodes was estimated by synchronization likelihood (SL). The brain networks were constructed by normalized SL values. The present leave-one-out cross validation (LOOCV) required the Euclidean distance between any two subjects having 120-dimensional vectors concerned with the SL values for six frequency bands. In order to investigate factors which would affect the LOOCV results, principal component analysis (PCA) was applied to all the subjects. <em>Results:</em> The accuracy for the upper alpha yielded more than 80% and 70% in the dementia groups and the healthy elderlies, respectively. The LOOCV result could be explained in terms of brain networks such as executive control network (ECN) and default mode network (DMN) characterized by factor loadings of principal components. <em>Conclusions:</em> Dementia groups and healthy elderlies could be characterized by principal components of SL values between all the electrode pairs, even less connections, which revealed disruption and preservation of DMN and ECN. <em>Significance:</em> This study will provide a simple and practical method for discriminating dementia groups from healthy elderlies by scalp-recorded EEGs. 展开更多
关键词 ELECTROENCEPHALOGRAPHY Network DEMENTIA Synchronization Likelihood leave-one-out Cross Validation Euclidean Distance
下载PDF
Comparing independent climate-sensitive models of aboveground biomass and diame ter grow th with their compatible simultaneous model system for three larch species in China 被引量:1
8
作者 Zhigang Gao Qiuyan Wang +7 位作者 Zongda Hus Peng Luo Guangshuang Duan Ram PSharma Qiaolin Ye Wenqiang Gao Xinyu Song Liyong Fut 《International Journal of Biomathematics》 SCIE 2019年第7期1-20,共20页
Accurate estimate of tree biomass is essential for forest management.In recent years,several climate-sensitive allometric biomass models with diameter at breast height(D)as a predictor have been proposed for various t... Accurate estimate of tree biomass is essential for forest management.In recent years,several climate-sensitive allometric biomass models with diameter at breast height(D)as a predictor have been proposed for various tree species and climate zones to estimate tree aboveground biomass(AGB).But the allometric models only account for the potential effects of climate on tree biomass and do not simultaneously explain the influence of climate on D growth.In this study,based on the AGB data from 256 destructively sampled trees of three larch species randomly distributed across the five secondary climate zones in northeastern and northern China,we first developed a climate-sensitive AGB base model and a climate-sensitive D growth base model using a nonlinear least square regression separately.A compatible simultaneous model system was then developed with the climate-sensitive AGB and D growth models using a nonlinear seemingly unrelated regression.The potential effects of several temperature and precipitation variables on AGB and D growth were evaluated.The fitting results of climatic sensitive base models were compared against those of their compatible simultaneous model system.It was found that a decreased isothermality([mean of monthly(maximum temperatureminimum temperature)]/(Maximum temperature of the warmest month-Minimum temperature of the coldest month))and total growing season precipitation,and increased annual precipitation significantly increased the values of AGB;an increase of temperature seasonality(a standard deviation of the mean monthly temperature)and precipitation seasonality(a standard deviation of the mean monthly precipitation)could lead to the increase of D.The differences of the model fitting results between the compatible simultaneous system with the consideration of climate effects on both AGB and D growth and its corresponding climate-sensitive AGB and D growth base models were very small and insignificant(p>0.05).Compared to the base models,the inhere nt correlation of AGB with D was taken into account effectively by the proposed compatible model system developed with the climate-sensitive AGB and D grow th models.In addition,the compatible properties of the estimated AGB and D were also addressed substantially in the proposed model system. 展开更多
关键词 LARCH aboveground biomass diameter at breast height climate change seemingly unrelated regression leave-one-out cross-validation
原文传递
Quantitative structure–biodegradability relationships for biokinetic parameter of polycyclic aromatic hydrocarbons 被引量:2
9
作者 Peng Xu Wencheng Ma +2 位作者 Hongjun Han Shengyong Jia Baolin Hou 《Journal of Environmental Sciences》 SCIE EI CAS CSCD 2015年第4期180-185,共6页
Prediction of the biodegradability of organic pollutants is an ecologically desirable and economically feasible tool for estimating the environmental fate of chemicals. In this paper,stepwise multiple linear regressio... Prediction of the biodegradability of organic pollutants is an ecologically desirable and economically feasible tool for estimating the environmental fate of chemicals. In this paper,stepwise multiple linear regression analysis method was applied to establish quantitative structure biodegradability relationship(QSBR) between the chemical structure and a novel biodegradation activity index(qmax) of 20 polycyclic aromatic hydrocarbons(PAHs). The frequency B3LYP/6-311+G(2df,p) calculations showed no imaginary values, implying that all the structures are minima on the potential energy surface. After eliminating the parameters which had low related coefficient with qmax, the major descriptors influencing the biodegradation activity were screened to be Freq, D, MR, EHOMOand To IE. The evaluation of the developed QSBR mode, using a leave-one-out cross-validation procedure, showed that the relationships are significant and the model had good robustness and predictive ability. The results would be helpful for understanding the mechanisms governing biodegradation at the molecular level. 展开更多
关键词 leave-one-out cross-validation Stepwise multiple linear regression Polycyclic aromatic hydrocarbons QSBR
原文传递
New method for estimating strike and dip based on structural expansion orientation for 3D geological modeling
10
作者 Yabo ZHAO Weihua HUA +3 位作者 Guoxiong CHEN Dong LIANG Zhipeng LIU Xiuguo LIU 《Frontiers of Earth Science》 SCIE CSCD 2021年第3期676-691,共16页
Strike and dip are essential to the description of geological features and therefore play important roles in 3D geological modeling.Unevenly and sparsely measured orientations from geological field mapping pose proble... Strike and dip are essential to the description of geological features and therefore play important roles in 3D geological modeling.Unevenly and sparsely measured orientations from geological field mapping pose problems for the geological modeling,especially for covered and deep areas.This study developed a new method for estimating strike and dip based on structural expansion orientation,which can be automatically extracted from both geological and geophysical maps or profiles.Specifically,strike and dip can be estimated by minimizing an objective function composed of the included angle between the strike and dip and the leave-one-out cross-validation strike and dip.We used angle parameterization to reduce dimensionality and proposed a quasi-gradient descent(QGD)method to rapidly obtain a near-optimal solution,improving the time-efficiency and accuracy of objective function optimization with the particle swarm method.A synthetic basin fold model was subsequently used to test the proposed method,and the results showed that the strike and dip estimates were close to the true values.Finally,the proposed method was applied to a real fold structure largely covered by Cainozoic sediments in Australia.The strikes and dips estimated by the proposed method conformed to the actual geological structures more than those of the vector interpolation method did.As expected,the results of 3D geological implicit interface modeling and the strike and dip vector field were much improved by the addition of estimated strikes and dips. 展开更多
关键词 strike and dip structural expansion orientation leave-one-out cross-validation covered area
原文传递
Predicting microRNA-disease association based on microRNA structural and functional similarity network
11
作者 Tao Ding Jie Gao +2 位作者 Shanshan Zhu Junhua Xu Min Wu 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2019年第2期138-146,共9页
Background:Increasing evidences indicate that microRNAs (miRNAs) are functionally related to the development and progression of various human diseases.Inferring disease-related miRNAs can be helpful in promoting disea... Background:Increasing evidences indicate that microRNAs (miRNAs) are functionally related to the development and progression of various human diseases.Inferring disease-related miRNAs can be helpful in promoting disease biomarker detection for the treatment,diagnosis,and prevention of complex diseases.Methods:To improve the prediction accuracy of miRNA-disease association and capture more potential diseaserelated miRNAs,we constructed a precise miRNA global similarity network (MSFSN) via calculating the miRNA similarity based on secondary structures,families,and functions.Results:We tested the network on the classical algorithms:WBSMDA and RWRMDA through the method of leaveone- out cross-validation.Eventually,AUCs of 0.8212 and 0.9657 are obtained,respectively.Also,the proposed MSFSN is applied to three cancers for breast neoplasms,hepatocellular carcinoma,and prostate neoplasms.Consequently,82%,76%,and 82% of the top 50 potential miRNAs for these diseases are respectively validated by the miRNA-disease associations database miR2Disease and oncomiRDB.Conclusion:Therefore,MSFSN provides a novel miRNA similarity network combining precise function network with global structure network of miRNAs to predict the associations between miRNAs and diseases in various models. 展开更多
关键词 MIRNAS HAIRPIN structure miRNA families functional SIMILARITY disease semantic leave-one-out CROSSVALIDATION
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部