期刊文献+
共找到1,309篇文章
< 1 2 66 >
每页显示 20 50 100
NONLINEAR DATA RECONCILIATION METHOD BASED ON KERNEL PRINCIPAL COMPONENT ANALYSIS 被引量:6
1
作者 Yan Weiwu Shao HuiheDepartment of Automation,Shanghai Jiaotong University,Shanghai 200030, China 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2003年第2期117-119,共3页
In the industrial process situation, principal component analysis (PCA) is ageneral method in data reconciliation. However, PCA sometime is unfeasible to nonlinear featureanalysis and limited in application to nonline... In the industrial process situation, principal component analysis (PCA) is ageneral method in data reconciliation. However, PCA sometime is unfeasible to nonlinear featureanalysis and limited in application to nonlinear industrial process. Kernel PCA (KPCA) is extensionof PCA and can be used for nonlinear feature analysis. A nonlinear data reconciliation method basedon KPCA is proposed. The basic idea of this method is that firstly original data are mapped to highdimensional feature space by nonlinear function, and PCA is implemented in the feature space. Thennonlinear feature analysis is implemented and data are reconstructed by using the kernel. The datareconciliation method based on KPCA is applied to ternary distillation column. Simulation resultsshow that this method can filter the noise in measurements of nonlinear process and reconciliateddata can represent the true information of nonlinear process. 展开更多
关键词 principal component analysis kernel data reconciliation NONLINEAR
下载PDF
Decentralized Fault Diagnosis of Large-scale Processes Using Multiblock Kernel Principal Component Analysis 被引量:22
2
作者 ZHANG Ying-Wei ZHOU Hong QIN S. Joe 《自动化学报》 EI CSCD 北大核心 2010年第4期593-597,共5页
关键词 分散系统 MBKPCA SPF PCA
下载PDF
Kernel Generalization of Multi-Rate Probabilistic Principal Component Analysis for Fault Detection in Nonlinear Process 被引量:2
3
作者 Donglei Zheng Le Zhou Zhihuan Song 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2021年第8期1465-1476,共12页
In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different ... In practical process industries,a variety of online and offline sensors and measuring instruments have been used for process control and monitoring purposes,which indicates that the measurements coming from different sources are collected at different sampling rates.To build a complete process monitoring strategy,all these multi-rate measurements should be considered for data-based modeling and monitoring.In this paper,a novel kernel multi-rate probabilistic principal component analysis(K-MPPCA)model is proposed to extract the nonlinear correlations among different sampling rates.In the proposed model,the model parameters are calibrated using the kernel trick and the expectation-maximum(EM)algorithm.Also,the corresponding fault detection methods based on the nonlinear features are developed.Finally,a simulated nonlinear case and an actual pre-decarburization unit in the ammonia synthesis process are tested to demonstrate the efficiency of the proposed method. 展开更多
关键词 Fault detection kernel method multi-rate process probability principal component analysis(PPCA)
下载PDF
FUZZY PRINCIPAL COMPONENT ANALYSIS AND ITS KERNEL-BASED MODEL 被引量:4
4
作者 Wu Xiaohong Zhou Jianjiang 《Journal of Electronics(China)》 2007年第6期772-775,共4页
Principal Component Analysis(PCA)is one of the most important feature extraction methods,and Kernel Principal Component Analysis(KPCA)is a nonlinear extension of PCA based on kernel methods.In real world,each input da... Principal Component Analysis(PCA)is one of the most important feature extraction methods,and Kernel Principal Component Analysis(KPCA)is a nonlinear extension of PCA based on kernel methods.In real world,each input data may not be fully assigned to one class and it may partially belong to other classes.Based on the theory of fuzzy sets,this paper presents Fuzzy Principal Component Analysis(FPCA)and its nonlinear extension model,i.e.,Kernel-based Fuzzy Principal Component Analysis(KFPCA).The experimental results indicate that the proposed algorithms have good performances. 展开更多
关键词 计算机技术 网络设计 设计方案 通信技术 信息处理
下载PDF
Comparison of Kernel Entropy Component Analysis with Several Dimensionality Reduction Methods
5
作者 马西沛 张蕾 孙以泽 《Journal of Donghua University(English Edition)》 EI CAS 2017年第4期577-582,共6页
Dimensionality reduction techniques play an important role in data mining. Kernel entropy component analysis( KECA) is a newly developed method for data transformation and dimensionality reduction. This paper conducte... Dimensionality reduction techniques play an important role in data mining. Kernel entropy component analysis( KECA) is a newly developed method for data transformation and dimensionality reduction. This paper conducted a comparative study of KECA with other five dimensionality reduction methods,principal component analysis( PCA),kernel PCA( KPCA),locally linear embedding( LLE),laplacian eigenmaps( LAE) and diffusion maps( DM). Three quality assessment criteria, local continuity meta-criterion( LCMC),trustworthiness and continuity measure(T&C),and mean relative rank error( MRRE) are applied as direct performance indexes to assess those dimensionality reduction methods. Moreover,the clustering accuracy is used as an indirect performance index to evaluate the quality of the representative data gotten by those methods. The comparisons are performed on six datasets and the results are analyzed by Friedman test with the corresponding post-hoc tests. The results indicate that KECA shows an excellent performance in both quality assessment criteria and clustering accuracy assessing. 展开更多
关键词 dimensionality reduction kernel entropy component analysis(KECA) kernel principal component analysis(KPCA) CLUSTERING
下载PDF
Application of Particle Swarm Optimization to Fault Condition Recognition Based on Kernel Principal Component Analysis 被引量:1
6
作者 WEI Xiu-ye PAN Hong-xia HUANG Jin-ying WANG Fu-jie 《International Journal of Plant Engineering and Management》 2009年第3期129-135,共7页
Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal ke... Panicle swarm optimization (PSO) is an optimization algorithm based on the swarm intelligent principle. In this paper the modified PSO is applied to a kernel principal component analysis ( KPCA ) for an optimal kernel function parameter. We first comprehensively considered within-class scatter and between-class scatter of the sample features. Then, the fitness function of an optimized kernel function parameter is constructed, and the particle swarm optimization algorithm with adaptive acceleration (CPSO) is applied to optimizing it. It is used for gearbox condi- tion recognition, and the result is compared with the recognized results based on principal component analysis (PCA). The results show that KPCA optimized by CPSO can effectively recognize fault conditions of the gearbox by reducing bind set-up of the kernel function parameter, and its results of fault recognition outperform those of PCA. We draw the conclusion that KPCA based on CPSO has an advantage in nonlinear feature extraction of mechanical failure, and is helpful for fault condition recognition of complicated machines. 展开更多
关键词 particle swarm optimization kernel principal component analysis kernel function parameter feature extraction gearbox condition recognition
下载PDF
Multi-state Information Dimension Reduction Based on Particle Swarm Optimization-Kernel Independent Component Analysis
7
作者 邓士杰 苏续军 +1 位作者 唐力伟 张英波 《Journal of Donghua University(English Edition)》 EI CAS 2017年第6期791-795,共5页
The precision of the kernel independent component analysis( KICA) algorithm depends on the type and parameter values of kernel function. Therefore,it's of great significance to study the choice method of KICA'... The precision of the kernel independent component analysis( KICA) algorithm depends on the type and parameter values of kernel function. Therefore,it's of great significance to study the choice method of KICA's kernel parameters for improving its feature dimension reduction result. In this paper, a fitness function was established by use of the ideal of Fisher discrimination function firstly. Then the global optimal solution of fitness function was searched by particle swarm optimization( PSO) algorithm and a multi-state information dimension reduction algorithm based on PSO-KICA was established. Finally,the validity of this algorithm to enhance the precision of feature dimension reduction has been proven. 展开更多
关键词 kernel independent component analysis(KICA) particle swarm optimization(PSO) feature dimension reduction fitness function
下载PDF
Nonlinear Statistical Process Monitoring Based on Control Charts with Memory Effect and Kernel Independent Component Analysis
8
作者 张曦 阎威武 +1 位作者 赵旭 邵惠鹤 《Journal of Shanghai Jiaotong university(Science)》 EI 2007年第5期563-571,共9页
A novel nonlinear combination process monitoring method was proposed based on techniques with memory effect (multivariate exponentially weighted moving average (MEWMA)) and kernel independent component analysis (KICA)... A novel nonlinear combination process monitoring method was proposed based on techniques with memory effect (multivariate exponentially weighted moving average (MEWMA)) and kernel independent component analysis (KICA). The method was developed for dealing with nonlinear issues and detecting small or moderate drifts in one or more process variables with autocorrelation. MEWMA charts use additional information from the past history of the process for keeping the memory effect of the process behavior trend. KICA is a recently developed statistical technique for revealing hidden, nonlinear statistically independent factors that underlie sets of measurements and it is a two-phase algorithm: whitened kernel principal component analysis (KPCA) plus independent component analysis (ICA). The application to the fluid catalytic cracking unit (FCCU) simulated process indicates that the proposed combined method based on MEWMA and KICA can effectively capture the nonlinear relationship and detect small drifts in process variables. Its performance significantly outperforms monitoring method based on ICA, MEWMA-ICA and KICA, especially for long-term performance deterioration. 展开更多
关键词 核心独立成分分析 非线性统计 管理图表 监视系统
下载PDF
Statistical Monitoring of Chemical Processes Based on Sensitive Kernel Principal Components 被引量:10
9
作者 JIANG Qingchao YAN Xuefeng 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2013年第6期633-643,共11页
核主管部件分析(KPCA ) 方法采用开始的几个核主管部件(KPC ) ,它为过程监视显示正常观察的大多数变化信息,但是不能反映差错信息。在这研究,敏感内核主管部件分析(SKPCA ) 被建议改进监视性能的过程,即,错过了察觉率处理 T2 统计... 核主管部件分析(KPCA ) 方法采用开始的几个核主管部件(KPC ) ,它为过程监视显示正常观察的大多数变化信息,但是不能反映差错信息。在这研究,敏感内核主管部件分析(SKPCA ) 被建议改进监视性能的过程,即,错过了察觉率处理 T2 统计、摆平的预言错误 SPE 统计数值和还原剂的不和。T2 统计数值能被用来沿着每 KPC 直接测量变化并且分析察觉表演以及在一个过程捕获最有用的信息。随沿着每 KPC 的 T2 统计数值的变化率的计算, SKPCA 为进程监视选择敏感内核主管部件。一个模仿的简单系统和田纳西伊斯门过程被采用在联机监视上表明 SKPCA 的效率。结果显示监视表演显著地被改进。 展开更多
关键词 核主成分分析 在线监测 统计量 敏感 化工过程 基础 组成部分 监控信息
下载PDF
Relationships between changes of kernel nutritive components and seed vigor during development stages of F_1 seeds of sh_2 sweet corn 被引量:6
10
作者 Dong-dong CAO Jin HU +3 位作者 Xin-xian HUANG Xian-ju WANG Ya-jing GUAN Zhou-fei WANG 《Journal of Zhejiang University-Science B(Biomedicine & Biotechnology)》 SCIE CAS CSCD 2008年第12期964-968,共5页
The changes of kernel nutritive components and seed vigor in F1 seeds of sh2 sweet corn during seed development stage were investigated and the relationships between them were analyzed by time series regression (TSR) ... The changes of kernel nutritive components and seed vigor in F1 seeds of sh2 sweet corn during seed development stage were investigated and the relationships between them were analyzed by time series regression (TSR) analysis. The results show that total soluble sugar and reducing sugar contents gradually declined, while starch and soluble protein contents increased throughout the seed development stages. Germination percentage, energy of germination, germination index and vigor index gradually increased along with seed development and reached the highest levels at 38 d after pollination (DAP). The TSR showed that, during 14 to 42 DAP, total soluble sugar content was independent of the vigor parameters determined in present experiment, while the reducing sugar content had a significant effect on seed vigor. TSR equations between seed reducing sugar and seed vigor were also developed. There were negative correlations between the seed reducing sugar content and the germination percentage, energy of germination, germination index and vigor index, respectively. It is suggested that the seed germination, energy of germination, germination index and vigor index could be predicted by the content of reducing sugar in sweet corn seeds during seed development stages. 展开更多
关键词 玉米 种子 农作物 农业
下载PDF
Kernel Factor Analysis Algorithm with Varimax
11
作者 夏国恩 金炜东 张葛祥 《Journal of Southwest Jiaotong University(English Edition)》 2006年第4期394-399,共6页
Kernal factor analysis (KFA) with vafimax was proposed by using Mercer kernel function which can map the data in the original space to a high-dimensional feature space, and was compared with the kernel principle com... Kernal factor analysis (KFA) with vafimax was proposed by using Mercer kernel function which can map the data in the original space to a high-dimensional feature space, and was compared with the kernel principle component analysis (KPCA). The results show that the best error rate in handwritten digit recognition by kernel factor analysis with vadmax (4.2%) was superior to KPCA (4.4%). The KFA with varimax could more accurately image handwritten digit recognition. 展开更多
关键词 kernel factor analysis kernel principal component analysis Support vector machine Varimax ALGORITHM Handwritten digit recognition
下载PDF
Multivariate time delay analysis based local KPCA fault prognosis approach for nonlinear processes 被引量:5
12
作者 Yuan Xu Ying Liu Qunxiong Zhu 《Chinese Journal of Chemical Engineering》 SCIE EI CAS CSCD 2016年第10期1413-1422,共10页
Currently,some fault prognosis technology occasionally has relatively unsatisfied performance especially for incipient faults in nonlinear processes duo to their large time delay and complex internal connection.To ove... Currently,some fault prognosis technology occasionally has relatively unsatisfied performance especially for incipient faults in nonlinear processes duo to their large time delay and complex internal connection.To overcome this deficiency,multivariate time delay analysis is incorporated into the high sensitive local kernel principal component analysis.In this approach,mutual information estimation and Bayesian information criterion(BIC) are separately used to acquire the correlation degree and time delay of the process variables.Moreover,in order to achieve prediction,time series prediction by back propagation(BP) network is applied whose input is multivariate correlated time series other than the original time series.Then the multivariate time delayed series and future values obtained by time series prediction are combined to construct the input of local kernel principal component analysis(LKPCA) model for incipient fault prognosis.The new method has been exemplified in a simple nonlinear process and the complicated Tennessee Eastman(TE) benchmark process.The results indicate that the new method has superiority in the fault prognosis sensitivity over other traditional fault prognosis methods. 展开更多
关键词 差错预后 时间延期评价 本地核主管部件分析
下载PDF
A deep kernel method for lithofacies identification using conventional well logs 被引量:1
13
作者 Shao-Qun Dong Zhao-Hui Zhong +5 位作者 Xue-Hui Cui Lian-Bo Zeng Xu Yang Jian-jun Liu Yan-Ming Sun jing-Ru Hao 《Petroleum Science》 SCIE EI CAS CSCD 2023年第3期1411-1428,共18页
How to fit a properly nonlinear classification model from conventional well logs to lithofacies is a key problem for machine learning methods.Kernel methods(e.g.,KFD,SVM,MSVM)are effective attempts to solve this issue... How to fit a properly nonlinear classification model from conventional well logs to lithofacies is a key problem for machine learning methods.Kernel methods(e.g.,KFD,SVM,MSVM)are effective attempts to solve this issue due to abilities of handling nonlinear features by kernel functions.Deep mining of log features indicating lithofacies still needs to be improved for kernel methods.Hence,this work employs deep neural networks to enhance the kernel principal component analysis(KPCA)method and proposes a deep kernel method(DKM)for lithofacies identification using well logs.DKM includes a feature extractor and a classifier.The feature extractor consists of a series of KPCA models arranged according to residual network structure.A gradient-free optimization method is introduced to automatically optimize parameters and structure in DKM,which can avoid complex tuning of parameters in models.To test the validation of the proposed DKM for lithofacies identification,an open-sourced dataset with seven con-ventional logs(GR,CAL,AC,DEN,CNL,LLD,and LLS)and lithofacies labels from the Daniudi Gas Field in China is used.There are eight lithofacies,namely clastic rocks(pebbly,coarse,medium,and fine sand-stone,siltstone,mudstone),coal,and carbonate rocks.The comparisons between DKM and three commonly used kernel methods(KFD,SVM,MSVM)show that(1)DKM(85.7%)outperforms SVM(77%),KFD(79.5%),and MSVM(82.8%)in accuracy of lithofacies identification;(2)DKM is about twice faster than the multi-kernel method(MSVM)with good accuracy.The blind well test in Well D13 indicates that compared with the other three methods DKM improves about 24%in accuracy,35%in precision,41%in recall,and 40%in F1 score,respectively.In general,DKM is an effective method for complex lithofacies identification.This work also discussed the optimal structure and classifier for DKM.Experimental re-sults show that(m_(1),m_(2),O)is the optimal model structure and linear svM is the optimal classifier.(m_(1),m_(2),O)means there are m KPCAs,and then m2 residual units.A workflow to determine an optimal classifier in DKM for lithofacies identification is proposed,too. 展开更多
关键词 Lithofacies identification Deepkernel method Well logs Residual unit kernel principal component analysis Gradient-free optimization
下载PDF
Air Traffic Operation Complexity Analysis Based on Metrics System
14
作者 Xie Hua Cong Wei +1 位作者 Hu Ming hua Liu Sifeng 《Transactions of Nanjing University of Aeronautics and Astronautics》 EI CSCD 2016年第4期461-468,共8页
In order to quantitatively analyze air traffic operation complexity,multidimensional metrics were selected based on the operational characteristics of traffic flow.The kernel principal component analysis method was ut... In order to quantitatively analyze air traffic operation complexity,multidimensional metrics were selected based on the operational characteristics of traffic flow.The kernel principal component analysis method was utilized to reduce the dimensionality of metrics,therefore to extract crucial information in the metrics.The hierarchical clustering method was used to analyze the complexity of different airspace.Fourteen sectors of Guangzhou Area Control Center were taken as samples.The operation complexity of traffic situation in each sector was calculated based on real flight radar data.Clustering analysis verified the feasibility and rationality of the method,and provided a reference for airspace operation and management. 展开更多
关键词 operation complexity traffic metrics kernel primary component analysis hierarchical clustering
下载PDF
FEATURE-EXTRACT ANALYSIS OF SERIAL ANALYSIS OF GENE EXPRESSION DATA
15
作者 Su Hongquan Zhu Yisheng 《Journal of Electronics(China)》 2010年第6期848-852,共5页
Serial Analysis of Gene Expression (SAGE) is a powerful tool to analyze whole-genome expression profiles. SAGE data, characterized by large quantity and high dimensions, need reducing their dimensions and extract feat... Serial Analysis of Gene Expression (SAGE) is a powerful tool to analyze whole-genome expression profiles. SAGE data, characterized by large quantity and high dimensions, need reducing their dimensions and extract feature to improve the accuracy and efficiency when they are used for pattern recognition and clustering analysis. A Poisson Model-based Kernel (PMK) was proposed based on the Poisson distribution of the SAGE data. Kernel Principle Component Analysis (KPCA) with PMK was proposed and used in feature-extract analysis of mouse retinal SAGE data. The computa-tional results show that this algorithm can extract feature effectively and reduce dimensions of SAGE data. 展开更多
关键词 Serial analysis of Gene Expression (SAGE) Poisson distribution kernel methods Principle component analysis (PCA)
下载PDF
Multivariate Cluster and Principle Component Analyses of Selected Yield Traits in Uzbek Bread Wheat Cultivars 被引量:1
16
作者 Shokista Sh. Adilova Dilafruz E. Qulmamatova +2 位作者 Saidmurad K. Baboev Tohir A. Bozorov Aleksey I. Morgunov 《American Journal of Plant Sciences》 2020年第6期903-912,共10页
Investigation of genetic diversity of geographically distant wheat genotypes is </span><span style="font-family:Verdana;">a </span><span style="font-family:Verdana;">useful ... Investigation of genetic diversity of geographically distant wheat genotypes is </span><span style="font-family:Verdana;">a </span><span style="font-family:Verdana;">useful approach in wheat breeding providing efficient crop varieties. This article presents multivariate cluster and principal component analyses (PCA) of some yield traits of wheat, such as thousand-kernel weight (TKW), grain number, grain yield and plant height. Based on the results, an evaluation of economically valuable attributes by eigenvalues made it possible to determine the components that significantly contribute to the yield of common wheat genotypes. Twenty-five genotypes were grouped into four clusters on the basis of average linkage. The PCA showed four principal components (PC) with eigenvalues ></span><span style="font-family:""> </span><span style="font-family:Verdana;">1, explaining approximately 90.8% of the total variability. According to PC analysis, the variance in the eigenvalues was </span><span style="font-family:Verdana;">the </span><span style="font-family:Verdana;">greatest (4.33) for PC-1, PC-2 (1.86) and PC-3 (1.01). The cluster analysis revealed the classification of 25 accessions into four diverse groups. Averages, standard deviations and variances for clusters based on morpho-physiological traits showed that the maximum average values for grain yield (742.2), biomass (1756.7), grains square meter (18</span><span style="font-family:Verdana;">,</span><span style="font-family:Verdana;">373.7), and grains per spike (45.3) were higher in cluster C compared to other clusters. Cluster D exhibited the maximum thousand-kernel weight (TKW) (46.6). 展开更多
关键词 Bread Wheat Principal component analysis Dispersion Cluster analysis Grain Yield Spike Number Per Square Meter Drought Stress Thousand-kernel Weight
下载PDF
Anomaly Detection of UAV State Data Based on Single-Class Triangular Global Alignment Kernel Extreme Learning Machine
17
作者 Feisha Hu Qi Wang +2 位作者 Haijian Shao Shang Gao Hualong Yu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第9期2405-2424,共20页
Unmanned Aerial Vehicles(UAVs)are widely used and meet many demands in military and civilian fields.With the continuous enrichment and extensive expansion of application scenarios,the safety of UAVs is constantly bein... Unmanned Aerial Vehicles(UAVs)are widely used and meet many demands in military and civilian fields.With the continuous enrichment and extensive expansion of application scenarios,the safety of UAVs is constantly being challenged.To address this challenge,we propose algorithms to detect anomalous data collected from drones to improve drone safety.We deployed a one-class kernel extreme learning machine(OCKELM)to detect anomalies in drone data.By default,OCKELM uses the radial basis(RBF)kernel function as the kernel function of themodel.To improve the performance ofOCKELM,we choose a TriangularGlobalAlignmentKernel(TGAK)instead of anRBF Kernel and introduce the Fast Independent Component Analysis(FastICA)algorithm to reconstruct UAV data.Based on the above improvements,we create a novel anomaly detection strategy FastICA-TGAK-OCELM.The method is finally validated on the UCI dataset and detected on the Aeronautical Laboratory Failures and Anomalies(ALFA)dataset.The experimental results show that compared with other methods,the accuracy of this method is improved by more than 30%,and point anomalies are effectively detected. 展开更多
关键词 UAV safety kernel extreme learning machine triangular global alignment kernel fast independent component analysis
下载PDF
流程生产安全数智化监测系统传感器故障诊断研究
18
作者 张建荣 张伟 +1 位作者 赵挺生 苗雨 《中国安全生产科学技术》 CAS CSCD 北大核心 2024年第4期34-41,共8页
为保障流程生产安全监测数据的准确性,提出1种结合核主元分析和累积残差贡献率法的故障诊断方法。首先提出“感知-汇聚-决策”的多层级数智化监控系统架构;针对感知层传感器,基于核主元分析构建故障检测模型并通过累积残差贡献率法定位... 为保障流程生产安全监测数据的准确性,提出1种结合核主元分析和累积残差贡献率法的故障诊断方法。首先提出“感知-汇聚-决策”的多层级数智化监控系统架构;针对感知层传感器,基于核主元分析构建故障检测模型并通过累积残差贡献率法定位故障传感器;以DYTG转炉厂连铸作业区进行实证分析。研究结果表明:该故障诊断方法在SPE指标上的平均检测率和平均误检率分别为95.28%和2.61%,在T^(2)指标上的平均检测率和平均误检率分别为84.36%和1.71%,且针对4种故障形式均能精准定位故障传感器。研究结果有助于降低监测系统的维护成本,提升流程生产安全管控水平。 展开更多
关键词 流程生产 传感器 故障诊断 核主元分析 累积残差
下载PDF
基于机器学习的茶树DNA聚类算法
19
作者 杨小平 倪萍 +4 位作者 诸葛天秋 罗跃新 郭春雨 庞月兰 吴雨婷 《广西大学学报(自然科学版)》 CAS 北大核心 2024年第2期386-399,共14页
为了研究茶树基因序列的聚类问题,设计一种基于累计方差贡献率进行改进的核主成分分析(KPCA)与k均值(k-means)++聚类算法相结合的降维聚类算法(KPCA-k-means++)。将基因库数据集筛选分组后,利用k-mers算法提取基因数据的数据特征,根据... 为了研究茶树基因序列的聚类问题,设计一种基于累计方差贡献率进行改进的核主成分分析(KPCA)与k均值(k-means)++聚类算法相结合的降维聚类算法(KPCA-k-means++)。将基因库数据集筛选分组后,利用k-mers算法提取基因数据的数据特征,根据累计方差贡献率的占比大于85%的标准确定降维主元个数对KPCA进行降维改进并采用k-means++算法对降维后数据聚类,通过CH(Calinski-Harabaze Index)指标和响应时间分析聚类结果。结果表明:在单独聚类、KPCA聚类、改进PCA聚类、改进KPCA聚类4种处理方式中,改进KPCA-k-means++算法在不同处理方式和不同样本数的对比下,CH指标均为最高,与未改进时相比平均高出33%。在响应时间方面,改进KPCA-k-means++算法与同样改进PCA-k-means++算法在不同聚类数和样本数的对比下响应时间均较短。改进KPCA-k-means++算法能够保证对于茶树的基因序列的聚类准确率和聚类速度,表现出极好的聚类稳定性。 展开更多
关键词 核主成分分析 累计方差贡献率 K均值聚类算法 基因聚类
下载PDF
基于KPCA-PSO-ELM算法的地表水化学需氧量紫外-可见吸收光谱检测研究
20
作者 郑培超 周椿棪 +5 位作者 王金梅 尹义同 张莉 吕强 曾金锐 何雨欣 《光谱学与光谱分析》 SCIE EI CAS CSCD 北大核心 2024年第3期707-713,共7页
化学需氧量(COD)是水质检测重要指标之一,反映水体有机物含量。传统的COD化学检测方法存在操作繁琐,等待时间长,二次污染等缺点。紫外-可见吸收光谱法是目前水体化学需氧量检测中应用最为广泛的方法之一,具有检测快速、无污染等特点。... 化学需氧量(COD)是水质检测重要指标之一,反映水体有机物含量。传统的COD化学检测方法存在操作繁琐,等待时间长,二次污染等缺点。紫外-可见吸收光谱法是目前水体化学需氧量检测中应用最为广泛的方法之一,具有检测快速、无污染等特点。为了满足地表水化学需氧量快速、实时、在线监测等要求,采用紫外-可见吸收光谱进行测量,提出了内核主成分分析(KPCA)结合粒子群优化极限学习机(PSO-ELM)预测模型,满足当前对地表水化学需氧量快速、实时监测的要求。对光谱进行Savitzky-Golay(SG)滤波以降低随机噪声的影响;用积分光谱代替原光谱,以降低信号波动带来的影响;再将得到的光谱信息归一化,消除不同光谱数据量纲的影响。将预处理后的数据利用KPCA算法将全光谱数据压缩为5个特征,有效解决光谱信息冗余的问题;采用PSO算法对ELM的权重和偏置进行优化极大提高了模型的精度。对217个河流、长江及支流、湖库等地表水样本按照7∶3随机划分成训练集和测试集,并进行建模测试,其中训练集拟合优度(R2)为0.930 2、均方根误差(RMSE)为0.363 0 mg·L^(-1)、测试集拟合优度R2为0.931 9、均方根误差(RMSE)为0.400 7 mg·L^(-1)。为了验证提出的基于KPCA全光谱数据压缩方法对预测模型的提升效果,分别对比了主成分分析(PCA)、连续投影算法(SPA)、套索回归(LASSO)等特征处理算法。PCA-PSO-ELM模型的RMSE为0.715 1 mg·L^(-1)、 SPA-PSO-ELM模型的RMSE为0.473 7 mg·L^(-1)、 LASSO-PSO-ELM模型的RMSE为0.412 6 mg·L^(-1), KPCA-PSO-ELM模型较上述三种模型,RMSE分别降低了78.46%、 18.22%、 2.97%,结果表明KPCA是一种高效的光谱降维算法,能够有效消除光谱冗余信息,提升模型预测精度。基于KPCA-PSO-ELM预测模型结合紫外-可见吸收光谱可以实现对地表水COD快速、实时检测,为在线COD检测场景提供方法支撑。 展开更多
关键词 化学需氧量 紫外-可见吸收光谱 内核主成分分析 极限学习机
下载PDF
上一页 1 2 66 下一页 到第
使用帮助 返回顶部