期刊文献+
共找到212篇文章
< 1 2 11 >
每页显示 20 50 100
A Short-Term Traffic Flow Forecasting Method Based on a Three-Layer K-Nearest Neighbor Non-Parametric Regression Algorithm 被引量:7
1
作者 Xiyu Pang Cheng Wang Guolin Huang 《Journal of Transportation Technologies》 2016年第4期200-206,共7页
Short-term traffic flow is one of the core technologies to realize traffic flow guidance. In this article, in view of the characteristics that the traffic flow changes repeatedly, a short-term traffic flow forecasting... Short-term traffic flow is one of the core technologies to realize traffic flow guidance. In this article, in view of the characteristics that the traffic flow changes repeatedly, a short-term traffic flow forecasting method based on a three-layer K-nearest neighbor non-parametric regression algorithm is proposed. Specifically, two screening layers based on shape similarity were introduced in K-nearest neighbor non-parametric regression method, and the forecasting results were output using the weighted averaging on the reciprocal values of the shape similarity distances and the most-similar-point distance adjustment method. According to the experimental results, the proposed algorithm has improved the predictive ability of the traditional K-nearest neighbor non-parametric regression method, and greatly enhanced the accuracy and real-time performance of short-term traffic flow forecasting. 展开更多
关键词 Three-Layer Traffic Flow Forecasting K-Nearest neighbor Non-Parametric regression
下载PDF
The k Nearest Neighbors Estimator of the M-Regression in Functional Statistics 被引量:4
2
作者 Ahmed Bachir Ibrahim Mufrah Almanjahie Mohammed Kadi Attouch 《Computers, Materials & Continua》 SCIE EI 2020年第12期2049-2064,共16页
It is well known that the nonparametric estimation of the regression function is highly sensitive to the presence of even a small proportion of outliers in the data.To solve the problem of typical observations when th... It is well known that the nonparametric estimation of the regression function is highly sensitive to the presence of even a small proportion of outliers in the data.To solve the problem of typical observations when the covariates of the nonparametric component are functional,the robust estimates for the regression parameter and regression operator are introduced.The main propose of the paper is to consider data-driven methods of selecting the number of neighbors in order to make the proposed processes fully automatic.We use thek Nearest Neighbors procedure(kNN)to construct the kernel estimator of the proposed robust model.Under some regularity conditions,we state consistency results for kNN functional estimators,which are uniform in the number of neighbors(UINN).Furthermore,a simulation study and an empirical application to a real data analysis of octane gasoline predictions are carried out to illustrate the higher predictive performances and the usefulness of the kNN approach. 展开更多
关键词 Functional data analysis quantile regression kNN method uniform nearest neighbor(UNN)consistency functional nonparametric statistics almost complete convergence rate
下载PDF
Unveiling the Predictive Capabilities of Machine Learning in Air Quality Data Analysis: A Comparative Evaluation of Different Regression Models
3
作者 Mosammat Mustari Khanaum Md Saidul Borhan +2 位作者 Farzana Ferdoush Mohammed Ali Nause Russel Mustafa Murshed 《Open Journal of Air Pollution》 2023年第4期142-159,共18页
Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for rep... Air quality is a critical concern for public health and environmental regulation. The Air Quality Index (AQI), a widely adopted index by the US Environmental Protection Agency (EPA), serves as a crucial metric for reporting site-specific air pollution levels. Accurately predicting air quality, as measured by the AQI, is essential for effective air pollution management. In this study, we aim to identify the most reliable regression model among linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), logistic regression, and K-nearest neighbors (KNN). We conducted four different regression analyses using a machine learning approach to determine the model with the best performance. By employing the confusion matrix and error percentages, we selected the best-performing model, which yielded prediction error rates of 22%, 23%, 20%, and 27%, respectively, for LDA, QDA, logistic regression, and KNN models. The logistic regression model outperformed the other three statistical models in predicting AQI. Understanding these models' performance can help address an existing gap in air quality research and contribute to the integration of regression techniques in AQI studies, ultimately benefiting stakeholders like environmental regulators, healthcare professionals, urban planners, and researchers. 展开更多
关键词 regression Analysis Air Quality Index Linear Discriminant Analysis Quadratic Discriminant Analysis Logistic regression K-Nearest neighbors Machine Learning Big Data Analysis
下载PDF
Improved scheme to accelerate support vector regression 被引量:1
4
作者 Zhao Yongping Sun Jianguo 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第5期1086-1090,共5页
The computational cost of support vector regression in the training phase is O (N^3), which is very expensive for a large scale problem. In addition, the solution of support vector regression is of parsimoniousness,... The computational cost of support vector regression in the training phase is O (N^3), which is very expensive for a large scale problem. In addition, the solution of support vector regression is of parsimoniousness, which has relation to a part of the whole training data set. Hence, it is reasonable to reduce the training data set. Aiming at the scheme based on k-nearest neighbors to reduce the training data set with the computational complexity O (kMN^2), an improved scheme is proposed to accelerate the reducing phase, which cuts down the computational complexity from O (kMN^2) to O (MN^2). Finally, experimental results on benchmark data sets validate the effectiveness of the improved scheme. 展开更多
关键词 support vector regression parsimoniousness k-nearest neighbors computational complexity.
下载PDF
Floating Car Data Based Nonparametric Regression Model for Short-Term Travel Speed Prediction 被引量:2
5
作者 翁剑成 扈中伟 +1 位作者 于泉 任福田 《Journal of Southwest Jiaotong University(English Edition)》 2007年第3期223-230,共8页
A K-nearest neighbor (K-NN) based nonparametric regression model was proposed to predict travel speed for Beijing expressway. By using the historical traffic data collected from the detectors in Beijing expressways,... A K-nearest neighbor (K-NN) based nonparametric regression model was proposed to predict travel speed for Beijing expressway. By using the historical traffic data collected from the detectors in Beijing expressways, a specically designed database was developed via the processes including data filtering, wavelet analysis and clustering. The relativity based weighted Euclidean distance was used as the distance metric to identify the K groups of nearest data series. Then, a K-NN nonparametric regression model was built to predict the average travel speeds up to 6 min into the future. Several randomly selected travel speed data series, collected from the floating car data (FCD) system, were used to validate the model. The results indicate that using the FCD, the model can predict average travel speeds with an accuracy of above 90%, and hence is feasible and effective. 展开更多
关键词 K-Nearest neighbor Short-term prediction Travel speed Nonparametric regression Intelligence transportation system( ITS Floating car data (FCD)
下载PDF
煤矿井下钻进速度影响因素及其智能预测方法研究
6
作者 戴剑博 王忠宾 +6 位作者 张琰 司垒 魏东 周文博 顾进恒 邹筱瑜 宋雨雨 《煤炭科学技术》 EI CAS CSCD 北大核心 2024年第7期209-221,共13页
在煤矿井下钻探领域,钻进速度(DR)是评估钻探作业最有效的指标之一,钻速预测是实现煤矿钻进智能化的前提条件,对于优化钻机钻进参数、降低作业成本、实现安全高效钻探具有重要意义。为此,提出煤矿井下钻进速度影响因素及其智能预测方法... 在煤矿井下钻探领域,钻进速度(DR)是评估钻探作业最有效的指标之一,钻速预测是实现煤矿钻进智能化的前提条件,对于优化钻机钻进参数、降低作业成本、实现安全高效钻探具有重要意义。为此,提出煤矿井下钻进速度影响因素及其智能预测方法研究,探索基于钻压、转速、扭矩以及钻进深度等少量钻机参数采用机器学习算法实现钻进速度精准预测。首先通过实验室微钻试验,深入分析煤岩力学性能、钻压、转速和钻进深度对扭矩、钻进速度影响规律。研究结果显示,在煤矿井下钻进过程中,随着钻进压力增大,钻进速度呈逐渐升高趋势,在较高的转速条件下钻进压力对钻进速度影响更加明显,转速增加有利于提高钻进速度,但转速对硬度较低的煤层钻进速度影响更为显著;然后,根据煤矿井下防冲钻孔现场数据,采用K–近邻(KNN)、支持向量回归(SVR)和随机森林回归(RFR)3种不同的机器学习算法建立钻进速度预测模型,并结合粒子群算法(PSO)对3种模型超参数进行优化,最后对比分析PSO–KNN,PSO–SVR和PSO–RFR三种钻进速度预测模型预测结果。研究结果表明,PSO–RFR模型准确性最好,决定系数R2高达0.963,均方误差MSE仅有29.742,而PSO–SVR模型鲁棒性最好,在对抗攻击后评价指标变化率最小。本文研究有助于实现煤矿井下钻进速度的精准预测,为煤矿井下智能钻进参数优化提供理论支撑。 展开更多
关键词 钻机参数 K–近邻 随机森林回归 支持向量回归 粒子群算法 钻进速度预测
下载PDF
基于集成学习的交通事故严重程度预测研究与应用 被引量:1
7
作者 单永航 张希 +2 位作者 胡川 丁涛军 姚远 《计算机工程》 CAS CSCD 北大核心 2024年第2期33-42,共10页
目前自动驾驶技术重点是关注如何主动避免碰撞,然而在面对其他交通参与者入侵而导致不可避免的碰撞事故场景时,预测车辆在不同行驶模式下的碰撞严重程度来降低事故严重程度的研究却很少。为此,提出一种双层Stacking事故严重程度预测模... 目前自动驾驶技术重点是关注如何主动避免碰撞,然而在面对其他交通参与者入侵而导致不可避免的碰撞事故场景时,预测车辆在不同行驶模式下的碰撞严重程度来降低事故严重程度的研究却很少。为此,提出一种双层Stacking事故严重程度预测模型。基于真实交通事故数据集NASS-CDS完成训练,模型输入为车辆传感器可感知得到的事故相关特征,输出为车内乘员最高受伤级别。在第1层中,通过实验对不同学习器组合进行训练,最终综合考虑预测性能以及耗时挑选K近邻、自适应提升树、极度梯度提升树作为基学习器;在第2层中,为降低过拟合,采用逻辑回归作为元学习器。实验结果表明,该方法准确率达到85.01%,在精确率、召回率和F1值方面优于其他个体模型和集成模型,该预测结果可作为智能车辆决策规划模块先验信息,帮助车辆做出正确的决策,减缓事故损害。最后阐述了模型在L_(2)辅助驾驶与L_(4)自动驾驶车辆中的应用,在常规车辆安全防护的基础上进一步提升车辆的安全性。 展开更多
关键词 交通安全 交通事故严重程度预测 智能车辆 集成学习 K近邻 自适应提升树 极度梯度提升树 逻辑回归
下载PDF
A Study of EM Algorithm as an Imputation Method: A Model-Based Simulation Study with Application to a Synthetic Compositional Data
8
作者 Yisa Adeniyi Abolade Yichuan Zhao 《Open Journal of Modelling and Simulation》 2024年第2期33-42,共10页
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode... Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance. 展开更多
关键词 Compositional Data Linear regression Model Least Square Method Robust Least Square Method Synthetic Data Aitchison Distance Maximum Likelihood Estimation Expectation-Maximization Algorithm k-Nearest neighbor and Mean imputation
下载PDF
集成多方法的废酸装置风机K7200轴承故障诊断
9
作者 王姣娟 豆宏斌 何宇春 《石油工业技术监督》 2024年第1期11-15,共5页
在废酸装置风机K7200中,轴承作为重要的机械部件,准确判断其故障(健康状态、内圈故障、外圈故障和滚动体故障)可以提高维修效率。克服实际作业场景中人工诊断的缺点,提出了集成多方法的轴承故障诊断策略:分别采用K最近邻算法(简称KNN)... 在废酸装置风机K7200中,轴承作为重要的机械部件,准确判断其故障(健康状态、内圈故障、外圈故障和滚动体故障)可以提高维修效率。克服实际作业场景中人工诊断的缺点,提出了集成多方法的轴承故障诊断策略:分别采用K最近邻算法(简称KNN)、逻辑回归(简称LR)和决策树(简称DT)进行诊断,对结果进行投票集成。实验结果表明,采用集成多方法的故障诊断法较KNN、LR和DT算法,故障诊断的准确率分别提升了3.69%、5.03%、6.3%。 展开更多
关键词 废酸装置风机 轴承 故障诊断 K最近邻算法 逻辑回归 决策树 集成
下载PDF
Propagation Path Loss Models at 28 GHz Using K-Nearest Neighbor Algorithm
10
作者 Vu Thanh Quang Dinh Van Linh To Thi Thao 《通讯和计算机(中英文版)》 2022年第1期1-8,共8页
In this paper,we develop and apply K-Nearest Neighbor algorithm to propagation pathloss regression.The path loss models present the dependency of attenuation value on distance using machine learning algorithms based o... In this paper,we develop and apply K-Nearest Neighbor algorithm to propagation pathloss regression.The path loss models present the dependency of attenuation value on distance using machine learning algorithms based on the experimental data.The algorithm is performed by choosing k nearest points and training dataset to find the optimal k value.The proposed method is applied to impove and adjust pathloss model at 28 GHz in Keangnam area,Hanoi,Vietnam.The experiments in both line-of-sight and non-line-of-sight scenarios used many combinations of transmit and receive antennas at different transmit antenna heights and random locations of receive antenna have been carried out using Wireless Insite Software.The results have been compared with 3GPP and NYU Wireless Path Loss Models in order to verify the performance of the proposed approach. 展开更多
关键词 K-nearest neighbor regression 5G millimeter waves path loss
下载PDF
Wireless Communication Signal Strength Prediction Method Based on the K-nearest Neighbor Algorithm
11
作者 Zhao Chen Ning Xiong +6 位作者 Yujue Wang Yong Ding Hengkui Xiang Chenjun Tang Lingang Liu Xiuqing Zou Decun Luo 《国际计算机前沿大会会议论文集》 2019年第1期238-240,共3页
Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically ... Existing interference protection systems lack automatic evaluation methods to provide scientific, objective and accurate assessment results. To address this issue, this paper develops a layout scheme by geometrically modeling the actual scene, so that the hand-held full-band spectrum analyzer would be able to collect signal field strength values for indoor complex scenes. An improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression was proposed to predict the signal field strengths for the whole plane before and after being shield. Then the highest accuracy set of data could be picked out by comparison. The experimental results show that the improved prediction algorithm based on the K-nearest neighbor non-parametric kernel regression can scientifically and objectively predict the indoor complex scenes’ signal strength and evaluate the interference protection with high accuracy. 展开更多
关键词 INTERFERENCE protection K-nearest neighbor algorithm NON-PARAMETRIC KERNEL regression SIGNAL field STRENGTH
下载PDF
融合分数阶微分与PIMP-RF算法的集成学习模型预测成熟期苹果可溶性固形物含量
12
作者 黄华 刘亚 +5 位作者 库尔班古丽·都力昆 曾繁琳 玛依热·麦麦提 阿瓦古丽·麦麦提 买地努尔汗·艾则孜 郭俊先 《光谱学与光谱分析》 SCIE EI CAS CSCD 北大核心 2023年第10期3059-3066,共8页
可溶性固形物含量(SSC)是反映苹果品质和成熟度的重要生理指标,能够用于苹果品质分析和成熟度预测。以新疆阿克苏冰糖心红富士苹果为研究对象,从果实膨大定形期至完熟期,以等间隔周期3 d采摘样本,测其380~1100 nm的可见/近红外光谱和SSC... 可溶性固形物含量(SSC)是反映苹果品质和成熟度的重要生理指标,能够用于苹果品质分析和成熟度预测。以新疆阿克苏冰糖心红富士苹果为研究对象,从果实膨大定形期至完熟期,以等间隔周期3 d采摘样本,测其380~1100 nm的可见/近红外光谱和SSC,共552个样本。然后融合分数阶微分(FD)及置换重要性-随机森林(PIMP-RF)算法,构建成熟期苹果SSC预测的集成学习模型。结果表明,基于PLS模型优选的分数阶微分阶次为0阶、0.4阶、1.1阶和1.6阶,且通过PIMP-RF算法进行特征重要性和可解释性分析结果显示,利用可见/近红外光谱预测成熟期苹果SSC的关键波长主要为可见光波段,这为今后研发新疆冰糖心红富士苹果的快速无损检测设备提供参考;基于分数阶微分技术和PIMP-RF算法构建的成熟期苹果SSC集成学习模型具有很好的预测能力,其训练集的相关系数r等于0.9892,平均绝对误差MAE等于0.2412,均方根误差RMSE等于0.3091,平均绝对百分误差等于0.0183;测试集的相关系数r等于0.9038,平均绝对误差MAE等于0.5499,均方根误差RMSE等于0.7408,平均绝对百分误差等于0.0434,相比于FD0-PIMP-RF、FD0.4-PIMP-RF、FD1.1-PIMP-RF和FD1.6-PIMP-RF模型,集成学习模型为最优。故而,集成分数阶微分技术与PIMP-RF算法,结合可见近红外光谱技术可有效地实现成熟期苹果的可溶性固形物含量预测。 展开更多
关键词 可见/近红外光谱 分数阶微分 置换重要性-随机森林 K近邻(KNN)回归 可溶性固形物含量
下载PDF
基于K近邻非参数回归的压痕弹性模量估计
13
作者 金宏平 《湖北汽车工业学院学报》 2023年第4期76-80,共5页
基于球压痕过程的有限元分析,建立了无量纲压痕功和无量纲压痕弹性模量的数据集。采用相关性分析方法发现球压痕的特征参数之间存在明显的非线性特性。结果表明:相对于K近邻、加权K近邻和高斯K近邻,采用5近邻和曼哈顿距离的模糊K近邻回... 基于球压痕过程的有限元分析,建立了无量纲压痕功和无量纲压痕弹性模量的数据集。采用相关性分析方法发现球压痕的特征参数之间存在明显的非线性特性。结果表明:相对于K近邻、加权K近邻和高斯K近邻,采用5近邻和曼哈顿距离的模糊K近邻回归估计算法来估计压痕弹性模量,能够获得较高精度的压痕弹性模量。 展开更多
关键词 压痕弹性模量 K近邻 回归 曼哈顿距离
下载PDF
基于纵向联邦学习的短期风电功率协同预测方法 被引量:1
14
作者 赵寒亭 张耀 +3 位作者 霍巍 王建学 吴峰 张衡 《电力系统自动化》 EI CSCD 北大核心 2023年第16期44-53,共10页
由于风力资源具有时空相关性,使用邻近场站的相关数据可以提高待预测场站的预测精度。然而不同场站通常分属不同发电集团,由于商业竞争和数据安全考量,彼此难以获得对方的隐私数据。针对上述问题,首先,提出了基于改进k近邻算法的岭回归... 由于风力资源具有时空相关性,使用邻近场站的相关数据可以提高待预测场站的预测精度。然而不同场站通常分属不同发电集团,由于商业竞争和数据安全考量,彼此难以获得对方的隐私数据。针对上述问题,首先,提出了基于改进k近邻算法的岭回归预测模型;然后,在纵向联邦学习的机制下,采用同步梯度下降算法对所提预测模型进行迭代求解;利用梯度向量可拆分计算的特点,推导了风电预测模型的分布式训练过程和分布式预测过程,将原本的大规模预测问题分解为大量的小规模子问题,且每个子问题由相应的风电场站在本地进行计算。在保证各参与方数据隐私安全的基础上,可以有效利用邻近场站的数据信息,从而提高风电功率预测精度。最后,以实际算例验证了所提方法的有效性。 展开更多
关键词 风电预测 岭回归 K近邻算法 梯度下降 纵向联邦学习 分布式优化
下载PDF
应用机器学习模型与线性模型预测森林蓄积生长量的精度 被引量:2
15
作者 雷媛媛 王新杰 《东北林业大学学报》 CAS CSCD 北大核心 2023年第9期72-75,82,共5页
为了探究不同模型对森林蓄积生长量的预测精度和影响因素,以吉林省汪清金沟岭林场为研究区,利用林场198个固定样地(20 m×20 m)中的乔木数据,建立多元线性回归(MLR)、随机森林(RF)、支持向量机(SVM)、最近邻法(KNN)模型,分析树木蓄... 为了探究不同模型对森林蓄积生长量的预测精度和影响因素,以吉林省汪清金沟岭林场为研究区,利用林场198个固定样地(20 m×20 m)中的乔木数据,建立多元线性回归(MLR)、随机森林(RF)、支持向量机(SVM)、最近邻法(KNN)模型,分析树木蓄积生长量与林分因子、地形因子和气候因子的关系,并比较不同模型的预测精度。结果表明:森林蓄积生长量与林分每公顷断面积(B_(A))、大于对象木断面积(B_(L))、海拔(A_(L))、林分密度(N)、年均降水量(P)显著相关,与平均胸径(D)、坡向(A_(s))、坡度(S_(L))、年均气温(T)并不相关;不同模型的预测结果存在差异,随机森林(RF)方法所有测试指标最佳;随机森林方法得出的平均蓄积生长量预测值为51.45 m^(3)·hm^(-2),该模型均方根误差(R_(MSE))最低(4.62),决定系数(R2)最高(0.91)。 展开更多
关键词 林分蓄积 多元线性回归 最近邻法 支持向量机 随机森林
下载PDF
基于数据填补的煤自燃温度预测模型 被引量:3
16
作者 翟小伟 罗金雷 +3 位作者 张羽琛 宋波波 郝乐 周妤婕 《工矿自动化》 CSCD 北大核心 2023年第1期28-35,98,共9页
现有煤自燃温度预测模型的建立大多基于较为完整的指标气体样本数据,但指标气体数据受仪器或人为因素影响,往往存在数据缺失现象,导致煤自燃温度预测准确率较低和过拟合等问题。针对上述问题,提出了将K近邻算法(KNN)、随机森林(RF)、决... 现有煤自燃温度预测模型的建立大多基于较为完整的指标气体样本数据,但指标气体数据受仪器或人为因素影响,往往存在数据缺失现象,导致煤自燃温度预测准确率较低和过拟合等问题。针对上述问题,提出了将K近邻算法(KNN)、随机森林(RF)、决策树(DT)及基于粒子群优化的支持向量回归等填补算法(PSO-SVR)应用于缺失值填补,缺失数据和填补后的数据通过RF、SVR和极限梯度提升树(XGBoost)算法分别进行训练,并通过PSO算法优化参数,构建了基于数据填补的RF、XGBoost和SVR煤自燃温度预测模型。利用煤自然发火实验选取CO,CO_(2),CH4,C_(2)H_(6),O_(2)作为指标气体,并设计整体缺失率为10%,20%,30%和CO,CO_(2)缺失率为40%,50%,60%共6种随机数据缺失,采用平均绝对误差百分比(MAPE)作为填补效果评价指标,采用MAPE、判断系数R^(2)和均方根误差(RMSE)作为模型性能评价指标,对4种填补算法和3种预测模型进行对比。对比分析结果表明:在6种数据缺失情况下,DT填补算法填补效果优于其他3种算法,在CO,CO_(2)存在较多缺失值时,RF算法的填补值与实际值的MAPE偏大;在不调参的情况下,XGBoost模型虽然在训练集效果极好,但极易过拟合,而SVR模型预测效果极差,无法满足预测要求;在6种数据缺失情况下,基于DT填补算法的PSO-SVR、RF与PSO-RF煤自燃温度预测模型的MAPE均在4%左右,基于DT填补算法的RF模型无需优化就能较好地预测出煤自燃温度,具有良好的稳定性。 展开更多
关键词 煤自燃 温度预测 指标气体 数据缺失填补 K近邻填补算法 随机森林填补算法 决策树回归填补算法 基于粒子群优化的支持向量回归填补算法
下载PDF
基于互邻信息的树型近邻分类方法
17
作者 尹涛 胡新平 +2 位作者 鞠恒荣 黄嘉爽 丁卫平 《南京理工大学学报》 CAS CSCD 北大核心 2023年第2期166-173,共8页
为了提升分布不均匀样本的分类性能,该文提出了一种基于互邻信息的树型近邻(Tree-based k近邻,k Tree)分类方法,以此提高k近邻分类的准确率。首先,采用回归模型刻画样本之间的紧密程度,获取每个样本的最优k值,从而获得最优邻居,并采用k ... 为了提升分布不均匀样本的分类性能,该文提出了一种基于互邻信息的树型近邻(Tree-based k近邻,k Tree)分类方法,以此提高k近邻分类的准确率。首先,采用回归模型刻画样本之间的紧密程度,获取每个样本的最优k值,从而获得最优邻居,并采用k Tree提升搜索效率。其次,对于每一个测试样本,基于互邻信息准则,确定其邻域空间,完成k近邻分类。最后,数据集的试验结果表明,该文提出的基于互邻信息的k Tree分类准确率高于传统k近邻分类等其他分类算法。该文提出的方法也为k近邻分类的改进提供了新的方向。 展开更多
关键词 k近邻分类算法 最优邻居 回归模型 树型近邻模型 数据挖掘
下载PDF
基于回归近邻成分分析和GBRT的室内定位方法
18
作者 王斌涛 冷腾飞 +1 位作者 王益涵 郑家骅 《传感器与微系统》 CSCD 北大核心 2023年第11期66-69,共4页
WiFi指纹定位方法性能易受到室内无线信号波动的影响使得离线指纹存在冗余噪声而导致定位精度不足。对此,本文提出一种改进近邻成分分析(NCA)结合渐近梯度回归树(GBRT)室内定位方法。首先,构造连续可微的目标函数将离散优化问题转化为... WiFi指纹定位方法性能易受到室内无线信号波动的影响使得离线指纹存在冗余噪声而导致定位精度不足。对此,本文提出一种改进近邻成分分析(NCA)结合渐近梯度回归树(GBRT)室内定位方法。首先,构造连续可微的目标函数将离散优化问题转化为连续优化问题,并对离线指纹数据库进行特征提取去除冗余得到离线指纹的主要特征;然后,利用提取特征后的位置指纹数据和特征对应的坐标迭代构造多个CART TREE,利用每个CART TREE损失函数的负梯度值构造集成多个CART TREE得到GBRT定位模型;最后,利用待定位点位置指纹信号特征结合GBRT定位模型预测待定位点位置。实验结果表明:所提出算法相较于其他同类算法误差分别减少14.7%,22.4%,37.1%,能够有效提高定位精度。 展开更多
关键词 室内定位 冗余噪声 近邻成分分析 位置指纹 渐进梯度回归树
下载PDF
煤矿大型机械设备滚动轴承故障诊断改进方法研究 被引量:4
19
作者 彭强 《煤炭工程》 北大核心 2023年第4期141-146,共6页
针对传统的特征选择算法用于煤矿机械轴承故障诊断时将嵌入学习和特征排序分开,无法准确选择出能表征高维数据集的子集和故障诊断准确率不高的问题。文章提出了一种基于嵌入学习与稀疏回归的煤矿机械轴承故障诊断方法。该方法首先构造... 针对传统的特征选择算法用于煤矿机械轴承故障诊断时将嵌入学习和特征排序分开,无法准确选择出能表征高维数据集的子集和故障诊断准确率不高的问题。文章提出了一种基于嵌入学习与稀疏回归的煤矿机械轴承故障诊断方法。该方法首先构造嵌入学习模型,学习高维数据的流形结构;其次,在回归模型中引入具有组稀疏性的l 2,1范数,有效剔除冗余特征;然后联合嵌入学习和稀疏回归构造特征选择框架,选择出能准确表征原始高维数据的本质特征;最后通过与K-最近邻算法相结合进行煤矿机械轴承的故障诊断。实验结果表明,提出的模型显著提高了煤矿机械轴承的故障诊断精度。 展开更多
关键词 煤矿机械轴承 故障诊断 特征选择 嵌入学习 稀疏回归 K-最近邻
下载PDF
助训练框架下的半监督软测量建模方法
20
作者 何罗苏阳 熊伟丽 《智能系统学报》 CSCD 北大核心 2023年第2期231-239,共9页
为了充分利用工业过程中大量无标签样本信息,并减少过程的不确定因素对无标签样本质量的影响,提出一种助训练框架下的半监督孪生支持向量回归软测量建模方法。采用孪生支持向量回归机构建主学习器,对高置信度无标签样本添加伪标签;同时... 为了充分利用工业过程中大量无标签样本信息,并减少过程的不确定因素对无标签样本质量的影响,提出一种助训练框架下的半监督孪生支持向量回归软测量建模方法。采用孪生支持向量回归机构建主学习器,对高置信度无标签样本添加伪标签;同时,基于K近邻算法构建辅学习器,最大化学习器在近邻样本集上的均方误差,经过此项指标筛选后的待处理样本集包含了更多的数据信息;主、辅学习器二者相辅相成,一定程度上提高了模型的泛化性;再利用所构建的助训练框架提高样本利用率后得到预测模型,实现对无标签样本信息的充分挖掘。通过对脱丁烷塔工业过程中的实际数据进行建模仿真,所得结果表明此模型具有良好的预测性能。 展开更多
关键词 软测量建模 半监督 助训练 孪生支持向量回归 K近邻 置信度 学习器 脱丁烷塔
下载PDF
上一页 1 2 11 下一页 到第
使用帮助 返回顶部