期刊文献+
共找到1,605篇文章
< 1 2 81 >
每页显示 20 50 100
Differentially private SGD with random features
1
作者 WANG Yi-guang GUO Zheng-chu 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2024年第1期1-23,共23页
In the realm of large-scale machine learning,it is crucial to explore methods for reducing computational complexity and memory demands while maintaining generalization performance.Additionally,since the collected data... In the realm of large-scale machine learning,it is crucial to explore methods for reducing computational complexity and memory demands while maintaining generalization performance.Additionally,since the collected data may contain some sensitive information,it is also of great significance to study privacy-preserving machine learning algorithms.This paper focuses on the performance of the differentially private stochastic gradient descent(SGD)algorithm based on random features.To begin,the algorithm maps the original data into a lowdimensional space,thereby avoiding the traditional kernel method for large-scale data storage requirement.Subsequently,the algorithm iteratively optimizes parameters using the stochastic gradient descent approach.Lastly,the output perturbation mechanism is employed to introduce random noise,ensuring algorithmic privacy.We prove that the proposed algorithm satisfies the differential privacy while achieving fast convergence rates under some mild conditions. 展开更多
关键词 learning theory differential privacy stochastic gradient descent random features reproducing kernel Hilbert spaces
下载PDF
Parkinson’s Disease Classification Using Random Forest Kerb Feature Selection
2
作者 E.Bharath T.Rajagopalan 《Intelligent Automation & Soft Computing》 SCIE 2023年第5期1417-1433,共17页
Parkinson’s disease(PD)is a neurodegenerative disease cause by a deficiency of dopamine.Investigators have identified the voice as the underlying symptom of PD.Advanced vocal disorder studies provide adequate treatment... Parkinson’s disease(PD)is a neurodegenerative disease cause by a deficiency of dopamine.Investigators have identified the voice as the underlying symptom of PD.Advanced vocal disorder studies provide adequate treatment and support for accurate PD detection.Machine learning(ML)models have recently helped to solve problems in the classification of chronic diseases.This work aims to analyze the effect of selecting features on ML efficiency on a voice-based PD detection system.It includes PD classification models of Random forest,decision Tree,neural network,logistic regression and support vector machine.The feature selection is made by RF mean-decrease in accuracy and mean-decrease in Gini techniques.Random forest kerb feature selection(RFKFS)selects only 17 features from 754 attributes.The proposed technique uses validation metrics to assess the performance of ML models.The results of the RF model with feature selection performed well among all other models with high accuracy score of 96.56%and a precision of 88.02%,a sensitivity of 98.26%,a specificity of 96.06%.The respective validation score has an Non polynomial vector(NPV)of 99.47%,a Geometric Mean(GM)of 97.15%,a Youden’s index(YI)of 94.32%,and a Matthews’s correlation method(MCC)90.84%.The proposed model is also more robust than other models.It was also realised that using the RFKFS approach in the PD results in an effective and high-performing medical classifier. 展开更多
关键词 Parkinson’s disease machine learning healthcare random forest feature selection CLASSIFICATION
下载PDF
Prediction of Alzheimer’s Using Random Forest with Radiomic Features
3
作者 Anuj Singh Raman Kumar Arvind Kumar Tiwari 《Computer Systems Science & Engineering》 SCIE EI 2023年第4期513-530,共18页
Alzheimer’s disease is a non-reversible,non-curable,and progressive neurological disorder that induces the shrinkage and death of a specific neuronal population associated with memory formation and retention.It is a ... Alzheimer’s disease is a non-reversible,non-curable,and progressive neurological disorder that induces the shrinkage and death of a specific neuronal population associated with memory formation and retention.It is a frequently occurring mental illness that occurs in about 60%–80%of cases of dementia.It is usually observed between people in the age group of 60 years and above.Depending upon the severity of symptoms the patients can be categorized in Cognitive Normal(CN),Mild Cognitive Impairment(MCI)and Alzheimer’s Disease(AD).Alzheimer’s disease is the last phase of the disease where the brain is severely damaged,and the patients are not able to live on their own.Radiomics is an approach to extracting a huge number of features from medical images with the help of data characterization algorithms.Here,105 number of radiomic features are extracted and used to predict the alzhimer’s.This paper uses Support Vector Machine,K-Nearest Neighbour,Gaussian Naïve Bayes,eXtreme Gradient Boosting(XGBoost)and Random Forest to predict Alzheimer’s disease.The proposed random forest-based approach with the Radiomic features achieved an accuracy of 85%.This proposed approach also achieved 88%accuracy,88%recall,88%precision and 87%F1-score for AD vs.CN,it achieved 72%accuracy,73%recall,72%precisionand 71%F1-score for AD vs.MCI and it achieved 69%accuracy,69%recall,68%precision and 69%F1-score for MCI vs.CN.The comparative analysis shows that the proposed approach performs better than others approaches. 展开更多
关键词 Alzheimer’s disease radiomic features cognitive normal support vector machine mild cognitive impairment extreme gradient boosting random forest
下载PDF
Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic 被引量:3
4
作者 Cuong Nguyen Yong Wang Ha Nam Nguyen 《Journal of Biomedical Science and Engineering》 2013年第5期551-560,共10页
As the incidence of this disease has increased significantly in the recent years, expert systems and machine learning techniques to this problem have also taken a great attention from many scholars. This study aims at... As the incidence of this disease has increased significantly in the recent years, expert systems and machine learning techniques to this problem have also taken a great attention from many scholars. This study aims at diagnosing and prognosticating breast cancer with a machine learning method based on random forest classifier and feature selection technique. By weighting, keeping useful features and removing redundant features in datasets, the method was obtained to solve diagnosis problems via classifying Wisconsin Breast Cancer Diagnosis Dataset and to solve prognosis problem via classifying Wisconsin Breast Cancer Prognostic Dataset. On these datasets we obtained classification accuracy of 100% in the best case and of around 99.8% on average. This is very promising compared to the previously reported results. This result is for Wisconsin Breast Cancer Dataset but it states that this method can be used confidently for other breast cancer diagnosis problems, too. 展开更多
关键词 BREAST Cancer Diagnosis PROGNOSIS feature Selection random FOREST
下载PDF
Feature Fusion Multi-View Hashing Based on Random Kernel Canonical Correlation Analysis 被引量:2
5
作者 Junshan Tan Rong Duan +2 位作者 Jiaohua Qin Xuyu Xiang Yun Tan 《Computers, Materials & Continua》 SCIE EI 2020年第5期675-689,共15页
Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information mor... Hashing technology has the advantages of reducing data storage and improving the efficiency of the learning system,making it more and more widely used in image retrieval.Multi-view data describes image information more comprehensively than traditional methods using a single-view.How to use hashing to combine multi-view data for image retrieval is still a challenge.In this paper,a multi-view fusion hashing method based on RKCCA(Random Kernel Canonical Correlation Analysis)is proposed.In order to describe image content more accurately,we use deep learning dense convolutional network feature DenseNet to construct multi-view by combining GIST feature or BoW_SIFT(Bag-of-Words model+SIFT feature)feature.This algorithm uses RKCCA method to fuse multi-view features to construct association features and apply them to image retrieval.The algorithm generates binary hash code with minimal distortion error by designing quantization regularization terms.A large number of experiments on benchmark datasets show that this method is superior to other multi-view hashing methods. 展开更多
关键词 HASHING multi-view data random kernel canonical correlation analysis feature fusion deep learning
下载PDF
Research on color image matching method based on feature point compensation in dark light environment
6
作者 唐华鹏 QIN Danyang +2 位作者 YAN Mengying YANG Jiaqiang ZHANG Gengxin 《High Technology Letters》 EI CAS 2023年第1期78-86,共9页
Image matching refers to the process of matching two or more images obtained at different time,different sensors or different conditions through a large number of feature points in the image.At present,image matching ... Image matching refers to the process of matching two or more images obtained at different time,different sensors or different conditions through a large number of feature points in the image.At present,image matching is widely used in target recognition and tracking,indoor positioning and navigation.Local features missing,however,often occurs in color images taken in dark light,making the extracted feature points greatly reduced in number,so as to affect image matching and even fail the target recognition.An unsharp masking(USM)based denoising model is established and a local adaptive enhancement algorithm is proposed to achieve feature point compensation by strengthening local features of the dark image in order to increase amount of image information effectively.Fast library for approximate nearest neighbors(FLANN)and random sample consensus(RANSAC)are image matching algorithms.Experimental results show that the number of effective feature points obtained by the proposed algorithm from images in dark light environment is increased,and the accuracy of image matching can be improved obviously. 展开更多
关键词 dark light environment unsharp masking(USM) denoising model feature point compensation fast library for approximate nearest neighbor(FLANN) random sample consensus(RANSAC)
下载PDF
Feature Selection for Intrusion Detection Using Random Forest 被引量:10
7
作者 Md. Al Mehedi Hasan Mohammed Nasser +1 位作者 Shamim Ahmad Khademul Islam Molla 《Journal of Information Security》 2016年第3期129-140,共12页
An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats that include threats from both outside as well as inside of the... An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats that include threats from both outside as well as inside of the organization. It deals with large amount of data, which contains various ir-relevant and redundant features and results in increased processing time and low detection rate. Therefore, feature selection should be treated as an indispensable pre-processing step to improve the overall system performance significantly while mining on huge datasets. In this context, in this paper, we focus on a two-step approach of feature selection based on Random Forest. The first step selects the features with higher variable importance score and guides the initialization of search process for the second step whose outputs the final feature subset for classification and in-terpretation. The effectiveness of this algorithm is demonstrated on KDD’99 intrusion detection datasets, which are based on DARPA 98 dataset, provides labeled data for researchers working in the field of intrusion detection. The important deficiency in the KDD’99 data set is the huge number of redundant records as observed earlier. Therefore, we have derived a data set RRE-KDD by eliminating redundant record from KDD’99 train and test dataset, so the classifiers and feature selection method will not be biased towards more frequent records. This RRE-KDD consists of both KDD99Train+ and KDD99Test+ dataset for training and testing purposes, respectively. The experimental results show that the Random Forest based proposed approach can select most im-portant and relevant features useful for classification, which, in turn, reduces not only the number of input features and time but also increases the classification accuracy. 展开更多
关键词 feature Selection KDD’99 Dataset RRE-KDD Dataset random Forest Permuted Importance Measure
下载PDF
Feature Selection Using Grey Wolf Optimization with Random Differential Grouping 被引量:1
8
作者 R.S.Latha B.Saravana Balaji +3 位作者 Nebojsa Bacanin Ivana Strumberger Miodrag Zivkovic Milos Kabiljo 《Computer Systems Science & Engineering》 SCIE EI 2022年第10期317-332,共16页
Big data are regarded as a tremendous technology for processing a huge variety of data in a short time and with a large storage capacity.The user’s access over the internet creates massive data processing over the in... Big data are regarded as a tremendous technology for processing a huge variety of data in a short time and with a large storage capacity.The user’s access over the internet creates massive data processing over the internet.Big data require an intelligent feature selection model by addressing huge varieties of data.Traditional feature selection techniques are only applicable to simple data mining.Intelligent techniques are needed in big data processing and machine learning for an efficient classification.Major feature selection algorithms read the input features as they are.Then,the features are preprocessed and classified.Here,an algorithm does not consider the relatedness.During feature selection,all features are misread as outputs.Accordingly,a less optimal solution is achieved.In our proposed research,we focus on the feature selection by using supervised learning techniques called grey wolf optimization(GWO)with decomposed random differential grouping(DrnDG-GWO).First,decomposition of features into subsets based on relatedness in variables is performed.Random differential grouping is performed using a fitness value of two variables.Now,every subset is regarded as a population in GWO techniques.The combination of supervised machine learning with swarm intelligence techniques produces best feature optimization results in this research.Once the features are optimized,we classify using advanced kNN process for accurate data classification.The result of DrnDGGWO is compared with those of the standard GWO and GWO with PSO for feature selection to compare the efficiency of the proposed algorithm.The accuracy and time complexity of the proposed algorithm are 98%and 5 s,which are better than the existing techniques. 展开更多
关键词 feature selection data optimization supervised learning swarm intelligence decomposed random differential grouping grey wolf optimization
下载PDF
A real-time intelligent lithology identification method based on a dynamic felling strategy weighted random forest algorithm
9
作者 Tie Yan Rui Xu +2 位作者 Shi-Hui Sun Zhao-Kai Hou Jin-Yu Feng 《Petroleum Science》 SCIE EI CAS CSCD 2024年第2期1135-1148,共14页
Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face ... Real-time intelligent lithology identification while drilling is vital to realizing downhole closed-loop drilling. The complex and changeable geological environment in the drilling makes lithology identification face many challenges. This paper studies the problems of difficult feature information extraction,low precision of thin-layer identification and limited applicability of the model in intelligent lithologic identification. The author tries to improve the comprehensive performance of the lithology identification model from three aspects: data feature extraction, class balance, and model design. A new real-time intelligent lithology identification model of dynamic felling strategy weighted random forest algorithm(DFW-RF) is proposed. According to the feature selection results, gamma ray and 2 MHz phase resistivity are the logging while drilling(LWD) parameters that significantly influence lithology identification. The comprehensive performance of the DFW-RF lithology identification model has been verified in the application of 3 wells in different areas. By comparing the prediction results of five typical lithology identification algorithms, the DFW-RF model has a higher lithology identification accuracy rate and F1 score. This model improves the identification accuracy of thin-layer lithology and is effective and feasible in different geological environments. The DFW-RF model plays a truly efficient role in the realtime intelligent identification of lithologic information in closed-loop drilling and has greater applicability, which is worthy of being widely used in logging interpretation. 展开更多
关键词 Intelligent drilling Closed-loop drilling Lithology identification random forest algorithm feature extraction
下载PDF
Role of Feature Selection on Leaf Image Classification 被引量:1
10
作者 Arun Kumar Vinod Patidar +1 位作者 Deepak Khazanchi Poonam Saini 《Journal of Data Analysis and Information Processing》 2015年第4期175-183,共9页
The digital images have been studied for image classification, enhancement, image compression and image segmentation purposes. In the present work, it is proposed to study the effects of feature selection algorithm on... The digital images have been studied for image classification, enhancement, image compression and image segmentation purposes. In the present work, it is proposed to study the effects of feature selection algorithm on the predictive classification accuracy of algorithms used for discriminating the different plant leaf images. The process involves extracting the important texture features from the digital images and then subjecting them to feature selection and further classification process. The leaf image features have been extracted by using Gabor texture features and these Gabor features are subjected to Random Forest feature selection algorithm for extracting important texture features. The four classification algorithms like K-Nearest Neighbour, J48, Classification and Regression Trees and Random Forest have been used for classification purpose. This study shows that there is a net improvement in the predictive classification accuracy values, when classification algorithms have been applied on selected features over the complete set of features. 展开更多
关键词 LEAF Image feature Selection Algorithm random FOREST GABOR TEXTURE features
下载PDF
A Global Training Model for Beat Classification Using Basic Electrocardiogram Morphological Features
11
作者 Shubha Sumesh John Yearwood +1 位作者 Shamsul Huda Shafiq Ahmad 《Computers, Materials & Continua》 SCIE EI 2022年第3期4503-4521,共19页
Clinical Study and automatic diagnosis of electrocardiogram(ECG)data always remain a challenge in diagnosing cardiovascular activities.The analysis of ECG data relies on various factors like morphological features,cla... Clinical Study and automatic diagnosis of electrocardiogram(ECG)data always remain a challenge in diagnosing cardiovascular activities.The analysis of ECG data relies on various factors like morphological features,classification techniques,methods or models used to diagnose and its performance improvement.Another crucial factor in themethodology is howto train the model for each patient.Existing approaches use standard training model which faces challenges when training data has variation due to individual patient characteristics resulting in a lower detection accuracy.This paper proposes an adaptive approach to identify performance improvement in building a training model that analyze global trainingmethodology against an individual training methodology and identifying a gap between them.We provide our investigation and comparative study on these methods and model with standard classification techniques with basic morphological features and Heart RateVariability(HRV)thatmay aid real time application.This approach helps in analyzing and evaluating the performance of different techniques and can suggests adoption of a best model identification with efficient technique and efficient attribute set for real-time systems. 展开更多
关键词 ECG morphological feature HRV GLOBAL adaptive training multilayer perceptron(MLP) support vector machine(SVM) random forest(RF)
下载PDF
城市实景模型结构化线面特征重构方法 被引量:1
12
作者 梅熙 王义 +1 位作者 曲英杰 邓非 《同济大学学报(自然科学版)》 EI CAS CSCD 北大核心 2024年第1期130-136,共7页
为了改善城市实景模型边缘模糊,提出了一种曲率引导的结构化线面特征重构方法。根据曲率特征将网格分割为平面、可展凹、可展凸以及不可展曲面4类,在平面分割结果内提取平面,在可展凹和可展凸分割结果内提取直线,对过度弯曲的不可展区... 为了改善城市实景模型边缘模糊,提出了一种曲率引导的结构化线面特征重构方法。根据曲率特征将网格分割为平面、可展凹、可展凸以及不可展曲面4类,在平面分割结果内提取平面,在可展凹和可展凸分割结果内提取直线,对过度弯曲的不可展区域进行保留,最终形成包含几何特征的复合网格模型。结果表明,结合曲率信息预先设置几何特征的潜在范围,使得结构化线面特征更可靠,同时保证城市实景中复杂的树结构不被错误地提取为平面。 展开更多
关键词 实景三维模型 三维重建 网格 线特征 面特征 马尔科夫随机场(MRF) 简化
下载PDF
Detecting XSS with Random Forest and Multi-Channel Feature Extraction
13
作者 Qiurong Qin Yueqin Li +3 位作者 Yajie Mi Jinhui Shen Kexin Wu Zhenzhao Wang 《Computers, Materials & Continua》 SCIE EI 2024年第7期843-874,共32页
In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through cr... In the era of the Internet,widely used web applications have become the target of hacker attacks because they contain a large amount of personal information.Among these vulnerabilities,stealing private data through crosssite scripting(XSS)attacks is one of the most commonly used attacks by hackers.Currently,deep learning-based XSS attack detection methods have good application prospects;however,they suffer from problems such as being prone to overfitting,a high false alarm rate,and low accuracy.To address these issues,we propose a multi-stage feature extraction and fusion model for XSS detection based on Random Forest feature enhancement.The model utilizes RandomForests to capture the intrinsic structure and patterns of the data by extracting leaf node indices as features,which are subsequentlymergedwith the original data features to forma feature setwith richer information content.Further feature extraction is conducted through three parallel channels.Channel I utilizes parallel onedimensional convolutional layers(1Dconvolutional layers)with different convolutional kernel sizes to extract local features at different scales and performmulti-scale feature fusion;Channel II employsmaximum one-dimensional pooling layers(max 1D pooling layers)of various sizes to extract key features from the data;and Channel III extracts global information bi-directionally using a Bi-Directional Long-Short TermMemory Network(Bi-LSTM)and incorporates a multi-head attention mechanism to enhance global features.Finally,effective classification and prediction of XSS are performed by fusing the features of the three channels.To test the effectiveness of the model,we conduct experiments on six datasets.We achieve an accuracy of 100%on the UNSW-NB15 dataset and 99.99%on the CICIDS2017 dataset,which is higher than that of the existing models. 展开更多
关键词 random forest feature enhancement three-channel parallelism XSS detection
下载PDF
Optimization of Random Feature Method in the High-Precision Regime
14
作者 Jingrun Chen Weinan E Yifei Sun 《Communications on Applied Mathematics and Computation》 EI 2024年第2期1490-1517,共28页
Machine learning has been widely used for solving partial differential equations(PDEs)in recent years,among which the random feature method(RFM)exhibits spectral accuracy and can compete with traditional solvers in te... Machine learning has been widely used for solving partial differential equations(PDEs)in recent years,among which the random feature method(RFM)exhibits spectral accuracy and can compete with traditional solvers in terms of both accuracy and efficiency.Potentially,the optimization problem in the RFM is more difficult to solve than those that arise in traditional methods.Unlike the broader machine-learning research,which frequently targets tasks within the low-precision regime,our study focuses on the high-precision regime crucial for solving PDEs.In this work,we study this problem from the following aspects:(i)we analyze the coeffcient matrix that arises in the RFM by studying the distribution of singular values;(ii)we investigate whether the continuous training causes the overfitting issue;(ii)we test direct and iterative methods as well as randomized methods for solving the optimization problem.Based on these results,we find that direct methods are superior to other methods if memory is not an issue,while iterative methods typically have low accuracy and can be improved by preconditioning to some extent. 展开更多
关键词 random feature method(RFM) Partial differential equation(PDE) Least-squares problem Direct method Iterative method
下载PDF
基于特征变量扩展的含气饱和度随机森林预测方法 被引量:1
15
作者 桂金咏 李胜军 +2 位作者 高建虎 刘炳杨 郭欣 《岩性油气藏》 CAS CSCD 北大核心 2024年第2期65-75,共11页
采用数据驱动的方式,提出了一种基于随机森林机器学习算法训练出含气饱和度地震预测方法,并将该方法应用于中国西部复杂天然气藏中,分别对单井资料和二维地震资料进行了含气饱和度预测与分析。研究结果表明:(1)抽取井旁道纵波速度、横... 采用数据驱动的方式,提出了一种基于随机森林机器学习算法训练出含气饱和度地震预测方法,并将该方法应用于中国西部复杂天然气藏中,分别对单井资料和二维地震资料进行了含气饱和度预测与分析。研究结果表明:(1)抽取井旁道纵波速度、横波速度和密度3个弹性参数叠前地震反演结果作为基本特征变量样本,引入边界合成少数类过采样技术对基本特征变量样本和对应的含气饱和度样本进行平衡化处理;利用扩展弹性阻抗结合数学变换自动生成一系列的扩展变量;再利用随机森林对特征变量进行含气饱和度预测重要性排名,并优选重要性较高的特征变量进行含气饱和度随机森林训练。(2)该方法大幅减少了特征变量提取和优选的人工工作量,且有效减少了信息冗余以及因含气饱和度样本不平衡导致的训练偏倚问题,有效增强了随机森林算法在含气饱和度地震预测方面的能力。(3)实际单井应用中预测的含气饱和度与测井解释的含气饱和度的相关系数可达0.9855;在二维地震资料应用中,该方法比基于常规未平衡化的11个弹性参数作为随机森林输入预测出的含气饱和度精度更高。 展开更多
关键词 含气饱和度 随机森林 纵波速度 横波速度 密度 特征变量 不平衡数据 机器学习 气层预测 地震预测
下载PDF
基于随机多权重TOPSIS法的目标威胁评估
16
作者 刘畅 李姜 +3 位作者 陈阳 郭立红 王烨 于洋 《兵器装备工程学报》 CAS CSCD 北大核心 2024年第7期50-55,96,共7页
在目标威胁评估问题中,为了解决目标各项特征权重选取可能存在主观决策误差的问题,提出了一种基于随机多权重TOPSIS法的空中目标威胁评估方法。基于层次分析法确定了各项特征之间的权重,对难以定量衡量的目标特性进行了特征提取,并基于... 在目标威胁评估问题中,为了解决目标各项特征权重选取可能存在主观决策误差的问题,提出了一种基于随机多权重TOPSIS法的空中目标威胁评估方法。基于层次分析法确定了各项特征之间的权重,对难以定量衡量的目标特性进行了特征提取,并基于随机多权重TOPSIS法将空中目标的威胁度进行了评估。仿真实例表明:随机多权重TOPSIS法威胁评估与AHP、TOPSIS方法排序一致,但给出了各目标威胁度的不确定范围,实验中不确定范围值最低为0.08%,最高为3.78%。战场指挥人员可以通过本文中提出的威胁度不确定性范围得到更多参考信息。 展开更多
关键词 目标威胁度评估 目标特征提取 TOPSIS法 随机多权重 层次分析法
下载PDF
基于近红外光谱的卷烟配方模块香型预测
17
作者 王林 郑明明 +3 位作者 王翀 吴庆华 崔南方 李建斌 《华中农业大学学报》 CAS CSCD 北大核心 2024年第1期226-231,共6页
为提高卷烟配方模块的分类识别准确率,并为卷烟配方模块的科学评估提供技术支撑,提出了一种基于近红外光谱特征筛选的卷烟配方模块香型预测方法。选取2017—2019年238个卷烟配方模块样品的近红外光谱数据,结合特征工程中的递归特征消除... 为提高卷烟配方模块的分类识别准确率,并为卷烟配方模块的科学评估提供技术支撑,提出了一种基于近红外光谱特征筛选的卷烟配方模块香型预测方法。选取2017—2019年238个卷烟配方模块样品的近红外光谱数据,结合特征工程中的递归特征消除法和BP神经网络、随机森林、XGBoost3种机器学习技术,构建了基于特征变量的香型预测模型。与全光谱数据训练的分类效果对比,经过递归特征消除法筛选后的光谱特征变量能够有效提升卷烟配方模块香型的识别准确率,其中,XGBoost算法分类效果最佳,模型对测试集的识别准确率达到了90.41%。结果表明,基于近红外光谱特征筛选的香型预测方法对卷烟配方模块的快速定位、科学评价及卷烟配方设计等有一定的辅助决策作用。 展开更多
关键词 烟叶 香型 近红外光谱 递归特征消除 随机森林 XGBoost
下载PDF
基于精度突变的随机森林特征优选方法研究与应用
18
作者 潘建平 尚栋 +3 位作者 谢鹏 郭志豪 齐晨 李逸萌 《测绘工程》 2024年第3期26-35,共10页
随机森林特征优选法是遥感解译中常用的特征选取方法,可以减少特征冗余提高提取精度。但该算法在构建决策树时会随机选择特征子集,导致某些重要的特征被丢失,从而使优选结果不是最优特征子集。以水稻提取为例,设计了一种基于精度突变的... 随机森林特征优选法是遥感解译中常用的特征选取方法,可以减少特征冗余提高提取精度。但该算法在构建决策树时会随机选择特征子集,导致某些重要的特征被丢失,从而使优选结果不是最优特征子集。以水稻提取为例,设计了一种基于精度突变的随机森林特征优选方法:利用随机森林特征优选方法对研究区进行特征排序;通过逐级组合的方式进行水稻提取;将精度突变的特征重新加入到特征优选子集;基于最优特征优选子集提取水稻。实验结果显示,文中方法将水稻提取中被丢失的特征重新加入到特征优选子集中,其总体提取精度可提升2.7%,表明文中方法可提高水稻的提取精度,同时该方法在地物分类和变化检测等相关领域也有一定的参考价值。 展开更多
关键词 特征优选 精度突变 随机森林 水稻 谷歌地球引擎
下载PDF
应用多时序特征的哨兵系列影像对南方丘陵区树种识别
19
作者 王洁 李恒凯 +1 位作者 龙北平 张建莹 《东北林业大学学报》 CAS CSCD 北大核心 2024年第3期60-68,共9页
树种分类是森林资源调查和监测的重要工作,杉木和油茶作为袁州区主要经济树种,准确获取树种空间分布信息,对产量估算和资源管理具有重要意义。以江西省宜春市袁州区为研究区,试验融合时序哨兵-1(Sentinel-1)、哨兵-2(Sentinel-2)等数据... 树种分类是森林资源调查和监测的重要工作,杉木和油茶作为袁州区主要经济树种,准确获取树种空间分布信息,对产量估算和资源管理具有重要意义。以江西省宜春市袁州区为研究区,试验融合时序哨兵-1(Sentinel-1)、哨兵-2(Sentinel-2)等数据,结合中国南方丘陵区树种特点,提取植被指数、红边植被指数、地形特征和纹理特征等构建特征变量组合,分别利用分离阈值法(SEaTH)和特征权重算法(ReliefF)进行特征重要性排序和特征优选,分析各特征对树种分类的影响。结果表明:(1)在使用光谱特征和植被-水体指数的基础上加入不同特征后,树种分类精度均有提升,其中纹理特征的加入更有利于树种分类。(2)结合随机森林算法和特征权重算法(ReliefF)对树种分类的精度最高,总体精度为85.33%,Kappa系数为0.81,优于相同特征组下的支持向量机算法和分类回归树算法。 展开更多
关键词 树种分类 哨兵-1 哨兵-2 特征优选 随机森林 中国南方丘陵
下载PDF
基于分量相依性的风速随机性建模方法
20
作者 张家安 王军燕 +2 位作者 刘辉 吴林林 王向伟 《太阳能学报》 EI CAS CSCD 北大核心 2024年第2期109-115,共7页
由于微地形、微气象的影响,风电场风速的随机性特征复杂,为准确描述风速的随机性特征,提出基于分量相依性的风速随机性特征建模方法。首先对风速序列的随机性特征进行提取,采用变分模态分解(VMD)将风速分解为多个不同频率的模态分量,以... 由于微地形、微气象的影响,风电场风速的随机性特征复杂,为准确描述风速的随机性特征,提出基于分量相依性的风速随机性特征建模方法。首先对风速序列的随机性特征进行提取,采用变分模态分解(VMD)将风速分解为多个不同频率的模态分量,以序列自相关系数(AC)为指标,对风速成分进行划分,得到风速的波动性分量和随机性分量。然后,考虑风速随机性分量对波动性分量的相依性,以正态分布描述不同风速下的随机性特征,建立基于分量相依性的风速随机性模型。以华北张家口某风电场的运行数据为例,验证该方法的有效性。实验结果表明该文方法能更好地复现风速序列的随机性特征。 展开更多
关键词 风速 随机性 特征提取 分量相依性 统计方法
下载PDF
上一页 1 2 81 下一页 到第
使用帮助 返回顶部