期刊文献+
共找到24篇文章
< 1 2 >
每页显示 20 50 100
Basic Tenets of Classification Algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest and Neural Network: A Review 被引量:1
1
作者 Ernest Yeboah Boateng Joseph Otoo Daniel A. Abaye 《Journal of Data Analysis and Information Processing》 2020年第4期341-357,共17页
In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (... In this paper, sixty-eight research articles published between 2000 and 2017 as well as textbooks which employed four classification algorithms: K-Nearest-Neighbor (KNN), Support Vector Machines (SVM), Random Forest (RF) and Neural Network (NN) as the main statistical tools were reviewed. The aim was to examine and compare these nonparametric classification methods on the following attributes: robustness to training data, sensitivity to changes, data fitting, stability, ability to handle large data sizes, sensitivity to noise, time invested in parameter tuning, and accuracy. The performances, strengths and shortcomings of each of the algorithms were examined, and finally, a conclusion was arrived at on which one has higher performance. It was evident from the literature reviewed that RF is too sensitive to small changes in the training dataset and is occasionally unstable and tends to overfit in the model. KNN is easy to implement and understand but has a major drawback of becoming significantly slow as the size of the data in use grows, while the ideal value of K for the KNN classifier is difficult to set. SVM and RF are insensitive to noise or overtraining, which shows their ability in dealing with unbalanced data. Larger input datasets will lengthen classification times for NN and KNN more than for SVM and RF. Among these nonparametric classification methods, NN has the potential to become a more widely used classification algorithm, but because of their time-consuming parameter tuning procedure, high level of complexity in computational processing, the numerous types of NN architectures to choose from and the high number of algorithms used for training, most researchers recommend SVM and RF as easier and wieldy used methods which repeatedly achieve results with high accuracies and are often faster to implement. 展开更多
关键词 classification algorithms NON-PARAMETRIC K-Nearest-Neighbor Neural Networks Random Forest Support Vector Machines
下载PDF
A Short Review of Classification Algorithms Accuracy for Data Prediction in Data Mining Applications 被引量:1
2
作者 Ibrahim Ba’abbad Thamer Althubiti +2 位作者 Abdulmohsen Alharbi Khalid Alfarsi Saim Rasheed 《Journal of Data Analysis and Information Processing》 2021年第3期162-174,共13页
Many business applications rely on their historical data to predict their business future. The marketing products process is one of the core processes for the business. Customer needs give a useful piece of informatio... Many business applications rely on their historical data to predict their business future. The marketing products process is one of the core processes for the business. Customer needs give a useful piece of information that help</span><span style="font-family:Verdana;"><span style="font-family:Verdana;">s</span></span><span style="font-family:Verdana;"> to market the appropriate products at the appropriate time. Moreover, services are considered recently as products. The development of education and health services </span><span style="font-family:Verdana;"><span style="font-family:Verdana;">is</span></span><span style="font-family:Verdana;"> depending on historical data. For the more, reducing online social media networks problems and crimes need a significant source of information. Data analysts need to use an efficient classification algorithm to predict the future of such businesses. However, dealing with a huge quantity of data requires great time to process. Data mining involves many useful techniques that are used to predict statistical data in a variety of business applications. The classification technique is one of the most widely used with a variety of algorithms. In this paper, various classification algorithms are revised in terms of accuracy in different areas of data mining applications. A comprehensive analysis is made after delegated reading of 20 papers in the literature. This paper aims to help data analysts to choose the most suitable classification algorithm for different business applications including business in general, online social media networks, agriculture, health, and education. Results show FFBPN is the most accurate algorithm in the business domain. The Random Forest algorithm is the most accurate in classifying online social networks (OSN) activities. Na<span style="white-space:nowrap;">&#239</span>ve Bayes algorithm is the most accurate to classify agriculture datasets. OneR is the most accurate algorithm to classify instances within the health domain. The C4.5 Decision Tree algorithm is the most accurate to classify students’ records to predict degree completion time. 展开更多
关键词 Data Prediction Techniques ACCURACY classification algorithms Data Mining Applications
下载PDF
A Comparative Study of Image Classification Algorithms for Landscape Assessment of the Niger Delta Region
3
作者 Omoleomo Olutoyin Omo-Irabor 《Journal of Geographic Information System》 2016年第2期163-170,共8页
A critical problem associated with the southern part of Nigeria is the rapid alteration of the landscape as a result of logging, agricultural practices, human migration and expansion, oil exploration, exploitation and... A critical problem associated with the southern part of Nigeria is the rapid alteration of the landscape as a result of logging, agricultural practices, human migration and expansion, oil exploration, exploitation and production activities. These processes have had both positive and negative effects on the economic and socio-political development of the country in general. The negative impacts have led not only to the degradation of the ecosystem but also posing hazards to human health and polluting surface and ground water resources. This has created the need for the development of a rapid, cost effective and efficient land use/land cover (LULC) classification technique to monitor the biophysical dynamics in the region. Due to the complex land cover patterns existing in the study area and the occasionally indistinguishable relationship between land cover and spectral signals, this paper introduces a combined use of unsupervised and supervised image classification for detecting land use/land cover (LULC) classes. With the continuous conflict over the impact of oil activities in the area, this work provides a procedure for detecting LULC change, which is an important factor to consider in the design of an environmental decision-making framework. Results from the use of this technique on Landsat TM and ETM+ of 1987 and 2002 are discussed. The results reveal the pros and cons of the two methods and the effects of their overall accuracy on post-classification change detection. 展开更多
关键词 Land Cover Supervised and Unsupervised classification algorithms Landsat Images Change Detection Niger Delta
下载PDF
Prediction of geological characteristics from shield operational parameters by integrating grid search and K-fold cross validation into stacking classification algorithm 被引量:5
4
作者 Tao Yan Shui-Long Shen +1 位作者 Annan Zhou Xiangsheng Chen 《Journal of Rock Mechanics and Geotechnical Engineering》 SCIE CSCD 2022年第4期1292-1303,共12页
This study presents a framework for predicting geological characteristics based on integrating a stacking classification algorithm(SCA) with a grid search(GS) and K-fold cross validation(K-CV). The SCA includes two le... This study presents a framework for predicting geological characteristics based on integrating a stacking classification algorithm(SCA) with a grid search(GS) and K-fold cross validation(K-CV). The SCA includes two learner layers: a primary learner’s layer and meta-classifier layer. The accuracy of the SCA can be improved by using the GS and K-CV. The GS was developed to match the hyper-parameters and optimise complicated problems. The K-CV is commonly applied to changing the validation set in a training set. In general, a GS is usually combined with K-CV to produce a corresponding evaluation index and select the best hyper-parameters. The torque penetration index(TPI) and field penetration index(FPI) are proposed based on shield parameters to express the geological characteristics. The elbow method(EM) and silhouette coefficient(Si) are employed to determine the types of geological characteristics(K) in a Kmeans++ algorithm. A case study on mixed ground in Guangzhou is adopted to validate the applicability of the developed model. The results show that with the developed framework, the four selected parameters, i.e. thrust, advance rate, cutterhead rotation speed and cutterhead torque, can be used to effectively predict the corresponding geological characteristics. 展开更多
关键词 Geological characteristics Stacking classification algorithm(SCA) K-fold cross-validation(K-CV) K-means++
下载PDF
A novel fast classification filtering algorithm for LiDAR point clouds based on small grid density clustering 被引量:3
5
作者 Xingsheng Deng Guo Tang Qingyang Wang 《Geodesy and Geodynamics》 CSCD 2022年第1期38-49,共12页
Clustering filtering is usually a practical method for light detection and ranging(LiDAR)point clouds filtering according to their characteristic attributes.However,the amount of point cloud data is extremely large in... Clustering filtering is usually a practical method for light detection and ranging(LiDAR)point clouds filtering according to their characteristic attributes.However,the amount of point cloud data is extremely large in practice,making it impossible to cluster point clouds data directly,and the filtering error is also too large.Moreover,many existing filtering algorithms have poor classification results in discontinuous terrain.This article proposes a new fast classification filtering algorithm based on density clustering,which can solve the problem of point clouds classification in discontinuous terrain.Based on the spatial density of LiDAR point clouds,also the features of the ground object point clouds and the terrain point clouds,the point clouds are clustered firstly by their elevations,and then the plane point clouds are selected.Thus the number of samples and feature dimensions of data are reduced.Using the DBSCAN clustering filtering method,the original point clouds are finally divided into noise point clouds,ground object point clouds,and terrain point clouds.The experiment uses 15 sets of data samples provided by the International Society for Photogrammetry and Remote Sensing(ISPRS),and the results of the proposed algorithm are compared with the other eight classical filtering algorithms.Quantitative and qualitative analysis shows that the proposed algorithm has good applicability in urban areas and rural areas,and is significantly better than other classic filtering algorithms in discontinuous terrain,with a total error of about 10%.The results show that the proposed method is feasible and can be used in different terrains. 展开更多
关键词 Small grid density clustering DBSCAN Fast classification filtering algorithm
下载PDF
AUTOMATIC FAST CLASSIFICATION OF PRODUCT-IMAGES WITH CLASS-SPECIFIC DESCRIPTOR 被引量:1
6
作者 Jia Shijie Kong Xiangwei Jin Guang 《Journal of Electronics(China)》 2010年第6期808-814,共7页
To achieve online automatic classification of product is a great need of e-commerce de-velopment. By analyzing the characteristics of product images, we proposed a fast supervised image classifier which is based on cl... To achieve online automatic classification of product is a great need of e-commerce de-velopment. By analyzing the characteristics of product images, we proposed a fast supervised image classifier which is based on class-specific Pyramid Histogram Of Words (PHOW) descriptor and Im-age-to-Class distance (PHOW/I2C). In the training phase, the local features are densely sampled and represented as soft-voting PHOW descriptors, and then the class-specific descriptors are built with the means and variances of distribution of each visual word in each labelled class. For online testing, the normalized chi-square distance is calculated between the descriptor of query image and each class-specific descriptor. The class label corresponding to the least I2C distance is taken as the final winner. Experiments demonstrate the effectiveness and quickness of our method in the tasks of product clas-sification. 展开更多
关键词 Class-specific descriptor Fast classification algorithm Product image
下载PDF
A Classification Algorithm for Ground Moving Targets Based on Magnetic Sensors
7
作者 崔逊学 刘綦 刘坤 《Defence Technology(防务技术)》 SCIE EI CAS 2011年第1期52-58,共7页
A novel classification algorithm based on abnormal magnetic signals is proposed for ground moving targets which are made of ferromagnetic material. According to the effect of diverse targets on earth's magnetism,t... A novel classification algorithm based on abnormal magnetic signals is proposed for ground moving targets which are made of ferromagnetic material. According to the effect of diverse targets on earth's magnetism,the moving targets are detected by a magnetic sensor and classified with a simple computation method. The detection sensor is used for collecting a disturbance signal of earth magnetic field from an undetermined target. An optimum category match pattern of target signature is tested by training some statistical samples and designing a classification machine. Three ordinary targets are researched in the paper. The experimental results show that the algorithm has a low computation cost and a better sorting accuracy. This classification method can be applied to ground reconnaissance and target intrusion detection. 展开更多
关键词 information processing magnetic sensor abnormal magnetic signal target detection target classification classification algorithm
下载PDF
Study and Implementation of Web Mining Classification Algorithm Based on Building Tree of Detection Class Threshold
8
作者 陈俊杰 宋瀚涛 陆玉昌 《Journal of Beijing Institute of Technology》 EI CAS 2005年第2期126-129,共4页
A new classification algorithm for web mining is proposed on the basis of general classification algorithm for data mining in order to implement personalized information services. The building tree method of detecting... A new classification algorithm for web mining is proposed on the basis of general classification algorithm for data mining in order to implement personalized information services. The building tree method of detecting class threshold is used for construction of decision tree according to the concept of user expectation so as to find classification rules in different layers. Compared with the traditional C4.5 algorithm, the disadvantage of excessive adaptation in C4.5 has been improved so that classification results not only have much higher accuracy but also statistic meaning. 展开更多
关键词 data mining classification algorithm class threshold induced concept
下载PDF
Aspect Based User Reviews Classification
9
作者 Roch DANISTAN Thulasika ARUNAKIRINATHAN +2 位作者 Archchana SIVARAJAH Yanusha MEHENDRAN Jayalath EKANAYAKE 《Instrumentation》 2020年第2期9-19,共11页
At present online shopping is very popular as it is very convenient for the customers.However,selecting smartphones from online shops is bit difficult only from the pictures and a short description about the item,and ... At present online shopping is very popular as it is very convenient for the customers.However,selecting smartphones from online shops is bit difficult only from the pictures and a short description about the item,and hence,the customers refer user reviews and star rating.Since user reviews are represented in human languages,sometimes the real semantic of the reviews and satisfaction of the customers are different than what the star rating shows.Also,reading all the reviews are not possible as typically,a smartphone gets thousands of reviews in popular online shopping platform like Amazon.Hence,this work aims to develop a recommended system for smartphones based on aspects of the phones such as screen size,resolution,camera quality,battery life etc.reviewed by users.To that end we apply hybrid approach,which includes three lexicon-based methods and three machine learning modals to analyze specific aspects of user reviews and classify the reviews into six categories--best,better,good or somewhat for positive comments and for negative comments bad or not recommended--.The lexicon-based tool called AFINN together with Random Forest prediction model provides the best classification F1-score 0.95.This system can be customized according to the required aspects of smartphones and the classification of reviews can be done accordingly. 展开更多
关键词 User-reviews classification Aspects of Smartphones Reviews of Smart Phones classification algorithms Lexicon-based Methods Sentimental Analysis
下载PDF
In-Vehicle Network Injection Attacks Detection Based on Feature Selection and Classification
10
作者 Haojie Ji Liyong Wang +3 位作者 Hongmao Qin Yinghui Wang Junjie Zhang Biao Chen 《Automotive Innovation》 EI CSCD 2024年第1期138-149,共12页
Detecting abnormal data generated from cyberattacks has emerged as a crucial approach for identifying security threats within in-vehicle networks.The transmission of information through in-vehicle networks needs to fo... Detecting abnormal data generated from cyberattacks has emerged as a crucial approach for identifying security threats within in-vehicle networks.The transmission of information through in-vehicle networks needs to follow specific data for-mats and communication protocols regulations.Typically,statistical algorithms are employed to learn these variation rules and facilitate the identification of abnormal data.However,the effectiveness of anomaly detection outcomes often falls short when confronted with highly deceptive in-vehicle network attacks.In this study,seven representative classification algorithms are selected to detect common in-vehicle network attacks,and a comparative analysis is employed to identify the most suitable and favorable detection method.In consideration of the communication protocol characteristics of in-vehicle networks,an optimal convolutional neural network(CNN)detection algorithm is proposed that uses data field characteristics and classifier selection,and its comprehensive performance is tested.In addition,the concept of Hamming distance between two adjacent packets within the in-vehicle network is introduced,enabling the proposal of an enhanced CNN algorithm that achieves robust detection of challenging-to-identify abnormal data.This paper also presents the proposed CNN classifica-tion algorithm that effectively addresses the issue of high false negative rate(FNR)in abnormal data detection based on the timestamp feature of data packets.The experimental results validate the efficacy of the proposed abnormal data detection algorithm,highlighting its strong detection performance and its potential to provide an effective solution for safeguarding the security of in-vehicle network information. 展开更多
关键词 classification algorithm Anomaly detection In-vehicle network Feature extraction Injecting attack
原文传递
Spectral matching techniques (SMTs) and automated cropland classification algorithms (ACCAs) for mapping croplands of Australia using MODIS 250-m time-series (2000–2015) data 被引量:5
11
作者 Pardhasaradhi Teluguntla Prasad S.Thenkabail +7 位作者 Jun Xiong Murali Krishna Gumma Russell G.Congalton Adam Oliphant Justin Poehnelt Kamini Yadav Mahesh Rao Richard Massey 《International Journal of Digital Earth》 SCIE EI 2017年第9期944-977,共34页
Mapping croplands,including fallow areas,are an important measure to determine the quantity of food that is produced,where they are produced,and when they are produced(e.g.seasonality).Furthermore,croplands are known ... Mapping croplands,including fallow areas,are an important measure to determine the quantity of food that is produced,where they are produced,and when they are produced(e.g.seasonality).Furthermore,croplands are known as water guzzlers by consuming anywhere between 70%and 90%of all human water use globally.Given these facts and the increase in global population to nearly 10 billion by the year 2050,the need for routine,rapid,and automated cropland mapping year-after-year and/or season-after-season is of great importance.The overarching goal of this study was to generate standard and routine cropland products,year-after-year,over very large areas through the use of two novel methods:(a)quantitative spectral matching techniques(QSMTs)applied at continental level and(b)rule-based Automated Cropland Classification Algorithm(ACCA)with the ability to hind-cast,now-cast,and future-cast.Australia was chosen for the study given its extensive croplands,rich history of agriculture,and yet nonexistent routine yearly generated cropland products using multi-temporal remote sensing.This research produced three distinct cropland products using Moderate Resolution Imaging Spectroradiometer(MODIS)250-m normalized difference vegetation index 16-day composite time-series data for 16 years:2000 through 2015.The products consisted of:(1)cropland extent/areas versus cropland fallow areas,(2)irrigated versus rainfed croplands,and(3)cropping intensities:single,double,and continuous cropping.An accurate reference cropland product(RCP)for the year 2014(RCP2014)produced using QSMT was used as a knowledge base to train and develop the ACCA algorithm that was then applied to the MODIS time-series data for the years 2000–2015.A comparison between the ACCA-derived cropland products(ACPs)for the year 2014(ACP2014)versus RCP2014 provided an overall agreement of 89.4%(kappa=0.814)with six classes:(a)producer’s accuracies varying between 72%and 90%and(b)user’s accuracies varying between 79%and 90%.ACPs for the individual years 2000–2013 and 2015(ACP2000–ACP2013,ACP2015)showed very strong similarities with several other studies.The extent and vigor of the Australian croplands versus cropland fallows were accurately captured by the ACCA algorithm for the years 2000–2015,thus highlighting the value of the study in food security analysis. 展开更多
关键词 Croplands food security automated cropland classification algorithms machine learning algorithms quantitative spectral matching techniques AUSTRALIA
原文传递
Predicting the Type of Crime: Intelligence Gathering and Crime Analysis 被引量:3
12
作者 Saleh Albahli Anadil Alsaqabi +3 位作者 Fatimah Aldhubayi Hafiz Tayyab Rauf Muhammad Arif Mazin Abed Mohammed 《Computers, Materials & Continua》 SCIE EI 2021年第3期2317-2341,共25页
Crimes are expected to rise with an increase in population and the rising gap between society’s income levels.Crimes contribute to a significant portion of the socioeconomic loss to any society,not only through its i... Crimes are expected to rise with an increase in population and the rising gap between society’s income levels.Crimes contribute to a significant portion of the socioeconomic loss to any society,not only through its indirect damage to the social fabric and peace but also the more direct negative impacts on the economy,social parameters,and reputation of a nation.Policing and other preventive resources are limited and have to be utilized.The conventional methods are being superseded by more modern approaches of machine learning algorithms capable of making predictions where the relationships between the features and the outcomes are complex.Making it possible for such algorithms to provide indicators of specific areas that may become criminal hot-spots.These predictions can be used by policymakers and police personals alike to make effective and informed strategies that can curtail criminal activities and contribute to the nation’s development.This paper aims to predict factors that most affected crimes in Saudi Arabia by developing a machine learning model to predict an acceptable output value.Our results show that FAMD as features selection methods showed more accuracy on machine learning classifiers than the PCA method.The naïve Bayes classifier performs better than other classifiers on both features selections methods with an accuracy of 97.53%for FAMD,and PCA equals to 97.10%. 展开更多
关键词 PREDICTION machine learning crime prevention naïve bayes crime prediction classification algorithms
下载PDF
An improved fast fractal image compression using spatial texture correlation 被引量:2
13
作者 王兴元 王远星 云娇娇 《Chinese Physics B》 SCIE EI CAS CSCD 2011年第10期228-238,共11页
This paper utilizes a spatial texture correlation and the intelligent classification algorithm (ICA) search strategy to speed up the encoding process and improve the bit rate for fractal image compression. Texture f... This paper utilizes a spatial texture correlation and the intelligent classification algorithm (ICA) search strategy to speed up the encoding process and improve the bit rate for fractal image compression. Texture features is one of the most important properties for the representation of an image. Entropy and maximum entry from co-occurrence matrices are used for representing texture features in an image. For a range block, concerned domain blocks of neighbouring range blocks with similar texture features can be searched. In addition, domain blocks with similar texture features are searched in the ICA search process. Experiments show that in comparison with some typical methods, the proposed algorithm significantly speeds up the encoding process and achieves a higher compression ratio, with a slight diminution in the quality of the reconstructed image; in comparison with a spatial correlation scheme, the proposed scheme spends much less encoding time while the compression ratio and the quality of the reconstructed image are almost the same. 展开更多
关键词 fractal image compression texture features intelligent classification algorithm spatialcorrelation
下载PDF
A Time-Frequency Associated MUSIC Algorithm Research on Human Target Detection by Through-Wall Radar 被引量:1
14
作者 Xianyu Dong Wu Ren +2 位作者 Zhenghui Xue Xuetian Wang Weiming Li 《Journal of Beijing Institute of Technology》 EI CAS 2022年第1期123-130,共8页
In this paper,a time-frequency associated multiple signal classification(MUSIC)al-gorithm which is suitable for through-wall detection is proposed.The technology of detecting hu-man targets by through-wall radar can b... In this paper,a time-frequency associated multiple signal classification(MUSIC)al-gorithm which is suitable for through-wall detection is proposed.The technology of detecting hu-man targets by through-wall radar can be used to monitor the status and the location information of human targets behind the wall.However,the detection is out of order when classical MUSIC al-gorithm is applied to estimate the direction of arrival.In order to solve the problem,a time-fre-quency associated MUSIC algorithm suitable for through-wall detection and based on S-band stepped frequency continuous wave(SFCW)radar is researched.By associating inverse fast Fouri-er transform(IFFT)algorithm with MUSIC algorithm,the power enhancement of the target sig-nal is completed according to the distance calculation results in the time domain.Then convert the signal to the frequency domain for direction of arrival(DOA)estimation.The simulations of two-dimensional human target detection in free space and the processing of measured data are com-pleted.By comparing the processing results of the two algorithms on the measured data,accuracy of DOA estimation of proposed algorithm is more than 75%,which is 50%higher than classical MUSIC algorithm.It is verified that the distance and angle of human target can be effectively de-tected via proposed algorithm. 展开更多
关键词 through-wall radar multiple signal classification(MUSIC)algorithm inverse fast Four-ier transform(IFFT)algorithm target detection
下载PDF
Sensors-Based Ambient Assistant Living via E-Monitoring Technology
15
作者 Sadaf Hafeez Yazeed Yasin Ghadi +4 位作者 Mohammed Alarfaj Tamara al Shloul Ahmad Jalal Shaharyar Kama Dong-Seong Kim 《Computers, Materials & Continua》 SCIE EI 2022年第12期4935-4952,共18页
Independent human living systems require smart,intelligent,and sustainable online monitoring so that an individual can be assisted timely.Apart from ambient assisted living,the task of monitoring human activities play... Independent human living systems require smart,intelligent,and sustainable online monitoring so that an individual can be assisted timely.Apart from ambient assisted living,the task of monitoring human activities plays an important role in different fields including virtual reality,surveillance security,and human interaction with robots.Such systems have been developed in the past with the use of various wearable inertial sensors and depth cameras to capture the human actions.In this paper,we propose multiple methods such as random occupancy pattern,spatio temporal cloud,waypoint trajectory,Hilbert transform,Walsh Hadamard transform and bone pair descriptors to extract optimal features corresponding to different human actions.These features sets are then normalized using min-max normalization and optimized using the Fuzzy optimization method.Finally,the Masi entropy classifier is applied for action recognition and classification.Experiments have been performed on three challenging datasets,namely,UTDMHAD,50 Salad,and CMU-MMAC.During experimental evaluation,the proposed novel approach of recognizing human actions has achieved an accuracy rate of 90.1%with UTD-MHAD dataset,90.6%with 50 Salad dataset,and 89.5%with CMU-MMAC dataset.Hence experimental results validated the proposed system. 展开更多
关键词 classification algorithm human action recognition motion sensors machine learning Masi entropy
下载PDF
Designing a Model to Study Data Mining in Distributed Environment
16
作者 Md. Abadur Rahman Masud Karim 《Journal of Data Analysis and Information Processing》 2021年第1期23-29,共7页
To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using d... To make business policy, market analysis, corporate decision, fraud detection, etc., we have to analyze and work with huge amount of data. Generally, such data are taken from different sources. Researchers are using data mining to perform such tasks. Data mining techniques are used to find hidden information from large data source. Data mining is using for various fields: Artificial intelligence, Bank, health and medical, corruption, legal issues, corporate business, marketing, etc. Special interest is given to associate rules, data mining algorithms, decision tree and distributed approach. Data is becoming larger and spreading geographically. So it is difficult to find better result from only a central data source. For knowledge discovery, we have to work with distributed database. On the other hand, security and privacy considerations are also another factor for de-motivation of working with centralized data. For this reason, distributed database is essential for future processing. In this paper, we have proposed a framework to study data mining in distributed environment. The paper presents a framework to bring out actionable knowledge. We have shown some level by which we can generate actionable knowledge. Possible tools and technique for these levels are discussed. 展开更多
关键词 Data Mining Distributed Database Knowledge Discovery classification Algorithm
下载PDF
An examination of thematic research,development,and trends in remote sensing applied to conservation agriculture
17
作者 Zobaer Ahmed Aaron Shew +3 位作者 Lawton Nalley Michael Popp V.Steven Green Kristofor Brye 《International Soil and Water Conservation Research》 SCIE CSCD 2024年第1期77-95,共19页
Conservation agriculture seeks to reduce environmental degradation through sustainable management of agricultural land.Since the 1990s,agricultural research has been conducted using remote sensing technologies;however... Conservation agriculture seeks to reduce environmental degradation through sustainable management of agricultural land.Since the 1990s,agricultural research has been conducted using remote sensing technologies;however,few previous reviews have been conducted focused on different conservation management practices.Most of the previous literature has focused on the application of remote sensing in agriculture without focusing exclusively on conservation practices,with some only providing a narrative review,others using biophysical remote sensing for quantitative estimates of the bio-geo-chemical-physical properties of soils and crops,and few others focused on single agricultural management practices.This paper used the preferred reporting items for systematic review(PRISMA)methodology to examine the last 30 years of thematic research,development,and trends associated with remote sensing technologies and methods applied to conservation agriculture research at various spatial and temporal scales.A set of predefined key concepts and keywords were applied in three databases:Scopus,Web of Science,and Google Scholar.A total of 188 articles were compiled for initial examination,where 68 articles were selected for final analysis and grouped into cover crops,crop residue,crop rotation,mulching,and tillage practices.Publications on conservation agriculture research using remote sensing have been increasing since 1991 and peaked at 10 publications in 2020.Among the 68 articles,94%used a pixel-based,while only 6%used an object-based classification method.Prior to 2005,tillage practices were abundantly studied,then crop residue was a focused theme between 2004 and 2012.From 2012 to 2020,the focus shifted again to cover crops.Ten spectral indices were used in 76%of the 68 studies.This examination offered a summary of the new potential and identifies crucial future research needs and directions that could improve the contribution of remote sensing to the provision of long-term operational services for various conservation agriculture applications. 展开更多
关键词 Remote sensing Conservation agriculture classification algorithm Spatial resolution SATELLITE Spectral indices PRISMA
原文传递
Unstructured Big Data Threat Intelligence Parallel Mining Algorithm
18
作者 Zhihua Li Xinye Yu +1 位作者 Tao Wei Junhao Qian 《Big Data Mining and Analytics》 EI CSCD 2024年第2期531-546,共16页
To efficiently mine threat intelligence from the vast array of open-source cybersecurity analysis reports on the web,we have developed the Parallel Deep Forest-based Multi-Label Classification(PDFMLC)algorithm.Initial... To efficiently mine threat intelligence from the vast array of open-source cybersecurity analysis reports on the web,we have developed the Parallel Deep Forest-based Multi-Label Classification(PDFMLC)algorithm.Initially,open-source cybersecurity analysis reports are collected and converted into a standardized text format.Subsequently,five tactics category labels are annotated,creating a multi-label dataset for tactics classification.Addressing the limitations of low execution efficiency and scalability in the sequential deep forest algorithm,our PDFMLC algorithm employs broadcast variables and the Lempel-Ziv-Welch(LZW)algorithm,significantly enhancing its acceleration ratio.Furthermore,our proposed PDFMLC algorithm incorporates label mutual information from the established dataset as input features.This captures latent label associations,significantly improving classification accuracy.Finally,we present the PDFMLC-based Threat Intelligence Mining(PDFMLC-TIM)method.Experimental results demonstrate that the PDFMLC algorithm exhibits exceptional node scalability and execution efficiency.Simultaneously,the PDFMLC-TIM method proficiently conducts text classification on cybersecurity analysis reports,extracting tactics entities to construct comprehensive threat intelligence.As a result,successfully formatted STIX2.1 threat intelligence is established. 展开更多
关键词 unstructured big data mining parallel deep forest multi-label classification algorithm threat intelligence
原文传递
Video Recommendation System Using Machine-Learning Techniques
19
作者 Meesala Sravani Ch Vidyadhari S Anjali Devi 《Journal of Harbin Institute of Technology(New Series)》 CAS 2024年第4期24-33,共10页
In the realm of contemporary artificial intelligence,machine learning enables automation,allowing systems to naturally acquire and enhance their capabilities through learning.In this cycle,Video recommendation is fini... In the realm of contemporary artificial intelligence,machine learning enables automation,allowing systems to naturally acquire and enhance their capabilities through learning.In this cycle,Video recommendation is finished by utilizing machine learning strategies.A suggestion framework is an interaction of data sifting framework,which is utilized to foresee the“rating”or“inclination”given by the different clients.The expectation depends on past evaluations,history,interest,IMDB rating,and so on.This can be carried out by utilizing collective and substance-based separating approaches which utilize the data given by the different clients,examine them,and afterward suggest the video that suits the client at that specific time.The required datasets for the video are taken from Grouplens.This recommender framework is executed by utilizing Python Programming Language.For building this video recommender framework,two calculations are utilized,for example,K-implies Clustering and KNN grouping.K-implies is one of the unaided AI calculations and the fundamental goal is to bunch comparable sort of information focuses together and discover the examples.For that K-implies searches for a steady‘k'of bunches in a dataset.A group is an assortment of information focuses collected due to specific similitudes.K-Nearest Neighbor is an administered learning calculation utilized for characterization,with the given information;KNN can group new information by examination of the‘k'number of the closest information focuses.The last qualities acquired are through bunching qualities and root mean squared mistake,by using this algorithm we can recommend videos more appropriately based on user previous records and ratings. 展开更多
关键词 video recommendation system KNN algorithms collaborative filtering content⁃based filtering classification algorithms artificial intelligence
下载PDF
A review of land use/land cover change mapping in the China-Central Asia-West Asia economic corridor countries 被引量:2
20
作者 Amin Naboureh Jinhu Bian +1 位作者 Guangbin Lei Ainong Li 《Big Earth Data》 EI 2021年第2期237-257,共21页
Large-scale projects,such as the construction of railways and highways,usually cause an extensive Land Use Land Cover Change(LULCC).The China-Central Asia-West Asia Economic Corridor(CCAWAEC),one key large-scale proje... Large-scale projects,such as the construction of railways and highways,usually cause an extensive Land Use Land Cover Change(LULCC).The China-Central Asia-West Asia Economic Corridor(CCAWAEC),one key large-scale project of the Belt and Road Initiative(BRI),covers a region that is home to more than 1.6 billion people.Although numerous studies have been conducted on strategies and the economic potential of the Economic Corridor,reviewing LULCC mapping studies in this area has not been studied.This study provides a comprehensive review of the recent research progress and discusses the challenges in LULCC monitoring and driving factors identifying in the study area.The review will be helpful for the decision-making of sustainable development and construction in the Economic Corridor.To this end,350 peer-reviewed journal and conference papers,as well as book chapters were analyzed based on 17 attributes,such as main driving factors of LULCC,data collection methods,classification algorithms,and accuracy assessment methods.It was observed that:(1)rapid urbanization,industrialization,population growth,and climate change have been recognized as major causes of LULCC in the study area;(2)LULCC has,directly and indirectly,caused several environmental issues,such as biodiversity loss,air pollution,water pollution,desertification,and land degradation;(3)there is a lack of well-annotated national land use data in the region;(4)there is a lack of reliable training and reference datasets to accurately study the long-term LULCC in most parts of the study area;and(5)several technical issues still require more attention from the scientific community.Finally,several recommendations were proposed to address the identified issues. 展开更多
关键词 Land use change land cover change China-Central Asia-West Asia Economic Corridor accuracy assessment reference data classification algorithm
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部