期刊文献+
共找到4,543篇文章
< 1 2 228 >
每页显示 20 50 100
A Power Data Anomaly Detection Model Based on Deep Learning with Adaptive Feature Fusion
1
作者 Xiu Liu Liang Gu +3 位作者 Xin Gong Long An Xurui Gao Juying Wu 《Computers, Materials & Continua》 SCIE EI 2024年第6期4045-4061,共17页
With the popularisation of intelligent power,power devices have different shapes,numbers and specifications.This means that the power data has distributional variability,the model learning process cannot achieve suffi... With the popularisation of intelligent power,power devices have different shapes,numbers and specifications.This means that the power data has distributional variability,the model learning process cannot achieve sufficient extraction of data features,which seriously affects the accuracy and performance of anomaly detection.Therefore,this paper proposes a deep learning-based anomaly detection model for power data,which integrates a data alignment enhancement technique based on random sampling and an adaptive feature fusion method leveraging dimension reduction.Aiming at the distribution variability of power data,this paper developed a sliding window-based data adjustment method for this model,which solves the problem of high-dimensional feature noise and low-dimensional missing data.To address the problem of insufficient feature fusion,an adaptive feature fusion method based on feature dimension reduction and dictionary learning is proposed to improve the anomaly data detection accuracy of the model.In order to verify the effectiveness of the proposed method,we conducted effectiveness comparisons through elimination experiments.The experimental results show that compared with the traditional anomaly detection methods,the method proposed in this paper not only has an advantage in model accuracy,but also reduces the amount of parameter calculation of the model in the process of feature matching and improves the detection speed. 展开更多
关键词 data alignment dimension reduction feature fusion data anomaly detection deep learning
下载PDF
Machine Learning Security Defense Algorithms Based on Metadata Correlation Features
2
作者 Ruchun Jia Jianwei Zhang Yi Lin 《Computers, Materials & Continua》 SCIE EI 2024年第2期2391-2418,共28页
With the popularization of the Internet and the development of technology,cyber threats are increasing day by day.Threats such as malware,hacking,and data breaches have had a serious impact on cybersecurity.The networ... With the popularization of the Internet and the development of technology,cyber threats are increasing day by day.Threats such as malware,hacking,and data breaches have had a serious impact on cybersecurity.The network security environment in the era of big data presents the characteristics of large amounts of data,high diversity,and high real-time requirements.Traditional security defense methods and tools have been unable to cope with the complex and changing network security threats.This paper proposes a machine-learning security defense algorithm based on metadata association features.Emphasize control over unauthorized users through privacy,integrity,and availability.The user model is established and the mapping between the user model and the metadata of the data source is generated.By analyzing the user model and its corresponding mapping relationship,the query of the user model can be decomposed into the query of various heterogeneous data sources,and the integration of heterogeneous data sources based on the metadata association characteristics can be realized.Define and classify customer information,automatically identify and perceive sensitive data,build a behavior audit and analysis platform,analyze user behavior trajectories,and complete the construction of a machine learning customer information security defense system.The experimental results show that when the data volume is 5×103 bit,the data storage integrity of the proposed method is 92%.The data accuracy is 98%,and the success rate of data intrusion is only 2.6%.It can be concluded that the data storage method in this paper is safe,the data accuracy is always at a high level,and the data disaster recovery performance is good.This method can effectively resist data intrusion and has high air traffic control security.It can not only detect all viruses in user data storage,but also realize integrated virus processing,and further optimize the security defense effect of user big data. 展开更多
关键词 data-oriented architecture METAdata correlation features machine learning security defense data source integration
下载PDF
Terrorism Attack Classification Using Machine Learning: The Effectiveness of Using Textual Features Extracted from GTD Dataset
3
作者 Mohammed Abdalsalam Chunlin Li +1 位作者 Abdelghani Dahou Natalia Kryvinska 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第2期1427-1467,共41页
One of the biggest dangers to society today is terrorism, where attacks have become one of the most significantrisks to international peace and national security. Big data, information analysis, and artificial intelli... One of the biggest dangers to society today is terrorism, where attacks have become one of the most significantrisks to international peace and national security. Big data, information analysis, and artificial intelligence (AI) havebecome the basis for making strategic decisions in many sensitive areas, such as fraud detection, risk management,medical diagnosis, and counter-terrorism. However, there is still a need to assess how terrorist attacks are related,initiated, and detected. For this purpose, we propose a novel framework for classifying and predicting terroristattacks. The proposed framework posits that neglected text attributes included in the Global Terrorism Database(GTD) can influence the accuracy of the model’s classification of terrorist attacks, where each part of the datacan provide vital information to enrich the ability of classifier learning. Each data point in a multiclass taxonomyhas one or more tags attached to it, referred as “related tags.” We applied machine learning classifiers to classifyterrorist attack incidents obtained from the GTD. A transformer-based technique called DistilBERT extracts andlearns contextual features from text attributes to acquiremore information from text data. The extracted contextualfeatures are combined with the “key features” of the dataset and used to perform the final classification. Thestudy explored different experimental setups with various classifiers to evaluate the model’s performance. Theexperimental results show that the proposed framework outperforms the latest techniques for classifying terroristattacks with an accuracy of 98.7% using a combined feature set and extreme gradient boosting classifier. 展开更多
关键词 Artificial intelligence machine learning natural language processing data analytic DistilBERT feature extraction terrorism classification GTD dataset
下载PDF
Mapping winter wheat using phenological feature of peak before winter on the North China Plain based on time-series MODIS data 被引量:17
4
作者 TAO Jian-bin WU Wen-bin +2 位作者 ZHOU Yong WANG Yu JIANG Yan 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2017年第2期348-359,共12页
By employing the unique phenological feature of winter wheat extracted from peak before winter (PBW) and the advantages of moderate resolution imaging spectroradiometer (MODIS) data with high temporal resolution a... By employing the unique phenological feature of winter wheat extracted from peak before winter (PBW) and the advantages of moderate resolution imaging spectroradiometer (MODIS) data with high temporal resolution and intermediate spatial resolution, a remote sensing-based model for mapping winter wheat on the North China Plain was built through integration with Landsat images and land-use data. First, a phenological window, PBW was drawn from time-series MODIS data. Next, feature extraction was performed for the PBW to reduce feature dimension and enhance its information. Finally, a regression model was built to model the relationship of the phenological feature and the sample data. The amount of information of the PBW was evaluated and compared with that of the main peak (MP). The relative precision of the mapping reached up to 92% in comparison to the Landsat sample data, and ranged between 87 and 96% in comparison to the statistical data. These results were sufficient to satisfy the accuracy requirements for winter wheat mapping at a large scale. Moreover, the proposed method has the ability to obtain the distribution information for winter wheat in an earlier period than previous studies. This study could throw light on the monitoring of winter wheat in China by using unique phenological feature of winter wheat. 展开更多
关键词 time-series MODIS data phenological feature peak before wintering winter wheat mapping
下载PDF
Identifying Metabolite and Protein Biomarkers in Unstable Angina In-patients by Feature Selection Based Data Mining Method 被引量:8
5
作者 SHI Cheng-he ZHAO Hui-hui +8 位作者 HOU Na CHEN Jian-xin SHI Qi XU Xue-gong WANG Juan ZHENG Cheng-long ZHAO Ling-yan YANG Yi WANG Wei 《Chemical Research in Chinese Universities》 SCIE CAS CSCD 2011年第1期87-93,共7页
Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is... Unstable angina(UA) is the most dangerous type of Coronary Heart Disease(CHD) to cause more and more mortal and morbid world wide. Identification of biomarkers for UA at the level of proteomics and metabolomics is a better avenue to understand the inner mechanism of it. Feature selection based data mining method is better suited to identify biomarkers of UA. In this study, we carried out clinical epidemiology to collect plasmas of UA in-patients and controls. Proteomics and metabolomics data were obtained via two-dimensional difference gel electrophoresis and gas chromatography techniques. We presented a novel computational strategy to select biomarkers as few as possible for UA in the two groups of data. Firstly, decision tree was used to select biomarkers for UA and 3-fold cross validation was used to evaluate computational performanees for the three methods. Alternatively, we combined inde- pendent t test and classification based data mining method as well as backward elimination technique to select, as few as possible, protein and metabolite biomarkers with best classification performances. By the method, we selected 6 proteins and 5 metabolites for UA. The novel method presented here provides a better insight into the pathology of a disease. 展开更多
关键词 BIOMARKER Metabolomics PROTEOME feature selection data mining Unstable angina
下载PDF
An Embedded Feature Selection Method for Imbalanced Data Classification 被引量:16
6
作者 Haoyue Liu MengChu Zhou Qing Liu 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2019年第3期703-715,共13页
Imbalanced data is one type of datasets that are frequently found in real-world applications, e.g., fraud detection and cancer diagnosis. For this type of datasets, improving the accuracy to identify their minority cl... Imbalanced data is one type of datasets that are frequently found in real-world applications, e.g., fraud detection and cancer diagnosis. For this type of datasets, improving the accuracy to identify their minority class is a critically important issue.Feature selection is one method to address this issue. An effective feature selection method can choose a subset of features that favor in the accurate determination of the minority class. A decision tree is a classifier that can be built up by using different splitting criteria. Its advantage is the ease of detecting which feature is used as a splitting node. Thus, it is possible to use a decision tree splitting criterion as a feature selection method. In this paper, an embedded feature selection method using our proposed weighted Gini index(WGI) is proposed. Its comparison results with Chi2, F-statistic and Gini index feature selection methods show that F-statistic and Chi2 reach the best performance when only a few features are selected. As the number of selected features increases, our proposed method has the highest probability of achieving the best performance. The area under a receiver operating characteristic curve(ROC AUC) and F-measure are used as evaluation criteria. Experimental results with two datasets show that ROC AUC performance can be high, even if only a few features are selected and used, and only changes slightly as more and more features are selected. However, the performance of Fmeasure achieves excellent performance only if 20% or more of features are chosen. The results are helpful for practitioners to select a proper feature selection method when facing a practical problem. 展开更多
关键词 Classification and regression TREE feature selection imbalanced data WEIGHTED GINI INDEX (WGI)
下载PDF
Effective and Efficient Feature Selection for Large-scale Data Using Bayes' Theorem 被引量:7
7
作者 Subramanian Appavu Alias Balamurugan Ramasamy Rajaram 《International Journal of Automation and computing》 EI 2009年第1期62-71,共10页
This paper proposes one method of feature selection by using Bayes' theorem. The purpose of the proposed method is to reduce the computational complexity and increase the classification accuracy of the selected featu... This paper proposes one method of feature selection by using Bayes' theorem. The purpose of the proposed method is to reduce the computational complexity and increase the classification accuracy of the selected feature subsets. The dependence between two attributes (binary) is determined based on the probabilities of their joint values that contribute to positive and negative classification decisions. If opposing sets of attribute values do not lead to opposing classification decisions (zero probability), then the two attributes are considered independent of each other, otherwise dependent, and one of them can be removed and thus the number of attributes is reduced. The process must be repeated on all combinations of attributes. The paper also evaluates the approach by comparing it with existing feature selection algorithms over 8 datasets from University of California, Irvine (UCI) machine learning databases. The proposed method shows better results in terms of number of selected features, classification accuracy, and running time than most existing algorithms. 展开更多
关键词 data mining CLASSIFICATION feature selection dimensionality reduction Bayes' theorem.
下载PDF
TIDAL FEATURES IN THE CHINA SEAS AND THEIR ADJACENT SEA AREAS AS DERIVED FROM TOPEX/POSEIDON ALTIMETER DATA 被引量:9
8
作者 胡建宇 Hiroshi KAWAMURA +2 位作者 洪华生 Fumiaki KOBASⅢ 谢强 《Chinese Journal of Oceanology and Limnology》 SCIE CAS CSCD 2001年第4期293-305,共13页
Some important tidal features of 8 major tidal constituents ( M 2, S 2, K 1, O 1, P 1, Sa, N 2 and K 2 ) in the China Seas and their adjacent sea areas were obtained using six years’ TOPEX/POSEIDON altimeter data. Th... Some important tidal features of 8 major tidal constituents ( M 2, S 2, K 1, O 1, P 1, Sa, N 2 and K 2 ) in the China Seas and their adjacent sea areas were obtained using six years’ TOPEX/POSEIDON altimeter data. The results showed that the obtained co tidal and co range charts for these major tidal constituents agreed well with those of previous researches using observational data from coastal tidal gauge stations and numerical models. 展开更多
关键词 TOPEX/POSEIDON altimeter data tidal features the China Seas tidal features
下载PDF
Importance of Features Selection,Attributes Selection,Challenges and Future Directions for Medical Imaging Data:A Review 被引量:6
9
作者 Nazish Naheed Muhammad Shaheen +2 位作者 Sajid Ali Khan Mohammed Alawairdhi Muhammad Attique Khan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第10期315-344,共30页
In the area of pattern recognition and machine learning,features play a key role in prediction.The famous applications of features are medical imaging,image classification,and name a few more.With the exponential grow... In the area of pattern recognition and machine learning,features play a key role in prediction.The famous applications of features are medical imaging,image classification,and name a few more.With the exponential growth of information investments in medical data repositories and health service provision,medical institutions are collecting large volumes of data.These data repositories contain details information essential to support medical diagnostic decisions and also improve patient care quality.On the other hand,this growth also made it difficult to comprehend and utilize data for various purposes.The results of imaging data can become biased because of extraneous features present in larger datasets.Feature selection gives a chance to decrease the number of components in such large datasets.Through selection techniques,ousting the unimportant features and selecting a subset of components that produces prevalent characterization precision.The correct decision to find a good attribute produces a precise grouping model,which enhances learning pace and forecast control.This paper presents a review of feature selection techniques and attributes selection measures for medical imaging.This review is meant to describe feature selection techniques in a medical domainwith their pros and cons and to signify its application in imaging data and data mining algorithms.The review reveals the shortcomings of the existing feature and attributes selection techniques to multi-sourced data.Moreover,this review provides the importance of feature selection for correct classification of medical infections.In the end,critical analysis and future directions are provided. 展开更多
关键词 Medical imaging imaging data feature selection data mining attribute selection medical challenges future directions
下载PDF
Magnetic resonance imaging ancillary features used in Liver Imaging Reporting and Data System:An illustrative review 被引量:3
10
作者 David Campos-Correia Joao Cruz +2 位作者 António P Matos Filipa Figueiredo Miguel Ramalho 《World Journal of Radiology》 CAS 2018年第2期9-23,共15页
Hepatocellular carcinoma (HCC) usually develops in the setting of chronic liver disease. In the adequate clinical context, both multiphasic contrast-enhanced CT and magnetic resonance imaging are non-invasive modaliti... Hepatocellular carcinoma (HCC) usually develops in the setting of chronic liver disease. In the adequate clinical context, both multiphasic contrast-enhanced CT and magnetic resonance imaging are non-invasive modalities that allow accurate diagnosis and staging of HCC, although the latter demonstrates greater sensitivity and specificity. Imaging criteria for HCC diagnosis rely on hemodynamic features such as hyperenhancement in the arterial phase and washout in the portal or equilibrium phase. However, imaging performance drops considerably for small (< 20 mm) nodules because their tendency to exhibit atypical enhancement patterns. In order to improve accuracy in the diagnosis and staging of HCC, particularly in cases of atypical nodules, ancillary features, i.e., imaging characteristics that modify the likelihood of HCC, have been described and incorporated into clinical reports, especially in Liver Imaging Reporting and Data System. In this paper, ancillary imaging features will be reviewed and illustrated. 展开更多
关键词 HEPATOCELLULAR carcinoma Ancillary featureS Magnetic resonance IMAGING LIVER IMAGING REPORTING and data System CIRRHOSIS LIVER
下载PDF
Feature Selection with Optimal Stacked Sparse Autoencoder for Data Mining 被引量:4
11
作者 Manar Ahmed Hamza Siwar Ben Haj Hassine +5 位作者 Ibrahim Abunadi Fahd N.Al-Wesabi Hadeel Alsolai Anwer Mustafa Hilal Ishfaq Yaseen Abdelwahed Motwakel 《Computers, Materials & Continua》 SCIE EI 2022年第8期2581-2596,共16页
Data mining in the educational field can be used to optimize the teaching and learning performance among the students.The recently developed machine learning(ML)and deep learning(DL)approaches can be utilized to mine ... Data mining in the educational field can be used to optimize the teaching and learning performance among the students.The recently developed machine learning(ML)and deep learning(DL)approaches can be utilized to mine the data effectively.This study proposes an Improved Sailfish Optimizer-based Feature SelectionwithOptimal Stacked Sparse Autoencoder(ISOFS-OSSAE)for data mining and pattern recognition in the educational sector.The proposed ISOFS-OSSAE model aims to mine the educational data and derive decisions based on the feature selection and classification process.Moreover,the ISOFS-OSSAEmodel involves the design of the ISOFS technique to choose an optimal subset of features.Moreover,the swallow swarm optimization(SSO)with the SSAE model is derived to perform the classification process.To showcase the enhanced outcomes of the ISOFSOSSAE model,a wide range of experiments were taken place on a benchmark dataset from the University of California Irvine(UCI)Machine Learning Repository.The simulation results pointed out the improved classification performance of the ISOFS-OSSAE model over the recent state of art approaches interms of different performance measures. 展开更多
关键词 data mining pattern recognition feature selection data classification SSAE model
下载PDF
Improving Knowledge Based Spam Detection Methods: The Effect of Malicious Related Features in Imbalance Data Distribution 被引量:5
12
作者 Jafar Alqatawna Hossam Faris +2 位作者 Khalid Jaradat Malek Al-Zewairi Omar Adwan 《International Journal of Communications, Network and System Sciences》 2015年第5期118-129,共12页
Spam is no longer just commercial unsolicited email messages that waste our time, it consumes network traffic and mail servers’ storage. Furthermore, spam has become a major component of several attack vectors includ... Spam is no longer just commercial unsolicited email messages that waste our time, it consumes network traffic and mail servers’ storage. Furthermore, spam has become a major component of several attack vectors including attacks such as phishing, cross-site scripting, cross-site request forgery and malware infection. Statistics show that the amount of spam containing malicious contents increased compared to the one advertising legitimate products and services. In this paper, the issue of spam detection is investigated with the aim to develop an efficient method to identify spam email based on the analysis of the content of email messages. We identify a set of features that have a considerable number of malicious related features. Our goal is to study the effect of these features in helping the classical classifiers in identifying spam emails. To make the problem more challenging, we developed spam classification models based on imbalanced data where spam emails form the rare class with only 16.5% of the total emails. Different metrics were utilized in the evaluation of the developed models. Results show noticeable improvement of spam classification models when trained by dataset that includes malicious related features. 展开更多
关键词 SPAM E-MAIL MALICIOUS SPAM SPAM Detection SPAM featureS Security Mechanism data Mining
下载PDF
An intelligent prediction model of epidemic characters based on multi-feature
13
作者 Xiaoying Wang Chunmei Li +6 位作者 Yilei Wang Lin Yin Qilin Zhou Rui Zheng Qingwu Wu Yuqi Zhou Min Dai 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第3期595-607,共13页
The epidemic characters of Omicron(e.g.large-scale transmission)are significantly different from the initial variants of COVID-19.The data generated by large-scale transmission is important to predict the trend of epi... The epidemic characters of Omicron(e.g.large-scale transmission)are significantly different from the initial variants of COVID-19.The data generated by large-scale transmission is important to predict the trend of epidemic characters.However,the re-sults of current prediction models are inaccurate since they are not closely combined with the actual situation of Omicron transmission.In consequence,these inaccurate results have negative impacts on the process of the manufacturing and the service industry,for example,the production of masks and the recovery of the tourism industry.The authors have studied the epidemic characters in two ways,that is,investigation and prediction.First,a large amount of data is collected by utilising the Baidu index and conduct questionnaire survey concerning epidemic characters.Second,theβ-SEIDR model is established,where the population is classified as Susceptible,Exposed,Infected,Dead andβ-Recovered persons,to intelligently predict the epidemic characters of COVID-19.Note thatβ-Recovered persons denote that the Recovered persons may become Sus-ceptible persons with probabilityβ.The simulation results show that the model can accurately predict the epidemic characters. 展开更多
关键词 artificial intelligence big data data analysis evaluation feature extraction intelligent information processing medical applications
下载PDF
A novel technique for automatic seismic data processing using both integral and local feature of seismograms 被引量:3
14
作者 Ping Jin Chengliu Zhang +4 位作者 Xufeng Shen Hongchun Wang Changzhou Pan Na Lu Xiong Xu 《Earthquake Science》 2014年第3期337-349,共13页
A novel technique for automatic seismic data processing using both integral and local feature of seismograms was presented in this paper. Here, the term integral feature of seismograms refers to feature which may depi... A novel technique for automatic seismic data processing using both integral and local feature of seismograms was presented in this paper. Here, the term integral feature of seismograms refers to feature which may depict the shape of the whole seismograms. However, unlike some previous efforts which completely abandon the DIAL approach, i.e., signal detection, phase identifi- cation, association, and event localization, and seek to use envelope cross-correlation to detect seismic events directly, our technique keeps following the DIAL approach, but in addition to detect signals corresponding to individual seismic phases, it also detects continuous wave-trains and explores their feature for phase-type identification and signal association. More concrete ideas about how to define wave-trains and combine them with various detections, as well as how to measure and utilize their feature in the seismic data processing were expatiated in the paper. This approach has been applied to the routine data processing by us for years, and test results for a 16 days' period using data from the Xinjiang seismic station network were presented. The automatic processing results have fairly low false and missed event rate simultaneously, showing that the new technique has good application prospects for improvement of the automatic seismic data processing. 展开更多
关键词 Seismic - Automatic data processing feature of seismograms
下载PDF
T_(BB) DATA-REVEALED FEATURES OF ASIAN-AUSTRALIAN MONSOON SEASONAL TRANSITION AND ASIAN SUMMER MONSOOM ESTABLISHMENT 被引量:3
15
作者 何金海 朱乾根 《Journal of Tropical Meteorology》 SCIE 1997年第1期18-26,共9页
Based on TBB data from Meteorological Institute Research of Japan, study is carried out of the features of seasonal transition of Asian-Australian monsoons and Asian summer monsoon establishment,indicating that the tr... Based on TBB data from Meteorological Institute Research of Japan, study is carried out of the features of seasonal transition of Asian-Australian monsoons and Asian summer monsoon establishment,indicating that the transition begins as early as in April, followed by abrupt change in May-June; the Asian summer monsoon situation is fully established in June. The winter convective center in Sumatra moved steadily northwestward across the "land bridge" of the maritime continent and the Indo-China Peninsula as time goes from winter to summer, thus giving rise to the change in large scale circulations that is responsible for the summer monsoon establishment over SE Asia and India; the South China Sea to the western Pacific summer monsoon onset bears a close relation to the active convection in the Indo China Peninsula and steady eastward retreat of the subtropical TBB high-value band,corresponding to the western Pacific subtropical high. 展开更多
关键词 T_(BB) data Asian-Australian monsoon reston seasonal transition features of summer monsoon establishment
下载PDF
Curve Classification Based onMean-Variance Feature Weighting and Its Application
16
作者 Zewen Zhang Sheng Zhou Chunzheng Cao 《Computers, Materials & Continua》 SCIE EI 2024年第5期2465-2480,共16页
The classification of functional data has drawn much attention in recent years.The main challenge is representing infinite-dimensional functional data by finite-dimensional features while utilizing those features to a... The classification of functional data has drawn much attention in recent years.The main challenge is representing infinite-dimensional functional data by finite-dimensional features while utilizing those features to achieve better classification accuracy.In this paper,we propose a mean-variance-based(MV)feature weighting method for classifying functional data or functional curves.In the feature extraction stage,each sample curve is approximated by B-splines to transfer features to the coefficients of the spline basis.After that,a feature weighting approach based on statistical principles is introduced by comprehensively considering the between-class differences and within-class variations of the coefficients.We also introduce a scaling parameter to adjust the gap between the weights of features.The new feature weighting approach can adaptively enhance noteworthy local features while mitigating the impact of confusing features.The algorithms for feature weighted K-nearest neighbor and support vector machine classifiers are both provided.Moreover,the new approach can be well integrated into existing functional data classifiers,such as the generalized functional linear model and functional linear discriminant analysis,resulting in a more accurate classification.The performance of the mean-variance-based classifiers is evaluated by simulation studies and real data.The results show that the newfeatureweighting approach significantly improves the classification accuracy for complex functional data. 展开更多
关键词 Functional data analysis CLASSIFICATION feature weighting B-SPLINES
下载PDF
Hierarchical Optimization Method for Federated Learning with Feature Alignment and Decision Fusion
17
作者 Ke Li Xiaofeng Wang Hu Wang 《Computers, Materials & Continua》 SCIE EI 2024年第10期1391-1407,共17页
In the realm of data privacy protection,federated learning aims to collaboratively train a global model.However,heterogeneous data between clients presents challenges,often resulting in slow convergence and inadequate... In the realm of data privacy protection,federated learning aims to collaboratively train a global model.However,heterogeneous data between clients presents challenges,often resulting in slow convergence and inadequate accuracy of the global model.Utilizing shared feature representations alongside customized classifiers for individual clients emerges as a promising personalized solution.Nonetheless,previous research has frequently neglected the integration of global knowledge into local representation learning and the synergy between global and local classifiers,thereby limiting model performance.To tackle these issues,this study proposes a hierarchical optimization method for federated learning with feature alignment and the fusion of classification decisions(FedFCD).FedFCD regularizes the relationship between global and local feature representations to achieve alignment and incorporates decision information from the global classifier,facilitating the late fusion of decision outputs from both global and local classifiers.Additionally,FedFCD employs a hierarchical optimization strategy to flexibly optimize model parameters.Through experiments on the Fashion-MNIST,CIFAR-10 and CIFAR-100 datasets,we demonstrate the effectiveness and superiority of FedFCD.For instance,on the CIFAR-100 dataset,FedFCD exhibited a significant improvement in average test accuracy by 6.83%compared to four outstanding personalized federated learning approaches.Furthermore,extended experiments confirm the robustness of FedFCD across various hyperparameter values. 展开更多
关键词 Federated learning data heterogeneity feature alignment decision fusion hierarchical optimization
下载PDF
A novel type of neural networks for feature engineering of geological data:Case studies of coal and gas hydrate-bearing sediments 被引量:3
18
作者 Lishuai Jiang Yang Zhao +2 位作者 Naser Golsanami Lianjun Chen Weichao Yan 《Geoscience Frontiers》 SCIE CAS CSCD 2020年第5期1511-1531,共21页
The nature of the measured data varies among different disciplines of geosciences.In rock engineering,features of data play a leading role in determining the feasible methods of its proper manipulation.The present stu... The nature of the measured data varies among different disciplines of geosciences.In rock engineering,features of data play a leading role in determining the feasible methods of its proper manipulation.The present study focuses on resolving one of the major deficiencies of conventional neural networks(NNs)in dealing with rock engineering data.Herein,since the samples are obtained from hundreds of meters below the surface with the utmost difficulty,the number of samples is always limited.Meanwhile,the experimental analysis of these samples may result in many repetitive values and 0 s.However,conventional neural networks are incapable of making robust models in the presence of such data.On the other hand,these networks strongly depend on the initial weights and bias values for making reliable predictions.With this in mind,the current research introduces a novel kind of neural network processing framework for the geological that does not suffer from the limitations of the conventional NNs.The introduced single-data-based feature engineering network extracts all the information wrapped in every single data point without being affected by the other points.This method,being completely different from the conventional NNs,re-arranges all the basic elements of the neuron model into a new structure.Therefore,its mathematical calculations were performed from the very beginning.Moreover,the corresponding programming codes were developed in MATLAB and Python since they could not be found in any common programming software at the time being.This new kind of network was first evaluated through computer-based simulations of rock cracks in the 3 DEC environment.After the model’s reliability was confirmed,it was adopted in two case studies for estimating respectively tensile strength and shear strength of real rock samples.These samples were coal core samples from the Southern Qinshui Basin of China,and gas hydrate-bearing sediment(GHBS)samples from the Nankai Trough of Japan.The coal samples used in the experiments underwent nuclear magnetic resonance(NMR)measurements,and Scanning Electron Microscopy(SEM)imaging to investigate their original micro and macro fractures.Once done with these experiments,measurement of the rock mechanical properties,including tensile strength,was performed using a rock mechanical test system.However,the shear strength of GHBS samples was acquired through triaxial and direct shear tests.According to the obtained result,the new network structure outperformed the conventional neural networks in both cases of simulation-based and case study estimations of the tensile and shear strength.Even though the proposed approach of the current study originally aimed at resolving the issue of having a limited dataset,its unique properties would also be applied to larger datasets from other subsurface measurements. 展开更多
关键词 Tensile strength Shear strength Gas Hydrate feature engineering Rock engineering data Neuron model
下载PDF
Structural features in the mid-southern section of the Kyushu–Palau Ridge based on satellite altimetry gravity anomaly
19
作者 Feifei Zhang Dingding Wang +3 位作者 Xiaolin Ji Fanghui Hou Yuan Yang Wanyin Wang 《Acta Oceanologica Sinica》 SCIE CAS CSCD 2024年第4期50-60,共11页
The Kyushu–Palau Ridge(KPR),an anti-S-shaped submarine highland at the center of the Philippine Sea Plate(PSP),is considered the residual arc of the Izu–Bonin–Mariana Island Arc,which retains key information about ... The Kyushu–Palau Ridge(KPR),an anti-S-shaped submarine highland at the center of the Philippine Sea Plate(PSP),is considered the residual arc of the Izu–Bonin–Mariana Island Arc,which retains key information about the cessation of the Western Philippine Basin(WPB)expansion and the Parece Vela Basin(PVB)breakup.Herein,using the new generation of satellite altimetry gravity data,high-precision seafloor topography data,and newly acquired ship-borne gravity data,the topographic and gravity characteristics of the KPR mid-southern section and adjacent region are depicted.The distribution characteristics of the faults were delineated using the normalized vertical derivative–total horizontal derivative method(NVDR-THDR)and the minimum curvature potential field separation method.The Moho depth and crustal thickness were inverted using the rapid inversion method for a double-interface model with depth constraints.Based on these results,the crust structure features in the KPR mid-southern section,and the“triangular”structure geological significance where the KPR and Central Basin Rift(CBR)of the WPB intersect are interpreted.The KPR crustal thickness is approximately 6–16 km,with a distinct discontinuity that is slightly thicker than the normal oceanic crust.The KPR mid-southern section crust structure was divided into four segments(S1–S4)from north to south,formed by the CBR eastward extension joint action and clockwise rotation of the PVB expansion axis and the Mindanao fault zone blocking effect. 展开更多
关键词 structural features satellite altimetry gravity data Kyushu-Palau Ridge Central Basin Rift FAULTS Moho depth
下载PDF
FP-STE: A Novel Node Failure Prediction Method Based on Spatio-Temporal Feature Extraction in Data Centers 被引量:2
20
作者 Yang Yang Jing Dong +2 位作者 Chao Fang Ping Xie Na An 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第6期1015-1031,共17页
The development of cloud computing and virtualization technology has brought great challenges to the reliability of data center services.Data centers typically contain a large number of compute and storage nodes which... The development of cloud computing and virtualization technology has brought great challenges to the reliability of data center services.Data centers typically contain a large number of compute and storage nodes which may fail and affect the quality of service.Failure prediction is an important means of ensuring service availability.Predicting node failure in cloud-based data centers is challenging because the failure symptoms reflected have complex characteristics,and the distribution imbalance between the failure sample and the normal sample is widespread,resulting in inaccurate failure prediction.Targeting these challenges,this paper proposes a novel failure prediction method FP-STE(Failure Prediction based on Spatio-temporal Feature Extraction).Firstly,an improved recurrent neural network HW-GRU(Improved GRU based on HighWay network)and a convolutional neural network CNN are used to extract the temporal features and spatial features of multivariate data respectively to increase the discrimination of different types of failure symptoms which improves the accuracy of prediction.Then the intermediate results of the two models are added as features into SCSXGBoost to predict the possibility and the precise type of node failure in the future.SCS-XGBoost is an ensemble learning model that is improved by the integrated strategy of oversampling and cost-sensitive learning.Experimental results based on real data sets confirm the effectiveness and superiority of FP-STE. 展开更多
关键词 Failure prediction data center features extraction XGBoost service availability
下载PDF
上一页 1 2 228 下一页 到第
使用帮助 返回顶部