期刊文献+
共找到9,766篇文章
< 1 2 250 >
每页显示 20 50 100
Finer topographic data improves distribution modeling of Picea crassifolia in the northern Qilian Mountains
1
作者 ZHANG Xiang GAO Linlin +3 位作者 LUO Yu YUAN Yiyun MA Baolong DENG Yang 《Journal of Mountain Science》 SCIE CSCD 2024年第10期3306-3317,共12页
The Qilian Mountains, a national key ecological function zone in Western China, play a pivotal role in ecosystem services. However, the distribution of its dominant tree species, Picea crassifolia (Qinghai spruce), ha... The Qilian Mountains, a national key ecological function zone in Western China, play a pivotal role in ecosystem services. However, the distribution of its dominant tree species, Picea crassifolia (Qinghai spruce), has decreased dramatically in the past decades due to climate change and human activity, which may have influenced its ecological functions. To restore its ecological functions, reasonable reforestation is the key measure. Many previous efforts have predicted the potential distribution of Picea crassifolia, which provides guidance on regional reforestation policy. However, all of them were performed at low spatial resolution, thus ignoring the natural characteristics of the patchy distribution of Picea crassifolia. Here, we modeled the distribution of Picea crassifolia with species distribution models at high spatial resolutions. For many models, the area under the receiver operating characteristic curve (AUC) is larger than 0.9, suggesting their excellent precision. The AUC of models at 30 m is higher than that of models at 90 m, and the current potential distribution of Picea crassifolia is more closely aligned with its actual distribution at 30 m, demonstrating that finer data resolution improves model performance. Besides, for models at 90 m resolution, annual precipitation (Bio12) played the paramount influence on the distribution of Picea crassifolia, while the aspect became the most important one at 30 m, indicating the crucial role of finer topographic data in modeling species with patchy distribution. The current distribution of Picea crassifolia was concentrated in the northern and central parts of the study area, and this pattern will be maintained under future scenarios, although some habitat loss in the central parts and gain in the eastern regions is expected owing to increasing temperatures and precipitation. Our findings can guide protective and restoration strategies for the Qilian Mountains, which would benefit regional ecological balance. 展开更多
关键词 Species distribution modeling Picea crassifolia High resolution topographic data Climate change Qilian Mountains Nature Reserve Climate scenarios
下载PDF
Blockchain-Enabled Federated Learning for Privacy-Preserving Non-IID Data Sharing in Industrial Internet
2
作者 Qiuyan Wang Haibing Dong +2 位作者 Yongfei Huang Zenglei Liu Yundong Gou 《Computers, Materials & Continua》 SCIE EI 2024年第8期1967-1983,共17页
Sharing data while protecting privacy in the industrial Internet is a significant challenge.Traditional machine learning methods require a combination of all data for training;however,this approach can be limited by d... Sharing data while protecting privacy in the industrial Internet is a significant challenge.Traditional machine learning methods require a combination of all data for training;however,this approach can be limited by data availability and privacy concerns.Federated learning(FL)has gained considerable attention because it allows for decentralized training on multiple local datasets.However,the training data collected by data providers are often non-independent and identically distributed(non-IID),resulting in poor FL performance.This paper proposes a privacy-preserving approach for sharing non-IID data in the industrial Internet using an FL approach based on blockchain technology.To overcome the problem of non-IID data leading to poor training accuracy,we propose dynamically updating the local model based on the divergence of the global and local models.This approach can significantly improve the accuracy of FL training when there is relatively large dispersion.In addition,we design a dynamic gradient clipping algorithm to alleviate the influence of noise on the model accuracy to reduce potential privacy leakage caused by sharing model parameters.Finally,we evaluate the performance of the proposed scheme using commonly used open-source image datasets.The simulation results demonstrate that our method can significantly enhance the accuracy while protecting privacy and maintaining efficiency,thereby providing a new solution to data-sharing and privacy-protection challenges in the industrial Internet. 展开更多
关键词 Federated learning data sharing non-iid data differential privacy blockchain
下载PDF
Data complexity-based batch sanitization method against poison in distributed learning
3
作者 Silv Wang Kai Fan +2 位作者 Kuan Zhang Hui Li Yintang Yang 《Digital Communications and Networks》 SCIE CSCD 2024年第2期416-428,共13页
The security of Federated Learning(FL)/Distributed Machine Learning(DML)is gravely threatened by data poisoning attacks,which destroy the usability of the model by contaminating training samples,so such attacks are ca... The security of Federated Learning(FL)/Distributed Machine Learning(DML)is gravely threatened by data poisoning attacks,which destroy the usability of the model by contaminating training samples,so such attacks are called causative availability indiscriminate attacks.Facing the problem that existing data sanitization methods are hard to apply to real-time applications due to their tedious process and heavy computations,we propose a new supervised batch detection method for poison,which can fleetly sanitize the training dataset before the local model training.We design a training dataset generation method that helps to enhance accuracy and uses data complexity features to train a detection model,which will be used in an efficient batch hierarchical detection process.Our model stockpiles knowledge about poison,which can be expanded by retraining to adapt to new attacks.Being neither attack-specific nor scenario-specific,our method is applicable to FL/DML or other online or offline scenarios. 展开更多
关键词 distributed machine learning security Federated learning data poisoning attacks data sanitization Batch detection data complexity
下载PDF
Research on Tensor Multi-Clustering Distributed Incremental Updating Method for Big Data
4
作者 Hongjun Zhang Zeyu Zhang +3 位作者 Yilong Ruan Hao Ye Peng Li Desheng Shi 《Computers, Materials & Continua》 SCIE EI 2024年第10期1409-1432,共24页
The scale and complexity of big data are growing continuously,posing severe challenges to traditional data processing methods,especially in the field of clustering analysis.To address this issue,this paper introduces ... The scale and complexity of big data are growing continuously,posing severe challenges to traditional data processing methods,especially in the field of clustering analysis.To address this issue,this paper introduces a new method named Big Data Tensor Multi-Cluster Distributed Incremental Update(BDTMCDIncreUpdate),which combines distributed computing,storage technology,and incremental update techniques to provide an efficient and effective means for clustering analysis.Firstly,the original dataset is divided into multiple subblocks,and distributed computing resources are utilized to process the sub-blocks in parallel,enhancing efficiency.Then,initial clustering is performed on each sub-block using tensor-based multi-clustering techniques to obtain preliminary results.When new data arrives,incremental update technology is employed to update the core tensor and factor matrix,ensuring that the clustering model can adapt to changes in data.Finally,by combining the updated core tensor and factor matrix with historical computational results,refined clustering results are obtained,achieving real-time adaptation to dynamic data.Through experimental simulation on the Aminer dataset,the BDTMCDIncreUpdate method has demonstrated outstanding performance in terms of accuracy(ACC)and normalized mutual information(NMI)metrics,achieving an accuracy rate of 90%and an NMI score of 0.85,which outperforms existing methods such as TClusInitUpdate and TKLClusUpdate in most scenarios.Therefore,the BDTMCDIncreUpdate method offers an innovative solution to the field of big data analysis,integrating distributed computing,incremental updates,and tensor-based multi-clustering techniques.It not only improves the efficiency and scalability in processing large-scale high-dimensional datasets but also has been validated for its effectiveness and accuracy through experiments.This method shows great potential in real-world applications where dynamic data growth is common,and it is of significant importance for advancing the development of data analysis technology. 展开更多
关键词 TENSOR incremental update distributED clustering processing big data
下载PDF
Big Data Application Simulation Platform Design for Onboard Distributed Processing of LEO Mega-Constellation Networks
5
作者 Zhang Zhikai Gu Shushi +1 位作者 Zhang Qinyu Xue Jiayin 《China Communications》 SCIE CSCD 2024年第7期334-345,共12页
Due to the restricted satellite payloads in LEO mega-constellation networks(LMCNs),remote sensing image analysis,online learning and other big data services desirably need onboard distributed processing(OBDP).In exist... Due to the restricted satellite payloads in LEO mega-constellation networks(LMCNs),remote sensing image analysis,online learning and other big data services desirably need onboard distributed processing(OBDP).In existing technologies,the efficiency of big data applications(BDAs)in distributed systems hinges on the stable-state and low-latency links between worker nodes.However,LMCNs with high-dynamic nodes and long-distance links can not provide the above conditions,which makes the performance of OBDP hard to be intuitively measured.To bridge this gap,a multidimensional simulation platform is indispensable that can simulate the network environment of LMCNs and put BDAs in it for performance testing.Using STK's APIs and parallel computing framework,we achieve real-time simulation for thousands of satellite nodes,which are mapped as application nodes through software defined network(SDN)and container technologies.We elaborate the architecture and mechanism of the simulation platform,and take the Starlink and Hadoop as realistic examples for simulations.The results indicate that LMCNs have dynamic end-to-end latency which fluctuates periodically with the constellation movement.Compared to ground data center networks(GDCNs),LMCNs deteriorate the computing and storage job throughput,which can be alleviated by the utilization of erasure codes and data flow scheduling of worker nodes. 展开更多
关键词 big data application Hadoop LEO mega-constellation multidimensional simulation onboard distributed processing
下载PDF
A Robust Framework for Multimodal Sentiment Analysis with Noisy Labels Generated from Distributed Data Annotation
6
作者 Kai Jiang Bin Cao Jing Fan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期2965-2984,共20页
Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and sha... Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and share such multimodal data.However,due to professional discrepancies among annotators and lax quality control,noisy labels might be introduced.Recent research suggests that deep neural networks(DNNs)will overfit noisy labels,leading to the poor performance of the DNNs.To address this challenging problem,we present a Multimodal Robust Meta Learning framework(MRML)for multimodal sentiment analysis to resist noisy labels and correlate distinct modalities simultaneously.Specifically,we propose a two-layer fusion net to deeply fuse different modalities and improve the quality of the multimodal data features for label correction and network training.Besides,a multiple meta-learner(label corrector)strategy is proposed to enhance the label correction approach and prevent models from overfitting to noisy labels.We conducted experiments on three popular multimodal datasets to verify the superiority of ourmethod by comparing it with four baselines. 展开更多
关键词 distributed data collection multimodal sentiment analysis meta learning learn with noisy labels
下载PDF
Correlation knowledge extraction based on data mining for distribution network planning 被引量:2
7
作者 Zhifang Zhu Zihan Lin +4 位作者 Liping Chen Hong Dong Yanna Gao Xinyi Liang Jiahao Deng 《Global Energy Interconnection》 EI CSCD 2023年第4期485-492,共8页
Traditional distribution network planning relies on the professional knowledge of planners,especially when analyzing the correlations between the problems existing in the network and the crucial influencing factors.Th... Traditional distribution network planning relies on the professional knowledge of planners,especially when analyzing the correlations between the problems existing in the network and the crucial influencing factors.The inherent laws reflected by the historical data of the distribution network are ignored,which affects the objectivity of the planning scheme.In this study,to improve the efficiency and accuracy of distribution network planning,the characteristics of distribution network data were extracted using a data-mining technique,and correlation knowledge of existing problems in the network was obtained.A data-mining model based on correlation rules was established.The inputs of the model were the electrical characteristic indices screened using the gray correlation method.The Apriori algorithm was used to extract correlation knowledge from the operational data of the distribution network and obtain strong correlation rules.Degree of promotion and chi-square tests were used to verify the rationality of the strong correlation rules of the model output.In this study,the correlation relationship between heavy load or overload problems of distribution network feeders in different regions and related characteristic indices was determined,and the confidence of the correlation rules was obtained.These results can provide an effective basis for the formulation of a distribution network planning scheme. 展开更多
关键词 distribution network planning data mining Apriori algorithm Gray correlation analysis Chi-square test
下载PDF
Exploring a New Lifetime Distribution for Modelling the Waiting Time of Bank Customers
8
作者 Simon A. Ogumeyo Jacob C. Ehiwario Festus C. Opone 《Journal of Applied Mathematics and Physics》 2024年第1期194-209,共16页
The fitting of lifetime distribution in real-life data has been studied in various fields of research. With the theory of evolution still applicable, more complex data from real-world scenarios will continue to emerge... The fitting of lifetime distribution in real-life data has been studied in various fields of research. With the theory of evolution still applicable, more complex data from real-world scenarios will continue to emerge. Despite this, many researchers have made commendable efforts to develop new lifetime distributions that can fit this complex data. In this paper, we utilized the KM-transformation technique to increase the flexibility of the power Lindley distribution, resulting in the Kavya-Manoharan Power Lindley (KMPL) distribution. We study the mathematical treatments of the KMPL distribution in detail and adapt the widely used method of maximum likelihood to estimate the unknown parameters of the KMPL distribution. We carry out a Monte Carlo simulation study to investigate the performance of the Maximum Likelihood Estimates (MLEs) of the parameters of the KMPL distribution. To demonstrate the effectiveness of the KMPL distribution for data fitting, we use a real dataset comprising the waiting time of 100 bank customers. We compare the KMPL distribution with other models that are extensions of the power Lindley distribution. Based on some statistical model selection criteria, the summary results of the analysis were in favor of the KMPL distribution. We further investigate the density fit and probability-probability (p-p) plots to validate the superiority of the KMPL distribution over the competing distributions for fitting the waiting time dataset. 展开更多
关键词 KM-Transformation Power Lindley distribution data Fitting MOMENTS QUANTILES
下载PDF
Similarity matching method of power distribution system operating data based on neural information retrieval
9
作者 Kai Xiao Daoxing Li +2 位作者 Pengtian Guo Xiaohui Wang Yong Chen 《Global Energy Interconnection》 EI CAS CSCD 2023年第1期15-25,共11页
Operation control of power systems has become challenging with an increase in the scale and complexity of power distribution systems and extensive access to renewable energy.Therefore,improvement of the ability of dat... Operation control of power systems has become challenging with an increase in the scale and complexity of power distribution systems and extensive access to renewable energy.Therefore,improvement of the ability of data-driven operation management,intelligent analysis,and mining is urgently required.To investigate and explore similar regularities of the historical operating section of the power distribution system and assist the power grid in obtaining high-value historical operation,maintenance experience,and knowledge by rule and line,a neural information retrieval model with an attention mechanism is proposed based on graph data computing technology.Based on the processing flow of the operating data of the power distribution system,a technical framework of neural information retrieval is established.Combined with the natural graph characteristics of the power distribution system,a unified graph data structure and a data fusion method of data access,data complement,and multi-source data are constructed.Further,a graph node feature-embedding representation learning algorithm and a neural information retrieval algorithm model are constructed.The neural information retrieval algorithm model is trained and tested using the generated graph node feature representation vector set.The model is verified on the operating section of the power distribution system of a provincial grid area.The results show that the proposed method demonstrates high accuracy in the similarity matching of historical operation characteristics and effectively supports intelligent fault diagnosis and elimination in power distribution systems. 展开更多
关键词 Neural information retrieval Power distribution Graph data Operating section Similarity matching
下载PDF
Risk Analysis Using Multi-Source Data for Distribution Networks Facing Extreme Natural Disasters
10
作者 Jun Yang Nannan Wang +1 位作者 Jiang Wang Yashuai Luo 《Energy Engineering》 EI 2023年第9期2079-2096,共18页
Distribution networks denote important public infrastructure necessary for people’s livelihoods.However,extreme natural disasters,such as earthquakes,typhoons,and mudslides,severely threaten the safe and stable opera... Distribution networks denote important public infrastructure necessary for people’s livelihoods.However,extreme natural disasters,such as earthquakes,typhoons,and mudslides,severely threaten the safe and stable operation of distribution networks and power supplies needed for daily life.Therefore,considering the requirements for distribution network disaster prevention and mitigation,there is an urgent need for in-depth research on risk assessment methods of distribution networks under extreme natural disaster conditions.This paper accessesmultisource data,presents the data quality improvement methods of distribution networks,and conducts data-driven active fault diagnosis and disaster damage analysis and evaluation using data-driven theory.Furthermore,the paper realizes real-time,accurate access to distribution network disaster information.The proposed approach performs an accurate and rapid assessment of cross-sectional risk through case study.The minimal average annual outage time can be reduced to 3 h/a in the ring network through case study.The approach proposed in this paper can provide technical support to the further improvement of the ability of distribution networks to cope with extreme natural disasters. 展开更多
关键词 distribution network disaster damage analysis fault judgment multi-source data
下载PDF
A Novel Approach to Design Distribution Preserving Framework for Big Data
11
作者 Mini Prince P.M.Joe Prathap 《Intelligent Automation & Soft Computing》 SCIE 2023年第3期2789-2803,共15页
In several fields like financial dealing,industry,business,medicine,et cetera,Big Data(BD)has been utilized extensively,which is nothing but a collection of a huge amount of data.However,it is highly complicated alon... In several fields like financial dealing,industry,business,medicine,et cetera,Big Data(BD)has been utilized extensively,which is nothing but a collection of a huge amount of data.However,it is highly complicated along with time-consuming to process a massive amount of data.Thus,to design the Distribution Preserving Framework for BD,a novel methodology has been proposed utilizing Manhattan Distance(MD)-centered Partition Around Medoid(MD–PAM)along with Conjugate Gradient Artificial Neural Network(CG-ANN),which undergoes various steps to reduce the complications of BD.Firstly,the data are processed in the pre-processing phase by mitigating the data repetition utilizing the map-reduce function;subsequently,the missing data are handled by substituting or by ignoring the missed values.After that,the data are transmuted into a normalized form.Next,to enhance the classification performance,the data’s dimensionalities are minimized by employing Gaussian Kernel(GK)-Fisher Discriminant Analysis(GK-FDA).Afterwards,the processed data is submitted to the partitioning phase after transmuting it into a structured format.In the partition phase,by utilizing the MD-PAM,the data are partitioned along with grouped into a cluster.Lastly,by employing CG-ANN,the data are classified in the classification phase so that the needed data can be effortlessly retrieved by the user.To analogize the outcomes of the CG-ANN with the prevailing methodologies,the NSL-KDD openly accessible datasets are utilized.The experiential outcomes displayed that an efficient result along with a reduced computation cost was shown by the proposed CG-ANN.The proposed work outperforms well in terms of accuracy,sensitivity and specificity than the existing systems. 展开更多
关键词 Big data artificial neural network fisher discriminant analysis distribution preserving framework manhattan distance
下载PDF
Load-balancing data distribution in publish/subscribe mode
12
作者 李凯 汪芸 +1 位作者 殷奕 袁飞飞 《Journal of Southeast University(English Edition)》 EI CAS 2014年第4期428-433,共6页
To improve data distribution efficiency a load-balancing data distribution LBDD method is proposed in publish/subscribe mode.In the LBDD method subscribers are involved in distribution tasks and data transfers while r... To improve data distribution efficiency a load-balancing data distribution LBDD method is proposed in publish/subscribe mode.In the LBDD method subscribers are involved in distribution tasks and data transfers while receiving data themselves.A dissemination tree is constructed among the subscribers based on MD5 where the publisher acts as the root. The proposed method provides bucket construction target selection and path updates furthermore the property of one-way dissemination is proven.That the average out-going degree of a node is 2 is guaranteed with the proposed LBDD.The experiments on data distribution delay data distribution rate and load distribution are conducted. Experimental results show that the LBDD method aids in shaping the task load between the publisher and subscribers and outperforms the point-to-point approach. 展开更多
关键词 data distribution publish/subscribe mode load balance dissemination tree
下载PDF
Analysis of the Global Swell Distributions Using ECMWF Re-Analyses Wind Wave Data 被引量:7
13
作者 ZHANG Jie WANG Weili GUAN Changlong 《Journal of Ocean University of China》 SCIE CAS 2011年第4期325-330,共6页
The existence of three well-defined tongue-shaped zones of swell dominance,termed as 'swell pools',in the Pacific,the Atlantic and the Indian Oceans,was reported by Chen et al.(2002)using satellite data.In thi... The existence of three well-defined tongue-shaped zones of swell dominance,termed as 'swell pools',in the Pacific,the Atlantic and the Indian Oceans,was reported by Chen et al.(2002)using satellite data.In this paper,the ECMWF Re-analyses wind wave data,including wind speed,significant wave height,averaged wave period and direction,are applied to verify the existence of these swell pools.The swell indices calculated from wave height,wave age and correlation coefficient are used to identify swell events.The wave age swell index can be more appropriately related to physical processes compared to the other two swell indices.Based on the ECMWF data the swell pools in the Pacific and the Atlantic Oceans are confirmed,but the expected swell pool in the Indian Ocean is not pronounced.The seasonal variations of global and hemispherical swell indices are investigated,and the argument that swells in the pools seemed to originate mostly from the winter hemisphere is supported by the seasonal variation of the averaged wave direction.The northward bending of the swell pools in the Pacific and the Atlantic Oceans in summer is not revealed by the ECMWF data.The swell pool in the Indian Ocean and the summer northward bending of the swell pools in the Pacific and the Atlan-tic Oceans need to be further verified by other datasets. 展开更多
关键词 global swell distribution swell index wave age ECMWF Re-analyses data
下载PDF
Vegetation NPP Distribution Based on MODIS Data and CASA Model——A Case Study of Northern Hebei Province 被引量:19
14
作者 YUAN Jinguo NIU Zheng WANG Chenli 《Chinese Geographical Science》 SCIE CSCD 2006年第4期334-341,共8页
Net Primary Productivity (NPP) is one of the important biophysical variables of vegetation activity, and it plays an important role in studying global carbon cycle, carbon source and sink of ecosystem, and spatial a... Net Primary Productivity (NPP) is one of the important biophysical variables of vegetation activity, and it plays an important role in studying global carbon cycle, carbon source and sink of ecosystem, and spatial and temporal distribution of CO2. Remote sensing can provide broad view quickly, timely and multi-temporally, which makes it an attractive and powerful tool for studying ecosystem primary productivity, at scales ranging from local to global. This paper aims to use Moderate Resolution Imaging Spectroradiometer (MODIS) data to estimate and analyze spatial and temporal distribution of NPP of the northern Hebei Province in 2001 based on Carnegie-Ames-Stanford Approach (CASA) model. The spatial distribution of Absorbed Photosynthetically Active Radiation (APAR) of vegetation and light use efficiency in three geographical subregions, that is, Bashang Plateau Region, Basin Region in the northwestern Hebei Province and Yanshan Mountainous Region in the Northern Hebei Province were analyzed, and total NPP spatial distribution of the study area in 2001 was discussed. Based on 16-day MODIS Fraction of Photosynthetically Active Radiation absorbed by vegetation (FPAR) product, 16-day composite NPP dynamics were calculated using CASA model; the seasonal dynamics of vegetation NPP in three subreglons were also analyzed. Result reveals that the total NPP of the study area in 2001 was 25.1877 × 10^6gC/(m^2.a), and NPP in 2001 ranged from 2 to 608gC/(m^2-a), with an average of 337.516gC/(m^2.a). NPP of the study area in 2001 accumulated mainly from May to September (DOY 129-272), high NIP values appeared from June to August (DOY 177-204), and the maximum NPP appeared from late July to mid-August (DOY 209-224). 展开更多
关键词 NPP distribution MODIS data CASA model Northvrn Hebei Province
下载PDF
On a Novel Extended Lomax Distribution with Asymmetric Properties and Its Statistical Applications
15
作者 Aisha Fayomi Christophe Chesneau +1 位作者 Farrukh Jamal Ali Algarni 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第9期2371-2403,共33页
In this article,we highlight a new three-parameter heavy-tailed lifetime distribution that aims to extend the modeling possibilities of the Lomax distribution.It is called the extended Lomax distribution.The considere... In this article,we highlight a new three-parameter heavy-tailed lifetime distribution that aims to extend the modeling possibilities of the Lomax distribution.It is called the extended Lomax distribution.The considered distribution naturally appears as the distribution of a transformation of a random variable following the logweighted power distribution recently introduced for percentage or proportion data analysis purposes.As a result,its cumulative distribution has the same functional basis as that of the Lomax distribution,but with a novel special logarithmic term depending on several parameters.The modulation of this logarithmic term reveals new types of asymetrical shapes,implying a modeling horizon beyond that of the Lomax distribution.In the first part,we examine several of its mathematical properties,such as the shapes of the related probability and hazard rate functions;stochastic comparisons;manageable expansions for various moments;and quantile properties.In particular,based on the quantile functions,various actuarial measures are discussed.In the second part,the distribution’s applicability is investigated with the use of themaximumlikelihood estimationmethod.The behavior of the obtained parameter estimates is validated by a simulation work.Insurance claim data are analyzed.We show that the proposed distribution outperforms eight well-known distributions,including the Lomax distribution and several extended Lomax distributions.In addition,we demonstrate that it gives preferable inferences from these competitor distributions in terms of risk measures. 展开更多
关键词 Lomax distribution extended Lomax distribution asymmetry actuarial measures maximum likelihood estimation data analysis
下载PDF
An Adaptive Privacy Preserving Framework for Distributed Association Rule Mining in Healthcare Databases
16
作者 Hasanien K.Kuba Mustafa A.Azzawi +2 位作者 Saad M.Darwish Oday A.Hassen Ansam A.Abdulhussein 《Computers, Materials & Continua》 SCIE EI 2023年第2期4119-4133,共15页
It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practit... It is crucial,while using healthcare data,to assess the advantages of data privacy against the possible drawbacks.Data from several sources must be combined for use in many data mining applications.The medical practitioner may use the results of association rule mining performed on this aggregated data to better personalize patient care and implement preventive measures.Historically,numerous heuristics(e.g.,greedy search)and metaheuristics-based techniques(e.g.,evolutionary algorithm)have been created for the positive association rule in privacy preserving data mining(PPDM).When it comes to connecting seemingly unrelated diseases and drugs,negative association rules may be more informative than their positive counterparts.It is well-known that during negative association rules mining,a large number of uninteresting rules are formed,making this a difficult problem to tackle.In this research,we offer an adaptive method for negative association rule mining in vertically partitioned healthcare datasets that respects users’privacy.The applied approach dynamically determines the transactions to be interrupted for information hiding,as opposed to predefining them.This study introduces a novel method for addressing the problem of negative association rules in healthcare data mining,one that is based on the Tabu-genetic optimization paradigm.Tabu search is advantageous since it removes a huge number of unnecessary rules and item sets.Experiments using benchmark healthcare datasets prove that the discussed scheme outperforms state-of-the-art solutions in terms of decreasing side effects and data distortions,as measured by the indicator of hiding failure. 展开更多
关键词 distributed data mining evolutionary computation sanitization process healthcare informatics
下载PDF
Research on Rolling Load Distribution Method based on Data Mining 被引量:1
17
作者 ZHANG Yan-hua LIU Xiang-hua WANG Guo-dong 《Journal of Iron and Steel Research International》 SCIE CAS CSCD 2005年第6期30-32,53,共4页
A new method of establishing rolling load distribution model was developed by online intelligent information-processing technology for plate rolling. The model combines knowledge model and mathematical model with usin... A new method of establishing rolling load distribution model was developed by online intelligent information-processing technology for plate rolling. The model combines knowledge model and mathematical model with using knowledge discovery in database (KDD) and data mining (DM) as the start. The online maintenance and optimization of the load model are realized. The effectiveness of this new method was testified by offline simulation and online application. 展开更多
关键词 rolling load distribution information processing knowledge discovery data mining
下载PDF
Web Page Recommendation Using Distributional Recurrent Neural Network
18
作者 Chaithra G.M.Lingaraju S.Jagannatha 《Computer Systems Science & Engineering》 SCIE EI 2023年第4期803-817,共15页
In the data retrieval process of the Data recommendation system,the matching prediction and similarity identification take place a major role in the ontology.In that,there are several methods to improve the retrieving... In the data retrieval process of the Data recommendation system,the matching prediction and similarity identification take place a major role in the ontology.In that,there are several methods to improve the retrieving process with improved accuracy and to reduce the searching time.Since,in the data recommendation system,this type of data searching becomes complex to search for the best matching for given query data and fails in the accuracy of the query recommendation process.To improve the performance of data validation,this paper proposed a novel model of data similarity estimation and clustering method to retrieve the relevant data with the best matching in the big data processing.In this paper advanced model of the Logarithmic Directionality Texture Pattern(LDTP)method with a Metaheuristic Pattern Searching(MPS)system was used to estimate the similarity between the query data in the entire database.The overall work was implemented for the application of the data recommendation process.These are all indexed and grouped as a cluster to form a paged format of database structure which can reduce the computation time while at the searching period.Also,with the help of a neural network,the relevancies of feature attributes in the database are predicted,and the matching index was sorted to provide the recommended data for given query data.This was achieved by using the Distributional Recurrent Neural Network(DRNN).This is an enhanced model of Neural Network technology to find the relevancy based on the correlation factor of the feature set.The training process of the DRNN classifier was carried out by estimating the correlation factor of the attributes of the dataset.These are formed as clusters and paged with proper indexing based on the MPS parameter of similarity metric.The overall performance of the proposed work can be evaluated by varying the size of the training database by 60%,70%,and 80%.The parameters that are considered for performance analysis are Precision,Recall,F1-score and the accuracy of data retrieval,the query recommendation output,and comparison with other state-of-art methods. 展开更多
关键词 ONTOLOGY data mining in big data logarithmic directionality texture pattern metaheuristic pattern searching system distributional recurrent neural network query recommendation
下载PDF
State Estimation of Distribution Network Considering Data Compatibility 被引量:1
19
作者 Shengtao Wu Yan Li 《Energy and Power Engineering》 2020年第4期73-83,共11页
Considering that the measurement devices of the distribution network are becoming more and more abundant, on the basis of the traditional Supervisory Control And Data Acquisition (SCADA) measurement system, Phasor mea... Considering that the measurement devices of the distribution network are becoming more and more abundant, on the basis of the traditional Supervisory Control And Data Acquisition (SCADA) measurement system, Phasor measurement unit (PMU) devices are also gradually applied to the distribution network. So when estimating the state of the distribution network, the above two devices need to be used. However, because the data of different measurement systems are different, it is necessary to balance this difference so that the data of different systems can be compatible to achieve the purpose of effective utilization of the estimated power distribution state. To this end, this paper starts with three aspects of data accuracy of the two measurement systems, data time section and data refresh frequency to eliminate the differences between system data, and then considers the actual situation of the three-phase asymmetry of the distribution network. The three-phase state estimation equations are constructed by the branch current method, and finally the state estimation results are solved by the weighted least square method. 展开更多
关键词 distribution Network STATE Estimation data Compatibility Branch CURRENT Method
下载PDF
Bayesian Study Using MCMC of Three-Parameter Frechet Distribution Based on Type-I Censored Data 被引量:2
20
作者 Al Omari Mohammed Ahmed 《Journal of Applied Mathematics and Physics》 2021年第2期220-232,共13页
Type-I censoring mechanism arises when the number of units experiencing the event is random but the total duration of the study is fixed. There are a number of mathematical approaches developed to handle this type of ... Type-I censoring mechanism arises when the number of units experiencing the event is random but the total duration of the study is fixed. There are a number of mathematical approaches developed to handle this type of data. The purpose of the research was to estimate the three parameters of the Frechet distribution via the frequentist Maximum Likelihood and the Bayesian Estimators. In this paper, the maximum likelihood method (MLE) is not available of the three parameters in the closed forms;therefore, it was solved by the numerical methods. Similarly, the Bayesian estimators are implemented using Jeffreys and gamma priors with two loss functions, which are: squared error loss function and Linear Exponential Loss Function (LINEX). The parameters of the Frechet distribution via Bayesian cannot be obtained analytically and therefore Markov Chain Monte Carlo is used, where the full conditional distribution for the three parameters is obtained via Metropolis-Hastings algorithm. Comparisons of the estimators are obtained using Mean Square Errors (MSE) to determine the best estimator of the three parameters of the Frechet distribution. The results show that the Bayesian estimation under Linear Exponential Loss Function based on Type-I censored data is a better estimator for all the parameter estimates when the value of the loss parameter is positive. 展开更多
关键词 Frechet distribution Bayesian Method Type-I Censored data Markov Chain Monte Carlo Metropolis-Hastings Algorithm
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部