期刊文献+
共找到4,352篇文章
< 1 2 218 >
每页显示 20 50 100
Incidence and Survivability of Acute Lymphocytic Leukemia Patients in the United States: Analysis of SEER Data Set from 2000-2019
1
作者 Ishan Ghosh Sudipto Mukherjee 《Journal of Cancer Therapy》 2024年第4期141-163,共23页
The main goal of this research is to assess the impact of race, age at diagnosis, sex, and phenotype on the incidence and survivability of acute lymphocytic leukemia (ALL) among patients in the United States. By takin... The main goal of this research is to assess the impact of race, age at diagnosis, sex, and phenotype on the incidence and survivability of acute lymphocytic leukemia (ALL) among patients in the United States. By taking these factors into account, the study aims to explore how existing cancer registry data can aid in the early detection and effective treatment of ALL in patients. Our hypothesis was that statistically significant correlations exist between race, age at which patients were diagnosed, sex, and phenotype of the ALL patients, and their rate of incidence and survivability data were evaluated using SEER*Stat statistical software from National Cancer Institute. Analysis of the incidence data revealed that a higher prevalence of ALL was among the Caucasian population. The majority of ALL cases (59%) occurred in patients aged between 0 to 19 years at the time of diagnosis, and 56% of the affected individuals were male. The B-cell phenotype was predominantly associated with ALL cases (73%). When analyzing survivability data, it was observed that the 5-year survival rates slightly exceeded the 10-year survival rates for the respective demographics. Survivability rates of African Americans patients were the lowest compared to Caucasian, Asian, Pacific Islanders, Alaskan Native, Native Americans and others. Survivability rates progressively decreased for older patients. Moreover, this study investigated the typical treatment methods applied to ALL patients, mainly comprising chemotherapy, with occasional supplementation of radiation therapy as required. The study demonstrated the considerable efficacy of chemotherapy in enhancing patients’ chances of survival, while those who remained untreated faced a less favorable prognosis from the disease. Although a significant amount of data and information exists, this study can help doctors in the future by diagnosing patients with certain characteristics. It will further assist the health care professionals in screening potential patients and early detection of cases. This could also save the lives of elderly patients who have a higher mortality rate from this disease. 展开更多
关键词 Acute Lymphocytic Leukemia SURVIVABILITY INCIDENCE DEMOGRAPHY SEER data set
下载PDF
Reconstruction of incomplete satellite SST data sets based on EOF method 被引量:2
2
作者 DING Youzhuan WEI Zhihui +2 位作者 MAO Zhihua WANG Xiaofei PAN Delu 《Acta Oceanologica Sinica》 SCIE CAS CSCD 2009年第2期36-44,共9页
As for the satellite remote sensing data obtained by the visible and infrared bands myers,on, the clouds coverage in the sky over the ocean often results in missing data of inversion products on a large scale, and thi... As for the satellite remote sensing data obtained by the visible and infrared bands myers,on, the clouds coverage in the sky over the ocean often results in missing data of inversion products on a large scale, and thin clouds difficult to be detected would cause the data of the inversion products to be abnormal. Alvera et a1.(2005) proposed a method for the reconstruction of missing data based on an Empirical Orthogonal Functions (EOF) decomposition, but his method couldn't process these images presenting extreme cloud coverage(more than 95%), and required a long time for recon- struction. Besides, the abnormal data in the images had a great effect on the reconstruction result. Therefore, this paper tries to improve the study result. It has reconstructed missing data sets by twice applying EOF decomposition method. Firstly, the abnormity time has been detected by analyzing the temporal modes of EOF decomposition, and the abnormal data have been eliminated. Secondly, the data sets, excluding the abnormal data, are analyzed by using EOF decomposition, and then the temporal modes undergo a filtering process so as to enhance the ability of reconstruct- ing the images which are of no or just a little data, by using EOF. At last, this method has been applied to a large data set, i.e. 43 Sea Surface Temperature (SST) satellite images of the Changjiang River (Yangtze River) estuary and its adjacent areas, and the total reconstruction root mean square error (RMSE) is 0.82℃. And it has been proved that this improved EOF reconstruction method is robust for reconstructing satellite missing data and unreliable data. 展开更多
关键词 EOF SST Changjiang River estuary Missing data sets
下载PDF
Evolution algorithm for water storage forecasting response to climate change with little data sets:the Wolonghu Wetland,China
3
作者 尼庆伟 叶人珍 +1 位作者 杨凤林 雷坤 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2011年第2期127-133,共7页
An attempt of applying a novel genetic programming(GP) technique,a new member of evolution algorithms,has been made to predict the water storage of Wolonghu wetland response to the climate change in northeastern part ... An attempt of applying a novel genetic programming(GP) technique,a new member of evolution algorithms,has been made to predict the water storage of Wolonghu wetland response to the climate change in northeastern part of China with little data set.Fourteen years(1993-2006) of annual water storage and climatic data set of the wetland were taken for model training and testing.The results of simulations and predictions illustrated a good fit between calculated water storage and observed values(MAPE=9.47,r=0.99).By comparison,a multilayer perceptron(MLP)(a popular artificial neural network model) method and a grey model(GM) with the same data set were applied for performances estimation.It was found that GP technique had better performances than the other two methods both in the simulation step and predicting phase and the results were analyzed and discussed.The case study confirmed that GP method is a promising way for wetland managers to make a quick estimation of fluctuations of water storage in some wetlands under condition of little data set. 展开更多
关键词 water storage little data set evolution algorism Wolonghu wetland
下载PDF
Robustness Evaluation of Remote-Sensing Image Feature Detectors with TH Priori-Information Data Set
4
作者 Yiping Duan Xiaoming Tao +1 位作者 Xijia Liu Ning Ge 《China Communications》 SCIE CSCD 2020年第10期218-228,共11页
In this paper,we build a remote-sensing satellite imagery priori-information data set,and propose an approach to evaluate the robustness of remote-sensing image feature detectors.The building TH Priori-Information(TPI... In this paper,we build a remote-sensing satellite imagery priori-information data set,and propose an approach to evaluate the robustness of remote-sensing image feature detectors.The building TH Priori-Information(TPI)data set with 2297 remote sensing images serves as a standardized high-resolution data set for studies related to remote-sensing image features.The TPI contains 1)raw and calibrated remote-sensing images with high spatial and temporal resolutions(up to 2 m and 7 days,respectively),and 2)a built-in 3-D target area model that supports view position,view angle,lighting,shadowing,and other transformations.Based on TPI,we further present a quantized approach,including the feature recurrence rate,the feature match score,and the weighted feature robustness score,to evaluate the robustness of remote-sensing image feature detectors.The quantized approach gives general and objective assessments of the robustness of feature detectors under complex remote-sensing circumstances.Three remote-sensing image feature detectors,including scale-invariant feature transform(SIFT),speeded up robust features(SURF),and priori information based robust features(PIRF),are evaluated using the proposed approach on the TPI data set.Experimental results show that the robustness of PIRF outperforms others by over 6.2%. 展开更多
关键词 REMOTE-SENSING TH data set image feature robustness evaluation
下载PDF
Scaling up Kernel Grower Clustering Method for Large Data Sets via Core-sets 被引量:2
5
作者 CHANG Liang DENG Xiao-Ming +1 位作者 ZHENG Sui-Wu WANG Yong-Qing 《自动化学报》 EI CSCD 北大核心 2008年第3期376-382,共7页
核栽培者是聚类最近 Camastra 和 Verri 建议的方法的一个新奇的核。它证明为各种各样的数据的好性能关于流行聚类的算法有利地设定并且比较。然而,方法的主要缺点是在处理大数据集合的弱可伸缩能力,它极大地限制它的应用程序。在这... 核栽培者是聚类最近 Camastra 和 Verri 建议的方法的一个新奇的核。它证明为各种各样的数据的好性能关于流行聚类的算法有利地设定并且比较。然而,方法的主要缺点是在处理大数据集合的弱可伸缩能力,它极大地限制它的应用程序。在这份报纸,我们用核心集合建议一个可伸缩起来的核栽培者方法,它是比为聚类的大数据的原来的方法显著地快的。同时,它能处理很大的数据集合。象合成数据集合一样的基准数据集合的数字实验显示出建议方法的效率。方法也被用于真实图象分割说明它的性能。 展开更多
关键词 大型数据集 图象分割 模式识别 磁心配置 核聚类
下载PDF
Threshold Selection Study on Fisher Discriminant Analysis Used in Exon Prediction for Unbalanced Data Sets
6
作者 Yutao Ma Yanbing Fang +1 位作者 Ping Liu Jianfu Teng 《Communications and Network》 2013年第3期601-605,共5页
In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same si... In gene prediction, the Fisher discriminant analysis (FDA) is used to separate protein coding region (exon) from non-coding regions (intron). Usually, the positive data set and the negative data set are of the same size if the number of the data is big enough. But for some situations the data are not sufficient or not equal, the threshold used in FDA may have important influence on prediction results. This paper presents a study on the selection of the threshold. The eigen value of each exon/intron sequence is computed using the Z-curve method with 69 variables. The experiments results suggest that the size and the standard deviation of the data sets and the threshold are the three key elements to be taken into consideration to improve the prediction results. 展开更多
关键词 FISHER DISCRIMINANT Analysis THRESHOLD Selection Gene PREDICTION Z-Curve Size of data set
下载PDF
Contrasting Vertical Structure of Recent Arctic Warming in Different Data Sets
7
作者 Igor Esau Vladimir Alexeev +1 位作者 Irina Repina Svetlana Sorokina 《Atmospheric and Climate Sciences》 2013年第1期1-5,共5页
Arctic region is experiencing strong warming and related changes in the state of sea ice, permafrost, tundra, marine environment and terrestrial ecosystems. These changes are found in any climatological data set compr... Arctic region is experiencing strong warming and related changes in the state of sea ice, permafrost, tundra, marine environment and terrestrial ecosystems. These changes are found in any climatological data set comprising the Arctic region. This study compares the temperature trends in several surface, satellite and reanalysis data sets. We demonstrate large differences in the 1979-2002 temperature trends. Data sets disagree on the magnitude of the trends as well as on their seasonal, zonal and vertical pattern. It was found that the surface temperature trends are stronger than the trends in the tropospheric temperature for each latitude band north of 50?N for each month except for the months during the ice-melting season. These results emphasize that the conclusions of climate studies drawn on the basis of a single data set analysis should be treated with caution as they may be affected by the artificial biases in data. 展开更多
关键词 ARCTIC WARMING data set Intercomparison ATMOSPHERIC VERTICAL Structure
下载PDF
Top-k probabilistic prevalent co-location mining in spatially uncertain data sets 被引量:4
8
作者 Lizhen WANG Jun HAN +1 位作者 Hongmei CHEN Junli LU 《Frontiers of Computer Science》 SCIE EI CSCD 2016年第3期488-503,共16页
A co-location pattern is a set of spatial features whose instances frequently appear in a spatial neighborhood. This paper efficiently mines the top-k probabilistic prevalent co-locations over spatially uncertain data... A co-location pattern is a set of spatial features whose instances frequently appear in a spatial neighborhood. This paper efficiently mines the top-k probabilistic prevalent co-locations over spatially uncertain data sets and makes the following contributions: 1) the concept of the top-k prob- abilistic prevalent co-locations based on a possible world model is defined; 2) a framework for discovering the top- k probabilistic prevalent co-locations is set up; 3) a matrix method is proposed to improve the computation of the preva- lence probability of a top-k candidate, and two pruning rules of the matrix block are given to accelerate the search for ex- act solutions; 4) a polynomial matrix is developed to further speed up the top-k candidate refinement process; 5) an ap- proximate algorithm with compensation factor is introduced so that relatively large quantity of data can be processed quickly. The efficiency of our proposed algorithms as well as the accuracy of the approximation algorithms is evaluated with an extensive set of experiments using both synthetic and real uncertain data sets. 展开更多
关键词 spatial co-location mining top-k probabilistic prevalent co-location mining spatially uncertain data sets matrix methods
原文传递
Characteristics of plankton Hg bioaccumulations based on a global data set and the implications for aquatic systems with aggravating nutrient imbalance 被引量:1
9
作者 Zhike Li Jie Chi +9 位作者 Zhenyu Wu Yiyan Zhang Yiran Liu Lanlan Huang Yiren Lu Minhaz Uddin Wei Zhang Xuejun Wang Yan Lin Yindong Tong 《Frontiers of Environmental Science & Engineering》 SCIE EI CSCD 2022年第3期121-133,共13页
The bioaccumulation of mercury(Hg)in aquatic ecosystem poses a potential health risk to human being and aquatic organism.Bioaccumulations by plankton represent a crucial process of Hg transfer from water to aquatic fo... The bioaccumulation of mercury(Hg)in aquatic ecosystem poses a potential health risk to human being and aquatic organism.Bioaccumulations by plankton represent a crucial process of Hg transfer from water to aquatic food chain.However,the current understanding of major factors affecting Hg accumulation by plankton is inadequate.In this study,a data set of 89 aquatic ecosystems worldwide,including inland water,nearshore water and open sea,was established.Key factors influencing plankton Hg bioaccumulation(i.e.,plankton species,cell sizes and biomasses)were discussed.The results indicated that total Hg(THg)and methylmercury(MeHg)concentrations in plankton in inland waters were significantly higher than those in nearshore waters and open seas.Bioaccumulation factors for the logarithm of THg and MeHg of phytoplankton were 2.4–6.0 and 2.6–6.7 L/kg,respectively,in all aquatic ecosystems.They could be further biomagnified by a factor of 2.1–15.1 and 5.3–28.2 from phytoplankton to zooplankton.Higher MeHg concentrations were observed with the increases of cell size for both phyto-and zooplankton.A contrasting trend was observed between the plankton biomasses and BAF_(MeHg),with a positive relationship for zooplankton and a negative relationship for phytoplankton.Plankton physiologic traits impose constraints on the rates of nutrients and contaminants obtaining process from water.Nowadays,many aquatic ecosystems are facing rapid shifts in nutrient compositions.We suggested that these potential influences on the growth and composition of plankton should be incorporated in future aquatic Hg modeling and ecological risk assessments. 展开更多
关键词 PLANKTON Hg bioaccumulation Physiological characteristics A cross-system analysis Nutrient compositions Global data set
原文传递
Data Set and Evaluation of Automated Construction of Financial Knowledge Graph 被引量:2
10
作者 Wenguang Wang Yonglin Xu +3 位作者 Chunhui Du Yunwen Chen Yijie Wang Hui Wen 《Data Intelligence》 2021年第3期418-443,共26页
With the technological development of entity extraction, relationship extraction, knowledge reasoning, and entity linking, the research on knowledge graph has been carried out in full swing in recent years. To better ... With the technological development of entity extraction, relationship extraction, knowledge reasoning, and entity linking, the research on knowledge graph has been carried out in full swing in recent years. To better promote the development of knowledge graph, especially in the Chinese language and in the financial industry, we built a high-quality data set, named financial research report knowledge graph(FR2 KG), and organized the automated construction of financial knowledge graph evaluation at the 2020 China Knowledge Graph and Semantic Computing Conference(CCKS2020). FR2 KG consists of 17,799 entities, 26,798 relationship triples, and 1,328 attribute triples covering 10 entity types, 19 relationship types, and 6 attributes. Participants are required to develop a constructor that will automatically construct a financial knowledge graph based on the FR2 KG. In addition, we summarized the technologies for automatically constructing knowledge graphs, and introduced the methods used by the winners and the results of this evaluation. 展开更多
关键词 Knowledge graph Entity extraction Relation extraction FR2KG data set CCKS
原文传递
Constructing Isosurfaces from 3D Data Sets Taking Account of Depth Sorting of Polyhedra
11
作者 周勇 唐泽圣 《Journal of Computer Science & Technology》 SCIE EI CSCD 1994年第2期117-127,共11页
Creating and rendering intermediate geometric primitives is one of the approaches to visualize data sets in 3D space. Some algorithms have been developed to construct isosurface from uniformly distributed 3D data sets... Creating and rendering intermediate geometric primitives is one of the approaches to visualize data sets in 3D space. Some algorithms have been developed to construct isosurface from uniformly distributed 3D data sets. These algorithms assume that the function value varies linearly along edges of each cell. But to irregular 3D data sets, this assumption is inapplicable. Moreover, the depth sorting of cells is more complicated for irregular data sets, which is indispensable for generating isosurface images or semitransparent isosurface images, if Z-buffer method is not adopted.In this paper, isosurface models based on the assumption that the function value has nonlinear distribution within a tetrahedroll are proposed. The depth sorting algorithm and data structures are developed for the irregular data sets in which cells may be subdivided into tetrahedra. The implementation issues of this algorithm are discussed and experimental results are shown to illustrate potentials of this technique. 展开更多
关键词 ISOSURFACE 3D data sets depth sorting POLYHEDRA
原文传递
AOL4PS:A Large-scale Data Set for Personalized Search
12
作者 Qian Guo Wei Chen Huaiyu Wan 《Data Intelligence》 EI 2021年第4期548-567,共20页
Personalized search is a promising way to improve the quality of Websearch,and it has attracted much attention from both academic and industrial communities.Much of the current related research is based on commercial ... Personalized search is a promising way to improve the quality of Websearch,and it has attracted much attention from both academic and industrial communities.Much of the current related research is based on commercial search engine data,which can not be released publicly for such reasons as privacy protection and information security.This leads to a serious lack of accessible public data sets in this field.The few publicly available data sets have not become widely used in academia because of the complexity of the processing process required to study personalized search methods.The lack of data sets together with the difficulties of data processing has brought obstacles to fair comparison and evaluation of personalized search models.In this paper,we constructed a large-scale data set AOL4 PS to evaluate personalized search methods,collected and processed from AOL query logs.We present the complete and detailed data processing and construction process.Specifically,to address the challenges of processing time and storage space demands brought by massive data volumes,we optimized the process of data set construction and proposed an improved BM25 algorithm.Experiments are performed on AOL4 PS with some classic and state-of-the-art personalized search methods,and the experiment results demonstrate that AOL4 PS can measure the effect of personalized search models. 展开更多
关键词 Personalized search Text data processing data set construction
原文传递
A dataset of scientific literature on floods,1990-2017
13
作者 Zhang Hongyue Li Guoqing +2 位作者 Huang Mingrui Qing Xiuling Zhang Huarong 《中国科学数据(中英文网络版)》 CSCD 2018年第3期76-85,共10页
With an increasing number of scientific achievements published,it is particularly important to conduct literature-based knowledge discovery and data mining.Flood,as one of the most destructive natural disasters,has be... With an increasing number of scientific achievements published,it is particularly important to conduct literature-based knowledge discovery and data mining.Flood,as one of the most destructive natural disasters,has been the subject of numerous scientific publications.On January 1,2018,we conducted literature data collection and processing on flood research and categorized the retrieved paper records into Whole SCI Dataset(WS)and High-Citation SCI Dataset(HCS).These data sets can serve as basic data for bibliometric analysis to identify the status of global flood research during 1990-2017.Our study shows that while the Chinese Academy of Sciences was the most productive institution during this period,the United States was the most productive country.Besides,our keyword analysis reveals the potential popular issues and future trends of flood research. 展开更多
关键词 literature data sets FLOOD WS HCS
下载PDF
An RDF Data Set Quality Assessment Mechanism for Decentralized Systems
14
作者 Li Huang Zhenzhen Liu +1 位作者 Fangfang Xu Jinguang Gu 《Data Intelligence》 2020年第4期529-553,共25页
With the rapid growth of the linked data on the Web,the quality assessment of the RDF data set becomes particularly important,especially for the quality and accessibility of the linked data.In most cases,RDF data sets... With the rapid growth of the linked data on the Web,the quality assessment of the RDF data set becomes particularly important,especially for the quality and accessibility of the linked data.In most cases,RDF data sets are shared online,leading to a high maintenance cost for the quality assessment.This also potentially pollutes Internet data.Recently blockchain technology has shown the potential in many applications.Using the blockchain storage quality assessment results can reduce the centralization of the authority,and the quality assessment results have characteristics such as non-tampering.To this end,we propose an RDF data quality assessment model in a decentralized environment,pointing out a new dimension of RDF data quality.We use the blockchain to record the data quality assessment results and design a detailed update strategy for the quality assessment results.We have implemented a system DCQA to test and verify the feasibility of the quality assessment model.The proposed method can provide users with better cost-effective results when knowledge is independently protected. 展开更多
关键词 DECENTRALIZATION Quality assessment Blockchain RDF data set
原文传递
An Evaluation of the Reliability of Complex Systems Using Shadowed Sets and Fuzzy Lifetime Data 被引量:3
15
作者 Olgierd Hryniewicz 《International Journal of Automation and computing》 EI 2006年第2期145-150,共6页
In this paper, we consider the problem of the evaluation of system reliability using statistical data obtained from reliability tests of its elements, in which the lifetimes of elements are described using an exponent... In this paper, we consider the problem of the evaluation of system reliability using statistical data obtained from reliability tests of its elements, in which the lifetimes of elements are described using an exponential distribution. We assume that this lifetime data may be reported imprecisely and that this lack of precision may be described using fuzzy sets. As the direct application of the fuzzy sets methodology leads in this case to very complicated and time consuming calculations, we propose simple approximations of fuzzy numbers using shadowed sets introduced by Pedrycz (1998). The proposed methodology may be simply extended to the case of general lifetime probability distributions. 展开更多
关键词 Estimation of reliability fuzzy reliability data shadowed sets.
下载PDF
Fluctuation Analysis of Decoy State QKD with Finite Data-Set Size 被引量:1
16
作者 唐少杰 焦荣珍 《Communications in Theoretical Physics》 SCIE CAS CSCD 2010年第9期443-446,共4页
Decoy state method quantum key distribution (QKD) is one of the promising practical solutions for BB84QKD with coherent light pulses.The number of data-set size in practical QKD protocol is always finite,which will ca... Decoy state method quantum key distribution (QKD) is one of the promising practical solutions for BB84QKD with coherent light pulses.The number of data-set size in practical QKD protocol is always finite,which will causestatistical fluctuations.In this paper,we apply absolutely statistical fluctuation to amend the yield and error rate of thequantum state.The relationship between exchanged number of quantum signals and key generation rate is analyzed inour simulation,which offers a useful reference for experiment. 展开更多
关键词 量子密钥分发 波动分析 数据集 诱骗 量子密钥分配协议 BB84协议 量子密码 模拟分析
下载PDF
一个基于现实世界的大型Web参照数据集——UK2006 Datasets的初步研究
17
作者 曾刚 李宏 《企业技术开发》 2009年第5期16-17,31,共3页
文章介绍了WEBSPAM-UK2006数据集,一个大型的基于现实世界的,人工评判过一些垃圾行为的web数据集合,详细的对数据集的构成进行了分析,对数据集采用Python进行了初步的预处理,为以后在反垃圾网页行为方面的算法和判定研究提供了非常有意... 文章介绍了WEBSPAM-UK2006数据集,一个大型的基于现实世界的,人工评判过一些垃圾行为的web数据集合,详细的对数据集的构成进行了分析,对数据集采用Python进行了初步的预处理,为以后在反垃圾网页行为方面的算法和判定研究提供了非常有意的经验和参考。 展开更多
关键词 搜索引擎作弊 Web数据集 链接分析 Web图
下载PDF
Oil-gas reservoir in the Mesozoic strata in the Chaoshan depression,northern South China Sea:a new insight from long off set seismic data 被引量:1
18
作者 Tao XING Guangjian ZHONG +2 位作者 Wenhuan ZHAN Zhongquan ZHAO Xi CHEN 《Journal of Oceanology and Limnology》 SCIE CAS CSCD 2022年第4期1377-1387,共11页
The Chaoshan depression,a Mesozoic basin in the Dongsha sea area,northern South China Sea,is characterized by well-preserved Mesozoic strata,being good conditions for oil-gas preservation,promising good prospects for ... The Chaoshan depression,a Mesozoic basin in the Dongsha sea area,northern South China Sea,is characterized by well-preserved Mesozoic strata,being good conditions for oil-gas preservation,promising good prospects for oil-gas exploration.However,breakthrough in oil-gas exploration in the Mesozoic strata has not been achieved due to less seismic surveys.New long-off set seismic data were processed that acquired with dense grid with single source and single cable.In addition,the data were processed with 3D imaging method and fi ner processing was performed to highlight the target strata.Combining the new imaging result and other geological information,we conducted integrated interpretation and proposed an exploratory well A-1-1 for potential hydrocarbon.The result provides a reliable basis for achieving breakthroughs in oil and gas exploration in the Mesozoic strata in the northern South China Sea. 展开更多
关键词 Chaoshan depression Mesozoic strata oil and gas exploration long off set seismic data integrated interpretation exploratory well
下载PDF
Frequent item sets mining from high-dimensional dataset based on a novel binary particle swarm optimization 被引量:2
19
作者 张中杰 黄健 卫莹 《Journal of Central South University》 SCIE EI CAS CSCD 2016年第7期1700-1708,共9页
A novel binary particle swarm optimization for frequent item sets mining from high-dimensional dataset(BPSO-HD) was proposed, where two improvements were joined. Firstly, the dimensionality reduction of initial partic... A novel binary particle swarm optimization for frequent item sets mining from high-dimensional dataset(BPSO-HD) was proposed, where two improvements were joined. Firstly, the dimensionality reduction of initial particles was designed to ensure the reasonable initial fitness, and then, the dynamically dimensionality cutting of dataset was built to decrease the search space. Based on four high-dimensional datasets, BPSO-HD was compared with Apriori to test its reliability, and was compared with the ordinary BPSO and quantum swarm evolutionary(QSE) to prove its advantages. The experiments show that the results given by BPSO-HD is reliable and better than the results generated by BPSO and QSE. 展开更多
关键词 粒子群算法 频繁项集 数据集 二进制 挖掘 高维 APRIORI 初始粒子
下载PDF
Traffic Flow Data Forecasting Based on Interval Type-2 Fuzzy Sets Theory 被引量:4
20
作者 Runmei Li Chaoyang Jiang +1 位作者 Fenghua Zhu Xiaolong Chen 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI 2016年第2期141-148,共8页
This paper proposes a long-term forecasting scheme and implementation method based on the interval type-2 fuzzy sets theory for traffic flow data. The type-2 fuzzy sets have advantages in modeling uncertainties becaus... This paper proposes a long-term forecasting scheme and implementation method based on the interval type-2 fuzzy sets theory for traffic flow data. The type-2 fuzzy sets have advantages in modeling uncertainties because their membership functions are fuzzy. The scheme includes traffic flow data preprocessing module, type-2 fuzzification operation module and long-term traffic flow data forecasting output module, in which the Interval Approach acts as the core algorithm. The central limit theorem is adopted to convert point data of mass traffic flow in some time range into interval data of the same time range(also called confidence interval data) which is being used as the input of interval approach. The confidence interval data retain the uncertainty and randomness of traffic flow, meanwhile reduce the influence of noise from the detection data. The proposed scheme gets not only the traffic flow forecasting result but also can show the possible range of traffic flow variation with high precision using upper and lower limit forecasting result. The effectiveness of the proposed scheme is verified using the actual sample application. 展开更多
关键词 Interval type-2 fuzzy sets central limit theorem confidence interval long-term prediction
下载PDF
上一页 1 2 218 下一页 到第
使用帮助 返回顶部