A new distributed feature selection technique for classifying gene expression data

导出

摘要 Classification of gene expression data is a pivotal research area that plays a substantial role in diagnosis and prediction of diseases. Generally, feature selection is one of the extensively used techniques in data mining approaches, especially in classification. Gene expression data are usually composed of dozens of samples characterized by thousands of genes. This increases the dimensionality coupled with the existence of irrelevant and redundant features. Accordingly, the selection of informative genes (features) becomes difficult, which badly affects the gene classification accuracy. In this paper, we consider the feature selection for classifying gene expression microarray datasets. The goal is to detect the most possibly cancer-related genes in a distributed manner, which helps in effectively classifying the samples. Initially, the available huge amount of considered features are subdivided and distributed among several processors. Then, a new filter selection method based on a fuzzy inference system is applied to each subset of the dataset. Finally, all the resulted features are ranked, then a wrapper-based selection method is applied. Experimental results showed that our proposed feature selection technique performs better than other techniques since it produces lower time latency and improves classification performance.

作者 Sarah M.Ayyad Ahmed I.Saleh Labib M.Labib

机构地区 Computers and Systems Department

出处《International Journal of Biomathematics》 SCIE 2019年第4期79-109,共31页 生物数学学报（英文版）

关键词 Feature selection gene expression dimensionality reduction MICROARRAY data CLASSIFICATION DISTRIBUTED learning MATHEMATICS Subject CLASSIFICATION

分类号 G [文化科学]

引文网络
相关文献

1黄积才.略议STEM教育[J].教育,2019,0(31):11-11.
2The Award for Interdisciplinary Excellence in Mathematics Education[J].数学教育学报,2019,28(4):91-91.
3Information for authors[J].Science China Mathematics,2019,62(9):1851-1851.
4王儒州.中学科学教育中如何体现STEM教育[J].下一代,2019,0(8):0123-0123.
5陈杜梨,唐莉.世界第一位程序员——阿达·洛夫莱斯[J].语数外学习（高中版）（中）,2019,0(6):61-65.
6Maurizio Marchi.Nonlinear versus linearised model on stand density model fitting and stand density index calculation: analysis of coefficients estimation via simulation[J].Journal of Forestry Research,2019,30(5):1595-1602. 被引量：4
7蔡晓容.未来的小学数学教学由STEAM教育来起调[J].师资建设,2019,32(10):78-80.
8Zingiswa Jojo.Creating an Environment for the Restoration of Dignity to Disadvantaged Mathematics Foundation Classrooms[J].Environment & Social Psychology,2019,4(1):7-16.
9刘金城(译).美国北爱荷华大学为学生举办3D打印学习营[J].铸造,2019,68(8):960-960.
10金瑶,张锐,尹东.城市道路视频中小像素目标检测[J].光电工程,2019,46(9):74-81. 被引量：15

International Journal of Biomathematics

2019年第4期

浏览历史

内容加载中请稍等...

A new distributed feature selection technique for classifying gene expression data

相关作者

相关机构

相关主题

浏览历史