摘要
针对单细胞转录组数据上细胞分类准确率较低的问题,提出一种新的细胞集成分类算法.该方法能充分利用不同分类模型的优点,降低单细胞数据的分类误差.分别在慢性粒细胞白血病单细胞测序数据和三阴性乳腺癌单细胞测序数据两个不同数据集上进行实验验证,实验结果表明,由集成算法划分的细胞分类更清晰准确,验证了该算法的有效性.
Aiming at the problem of low accuracy of the cell classification of single cell transcriptome data,we proposed a novel cell ensemble classification algorithm.The algorithm could make full use of advantages of different classification models and reduce the classification error of single cell data.The experimental results on a chronic myeloid leukemia data and a triple-negative breast cancer data show that the cell classification based on the ensemble algorithm is more clear and accurate,which verifies the effectiveness of the proposed algorithm.
作者
刘桂锋
于绍楠
崔璐
LIU Guifeng;YU Shaonan;CUI Lu(Department of Radiology,China-Japan Union Hospital of Jilin University,Changchun 130033,China;Department of Medical Insurance,China-Japan Union Hospital of Jilin University,Changchun 130033,China)
出处
《吉林大学学报(理学版)》
CAS
北大核心
2021年第5期1252-1255,共4页
Journal of Jilin University:Science Edition
基金
国家自然科学基金(批准号:21475126,80151459)
吉林省科技发展计划项目(批准号:20190701052GH)
吴阶平医学基金会项目(批准号:320675019089-40,320675019089-38).
关键词
单细胞转录组
集成分类模型
K-近邻算法
支持向量机
single cell transcriptome
ensemble classification model
k-nearest neighbor algorithm
support vector machine