期刊文献+

基于生成模型的Q-learning二分类算法 被引量:1

Q-learning algorithm based on generative model for binary classification
下载PDF
导出
摘要 对于二分类问题,基于判别模型的分类器一般都是寻找一条最优判决边界,容易受到数据波动的影响。针对该问题提出一种基于生成模型的Q-learning二分类算法(BGQ-learning),将状态和动作分开编码,得到对应各类的判决函数,增加了决策空间的灵活性,同时在求解参数时,采用最小二乘时序差分(TD)算法和半梯度下降法的组合优化方法,加速了参数的收敛速度。设计实验对比了BGQ-learning算法与三种经典分类器以及一种新颖的分类器的分类性能,在UCI数据库七个数据集上的测试结果表明,该算法有着优良的稳定性以及良好的分类精确度。 For binary classification problems,the classifier based on the discriminant model usually searches for an optimal decision boundary,which is susceptible to data fluctuations.This paper proposed a Q-learning algorithm based on the generative model for binary classification(BGQ-learning),which coded state and action separately and obtained corresponding decision functions,increasing the flexibility of decision space.And then it combined least squares temporal-difference(TD)algorithm and semi-gradient descent for parameter optimization,accelerating parameter convergence speed.This paper designed experiments to compare the performance of the proposed algorithm with three classical classifiers and a novel classifier.The test results on 7 data sets of the UCI database show that the proposed algorithm has excellent stability and classification accuracy.
作者 尚志刚 徐若灏 乔康加 杨莉芳 李蒙蒙 Shang Zhigang;Xu Ruohao;Qiao Kangjia;Yang Lifang;Li Mengmeng(School of Electrical Engineering,Zhengzhou University,Zhengzhou 450001,China;Henan Key Laboratory of Brain Science&Brain-Computer Interface Technology,Zhengzhou 450001,China)
出处 《计算机应用研究》 CSCD 北大核心 2020年第11期3326-3329,3333,共5页 Application Research of Computers
基金 国家自然科学基金资助项目(U1304602)。
关键词 Q-LEARNING 生成模型 二分类 最小二乘时序差分算法 半梯度下降法 Q-learning generative model binary classification least squares temporal-difference algorithm semi-gradient descent
  • 相关文献

参考文献1

二级参考文献2

共引文献40

同被引文献12

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部