具有混合奖惩信号的脉冲时间依赖可塑性算法

Spiking timing dependent plasticity algorithm with mixed reward-modulated signals

下载PDF

导出

摘要近年来,具有生理学基础的脉冲时间依赖可塑性(Spiking Timing-Dependent Plasticity,STDP)规则在脉冲神经网络中得到了越来越多的应用.由STDP规则和奖惩机制相结合的R-STDP(reward-modulated STDP)学习算法在改善脉冲神经网络的性能上有良好的效果.但R-STDP算法在训练多层脉冲神经网络时,仍存在反馈信号仅作用于网络末层、中间层无法获得有用奖惩信号.为此,利用自编码器的无监督特性,提出一种具有混合奖惩信号的MR-STDP(Mix Reward-modulated STDP)算法.在中间层中增加重构层以够建基于卷积自编码器的奖惩信号因子模型,通过比较卷积层和重构层的神经元脉冲发放时间,获取中间层网络权重调整的指导因子信号.指导因子信号是对比层间自编码器的输入层与重构层的相同位置神经元所发放的脉冲序列相似性度量指标,并将其与R-STDP相结合,使得中间层能够获得权重指导信号.在MNIST和COVID-19 CT数据集上的实验结果表明,该方法取得了比R-STDP更高的精度,且中间层网络的学习效率大幅提高. In recent years,Spiking Timing-Dependent Plasticity(STDP)rules with physiological basis have been applied more and more in spiking neural networks.The R-STDP(reward-modulated STDP)learning algorithm combining STDP with the reinforcement learning reward modulation embraces great effect on improving the performance of SNN.However,the feedback only reflects on the last layer of spiking deep convolutional neural networks as the R-STDP algorithm works,which means the middle layer cannot get feedback.Inspired by the unsupervised characteristics of the Auto-Encoder,a mix reward-modulatedSTDP(MR-STDP)algorithm with mixed reward/punishment signal was proposed.In this algorithm,the reconstruction layer was added to the middle layer to establish the rewards/punishment signal factor model.The guiding factor signal is the similarity measure of spiking sequences issued by the neurons at the same position of the input layer of the interlayer autoencoder and the reconstruction layer,and it is combined with R-STDP,so that the middle layer can obtain the weight guiding signal.Experiments on MNIST and COVID-19 CT data sets shows that the proposed method achieves higher accuracy than R-STDP,and the efficiency of learning in middle layer is greatly improved.

作者陈运享冯忍陈云华 CHEN Yunxiang;FENG Ren;CHEN Yunhua(School of Computing,Guangdong University of Technology,Guangdong 510006,China)

机构地区广东工业大学计算机学院

出处《微电子学与计算机》 2022年第9期20-25,共6页 Microelectronics & Computer

基金广东省自然科学基金(2021A1515012233)。

关键词脉冲神经网络脉冲时间依赖可塑性图像识别卷积自编码器 Spiking neural network STDP Image recognition Convolutional autoencoder

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1肖云开,邹承明.基于FPGA的脉冲神经网络模型设计与实现[J].微电子学与计算机,2022,39(9):73-79.
2夏鸿斌,肖奕飞,刘渊.融合自注意力机制的长文本生成对抗网络模型[J].计算机科学与探索,2022,16(7):1603-1610. 被引量：1
3王燕,范林,赵妮妮.利用门控网络构建用户动态兴趣的序列推荐模型[J].计算机工程,2022,48(8):283-291. 被引量：6
4王蔚.基于有序加权和熵权的信息安全风险评估[J].项目管理技术,2022,20(5):55-59. 被引量：8
5李丁园,李晓杰.基于多尺度残差卷积自编码器的图像聚类方法[J].吉林大学学报（信息科学版）,2022,40(4):684-687. 被引量：2
6徐旸,王佳斌,彭凯.结合PCA的t-SNE算法的并行化实现方法[J].华侨大学学报（自然科学版）,2022,43(5):685-692.
7Adriana C.Ribeiro,Margarida Catalão-Lopes,Ana S.Costa.Corporate Social Responsibility and Consumers’ Reaction: An Experiment[J].Journal of Sustainable Business and Economics,2022,5(3):1-11.
8Xuan Wang,Minghong Zhong,Hoiyuen Cheng,Junjie Xie,Yingchu Zhou,Jun Ren,Mengyuan Liu.SpikeGoogle:Spiking Neural Networks with GoogLeNet-like inception module[J].CAAI Transactions on Intelligence Technology,2022,7(3):492-502.
9孔锐,庄峻贤,梁冠烨.基于交替训练融合模型的COVID-19的CT影像辅助诊断[J].暨南大学学报（自然科学与医学版）,2022,43(4):432-440.
10M.S.Joshaghani,K.B.Nakshatrala.A Modeling Framework for Coupling Plasticity with Species Diffusion[J].Communications in Computational Physics,2022,32(6):83-125.

微电子学与计算机

2022年第9期

浏览历史

内容加载中请稍等...

具有混合奖惩信号的脉冲时间依赖可塑性算法

相关作者

相关机构

相关主题

浏览历史