4 articles found
The First Acquaintance of Zhang Zhidong and Kang Youwei, the Shanghai Qiangxue Hui, and Qiangxue Bao (cited by: 3)
1
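Author: Mao Haijian. Journal of East China Normal University (Philosophy and Social Sciences Edition), CSSCI, PKU Core, 2013, No. 1, pp. 1-10, 151 (10 pages)
In the autumn of Guangxu 21 (1895), Zhang Zhidong and Kang Youwei first came into contact in Nanjing. Zhang was then acting Governor-General of Liangjiang on account of the Sino-Japanese War and opposed the peace negotiations pursued by Li Hongzhang and others. Kang had just gained fame for launching the "Gongche Shangshu" petition and for his memorials to the Guangxu Emperor. The go-between was Liang Dingfen, a member of Zhang Zhidong's private staff, and Huang Shaoji also took part in the activities that followed. After the two met, Zhang Zhidong supported Kang Youwei in running the Shanghai Qiangxue Hui and in launching Qiangxue Bao. Kang's scholarly positions and political views differed greatly from Zhang's, and in running Qiangxue Bao Kang held to his own views and broke with Zhang Zhidong's faction. This was the turning point in the relationship between Zhang and Kang, and it shows how scholarly disagreement led to the political rupture between them. The "Zhang Zhidong Archives" held in the library of the Institute of Modern History, Chinese Academy of Social Sciences, contain a batch of new historical materials that fill in the details of this process.
Keywords: Zhang Zhidong; Kang Youwei; Qiangxue Bao; Liang Dingfen; Huang Zunxian; Huang Shaoji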
Incremental Multi-Step R-Learning
2
Authors: Hu Guanghua, Wu Cangpu. Journal of Beijing Institute of Technology, EI, CAS, 1999, No. 3, pp. 245-250 (6 pages)
Aim: To investigate a model-free multi-step average-reward reinforcement learning algorithm. Methods: By combining R-learning with temporal difference learning (TD(λ) learning) for average-reward problems, a novel incremental algorithm, called R(λ)-learning, was proposed. Results and Conclusion: The proposed algorithm is a natural extension of Q(λ)-learning, the multi-step discounted-reward reinforcement learning algorithm, to the average-reward case. Simulation results show that R(λ)-learning with intermediate λ values performs significantly better than simple R-learning.
Keywords: reinforcement learning; average reward; R-learning; Markov decision processes; temporal difference learning
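The abstract above does not give the update rule, so the following is only a minimal sketch of what a tabular R-learning update with eligibility traces might look like. It assumes Schwartz-style R-learning combined with Watkins-style trace cutting on exploratory actions; the function name `r_lambda_learning`, the toy environment `toy_step`, and all parameter values are hypothetical and are not taken from the paper.

```python
import random


def toy_step(state, action, rng):
    """Hypothetical 2-state task: action 0 stays (reward 1 only in state 1),
    action 1 switches state (reward 0). rng is unused but kept for generality."""
    if action == 0:
        return state, (1.0 if state == 1 else 0.0)
    return 1 - state, 0.0


def r_lambda_learning(step, n_states, n_actions, n_steps=5000,
                      alpha=0.1, beta=0.01, lam=0.5, epsilon=0.1, seed=0):
    """Tabular average-reward learning with accumulating eligibility traces.

    Assumption: this mirrors Watkins's Q(lambda) with the discount removed and
    the reward replaced by (reward - rho); it is not the paper's exact rule.
    """
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    trace = [[0.0] * n_actions for _ in range(n_states)]
    rho = 0.0  # running estimate of the average reward per step
    state = 0
    for _ in range(n_steps):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: q[state][a])
        is_greedy = q[state][action] == max(q[state])
        next_state, reward = step(state, action, rng)

        # Average-adjusted temporal difference error.
        delta = reward - rho + max(q[next_state]) - q[state][action]
        if is_greedy:
            # The average reward is updated only after greedy actions.
            rho += beta * delta
        else:
            # An exploratory action breaks the greedy trajectory: cut traces.
            trace = [[0.0] * n_actions for _ in range(n_states)]

        trace[state][action] += 1.0
        for s in range(n_states):
            for a in range(n_actions):
                q[s][a] += alpha * delta * trace[s][a]
                trace[s][a] *= lam  # decay by lambda only (no discount factor)
        state = next_state
    return q, rho
```

Setting `lam=0.0` reduces the sketch to one-step R-learning, which is the baseline the abstract compares against.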
The Cooperative Multi-agent Learning with Random Reward Values
3
Authors: Zhang Huaxiang, Huang Shangteng. Journal of Shanghai Jiaotong University (Science), EI, 2005, No. 2, pp. 147-150 (4 pages)
This paper investigates how to learn optimal action policies in cooperative multi-agent systems when the agents' rewards are random variables, and proposes a general two-stage learning algorithm for cooperative multi-agent decision processes. The algorithm first calculates the averaged immediate rewards and then treats these learned rewards as the agents' immediate action rewards in order to learn the optimal action policies. It is proved that the learning algorithm can find the optimal policies in a stochastic environment. Extending the algorithm to stochastic Markov decision processes is also discussed.
Keywords: reinforcement learning; game; random reward
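The two-stage idea in this abstract can be illustrated on a single-step cooperative game. The sketch below is an assumption-laden simplification: stage one averages sampled rewards for every joint action, and stage two simply takes the joint action with the highest averaged reward, which stands in for whatever policy-learning step the paper actually uses. The names `estimate_mean_rewards`, `best_joint_action`, and `noisy_team_reward`, and the payoff values, are hypothetical.

```python
import itertools
import random


def noisy_team_reward(joint, rng):
    """Hypothetical shared payoff for two agents with two actions each,
    perturbed by uniform noise so that single samples are unreliable."""
    base = {(0, 0): 1.0, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 2.0}[joint]
    return base + rng.uniform(-1.0, 1.0)


def estimate_mean_rewards(sample_reward, action_counts,
                          samples_per_joint=200, seed=0):
    """Stage 1: average the random reward observed for each joint action."""
    rng = random.Random(seed)
    means = {}
    for joint in itertools.product(*(range(n) for n in action_counts)):
        total = sum(sample_reward(joint, rng) for _ in range(samples_per_joint))
        means[joint] = total / samples_per_joint
    return means


def best_joint_action(means):
    """Stage 2 (simplified): treat the averages as deterministic rewards
    and pick the joint action that maximizes them."""
    return max(means, key=means.get)


means = estimate_mean_rewards(noisy_team_reward, action_counts=(2, 2))
print(best_joint_action(means))
```

Separating the averaging from the policy choice means the second stage works on a fixed reward table, so any deterministic-reward method could be substituted there.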
SURGICAL TREATMENT OF BLUNT CARDIAC TRAUMA IN CHILDREN: REPORT OF 2 CASES AND REVIEW OF LITERATURES
4
Authors: Zhu Hongbin, Su Zhaohang (+1 more author), Ding Wenxiang, Zheng Jinghao. Journal of Shanghai Second Medical University (Foreign Language Edition), 2005, No. 1, pp. 48-51 (4 pages)
Objective: To summarize the clinical experience of surgical treatment in 2 cases of blunt cardiac trauma and to review the relevant literature. Methods: A 6-year-old girl was diagnosed with a muscular ventricular septal defect and a left ventricular aneurysm 2 d after an automobile accident and underwent ventricular septal defect repair 2 weeks after injury. A 9-year-old boy was diagnosed with severe mitral regurgitation resulting from rupture of the posterior papillary muscle 9 d after an automobile accident and underwent mitral valvuloplasty 2 weeks after injury. Results: Heart function of the first patient was in New York Heart Association (NYHA) class […]; echocardiography showed no residual septal defect, and the left ventricular aneurysm had decreased in size. Heart function of the second patient was in NYHA class […]; echocardiography showed mild mitral regurgitation. Conclusion: Blunt traumatic heart disease results from compression of the heart between the sternum and the spine and/or from myocardial contusion. A more aggressive strategy of early surgical treatment, before heart function deteriorates, is advocated. Early surgical correction of the anatomic deformity achieves a good result, and long-term follow-up is necessary.
Keywords: trauma; heart disease; surgical treatment