7[1]Khatib O.Real-time obstacle avoidance formanipulators and mobile robot[J].The InternationalJournal of Robotic Research.1986,5(1):90~98.
8[2]M Gemeinder,M Gerke.GA-based Path Planning forRobot System Employing an Active Search Algorithm[J].Applied Soft Computing,2003.3:149~158.
9[5]Sutton R S,Barto A G Reinforcement Learning:AnIntroduction[M].Cambridge,MA:MIT Press,1998.
10[6]Miyazaki K,Yamamura M,Kobayashi S.On therationality of profit sharing in reinforcement learning[A].Proc of the 3rd International Conference on FuzzyLogic Neural Net and Soft computing,1994.285~288.