3 Al-Ayyoub A E, Masoud F A. Heuristic search revisited[J]. The Journal of Systems and Software, 2000, 55.
4 Muller M. Partial order bounding: A new approach to evaluation in game tree search[J]. Artificial Intelligence, 2001, 129.
6 Crites R H, Barto A G. Elevator group control using multiple reinforcement learning agents[J]. Machine Learning, 1998, 33(2): 235-262.
7 Moore A. Variable resolution dynamic programming: Efficiently learning action maps in the real valued spaces[A]. In Proceedings of the 8th International Machine Learning[C]. Williamstown, Massachusetts, USA: [s.n.], 1991. 333-337.