[2] Schmidhuber Juergen. On learning how to learn learning strategies[J]. 1995.
[3] Perlich C, Dalessandro B, Raeder T, et al. Machine learning for targeted display advertising: Transfer learning in action[J]. Machine Learning, 2013: 1--25.
[4] Torrey Lisa, Shavlik Jude, Walker Trevor, et al. Relational skill transfer via advice taking[C]. Proceedings of ICML Workshop on Structural Knowledge Transfer for Machine Learning, 2006.
Secondary references (13)
[1] Sutton R S. Learning to predict by the methods of temporal differences[J]. Machine Learning, 1988, (3): 9--44.
[3] Sutton R S. Temporal credit assignment in reinforcement learning[D]. University of Massachusetts, Amherst, MA, 1984.
[4] Masayuki Yamamura, Takashi Onozuka. Reinforcement learning with knowledge by using a stochastic gradient method on a Bayesian network[A]. Proceedings of the 1998 IEEE International Conference on Neural Networks[C]. May 4--9, 1998. Anchorage, Alaska, USA: 2045--2050.
[5] Carlos H C Ribeiro. Embedding a priori knowledge in reinforcement learning[J]. Journal of Intelligent and Robotic Systems, 1998, 21: 51--71.
[6] Chi-Hyon Oh, Tomoharu Nakashima, Hisao Ishibuchi. Initialization of Q-values by fuzzy rules for accelerating Q-learning[A]. Proceedings of the 1998 IEEE International Conference on Neural Networks[C]. May 4--9, 1998. Anchorage, Alaska, USA: 2051--2056.
[7] Dean F Hougen, Maria Gini, James Slagle. Partitioning input space for reinforcement learning for control. Proceedings of the 1997 IEEE International Congress on Neural Networks. June 9--12, 1997. Houston, TX, USA: 755--760.
[8] Yoshikazu Arai, Teruo Fujii, Hajime Asama, Yasushi Kataoka. Multilayered reinforcement learning for complicated collision avoidance problems[A]. Proceedings of the 1998 IEEE International Conference on Robotics & Automation[C]. May 16--20, 1998. Leuven, Belgium: 2186--2191.
[9] John W Sheppard. Colearning in differential games[J]. Machine Learning, 1998, (33): 201--233.
[10] Michael L Littman. Markov games as a framework for multiagent reinforcement learning[A]. Proceedings of the 11th International Conference on Machine Learning. 1994: 157--163.