| [1] | 
																						 
											 Bruce E R, James M G, Jacques J V.Machine operant conditioning[C]// Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 1988: 1500-1501.
											 											 | 
										
																													
																						| [2] | 
																						 
											 Zalama E, Gaudiano P, Coronado J L.Obstacle avoidance by means of an operant conditioning model[C]// International Workshop on Artificial Neural Networks Malaga-Torremolinos, Spain, 1995, 930: 471-477.
											 											 | 
										
																													
																						| [3] | 
																						 
											 Gaudiano P, Chang C.Adaptive obstacle avoidance with a neural network for operant conditioning: Experiments with real robots[C]// IEEE International Symposium on Computational Intelligence in Robotics and Automation, Monterey, 1997: 13-187.
											 											 | 
										
																													
																						| [4] | 
																						 
											 Björn Brembs, brembs.net:Research on Learning, Memory and Evolution [EB/OL]. http://brembs.net/.2014.
											 											 | 
										
																													
																						| [5] | 
																						 
											 Björn Brembs.Spontaneous decisions and operant conditioning in fruit flies[J]. Behavioural Processes (S0376-6357), 2011, 87(1): 157-164.
											 											 | 
										
																													
																						| [6] | 
																						 
											 Zalama E, Gomez J, Paul M, et al.Adaptive behavior navigation of a mobile robot[J]. IEEE Transactions on Systems, Man, and Cybernetics, Part A - Systems and Humans, 2002, 32(1): 160-169.
											 											 | 
										
																													
																						| [7] | 
																						 
											 Itoh K, Miwa H, Matsumoto M, et al.Behavior model of humanoid robots based on operant conditioning[C]// IEEE/RAS International Conference on Humanoid Robots. Piscataway, NJ, USA: IEEE, 2005: 220-225.
											 											 | 
										
																													
																						| [8] | 
																						 
											 Tadahiro T, Tetsuo S.Incremental acquisition of behaviors and signs based on a reinforcement learning schemata model and a spike timing-dependent plasticity network[J]. Advanced Robotics, 2007, 21: 1177-1199.
											 											 | 
										
																													
																						| [9] | 
																						 
											 蔡建羡, 阮晓钢. 基于遗传算法的Skinner操作条件反射学习模型[J]. 系统工程与电子技术, 2011, 33(6): 1370-1376.Cai J X, Ruan X G. Skinner operant conditioning learning model based on genetic algorithm[J]. System Engineering and Electronics, 2011, 33(6): 1370-1376.
											 											 | 
										
																													
																						| [10] | 
																						 
											 任红格, 史涛, 张瑞成. 基于操作条件反射机制的感觉运动系统认知模型的建立[J]. 机器人, 2012, 34(3): 292-298.Ren H G, Shi T, Zhang R C. Foundation of the sensorimotor system cognitive model with operant conditioning mechanism[J]. Robot, 2012, 34(3): 292-298.
											 											 | 
										
																													
																						| [11] | 
																						 
											 徐冰, 刘肖健. 基于动机模型的自主性虚拟人行为选择研究[J]. 计算机应用与软件, 2012, 29(4): 71-74.Xu B, Liu X J. Research on motivation model based behavior selection of autonomous virtual human[J]. Computer application and software, 2012, 29(4): 71-74.
											 											 | 
										
																													
																						| [12] | 
																						 
											 J John O'Doherty, Peter Dayan, et al. Dissociable Roles of Ventral and Dorsal Striatum in Instrumental Conditioning[J]. Science (S0036-8075). 2004, 304(5669): 452-454.
											 											 | 
										
																													
																						| [13] | 
																						 
											 Ruan X, Chen J, Yu N.Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method[J]. Neurocomputing (S0925-2312), 2012, 93: 27-40.
											 											 |