Arkin, R. C. (1998). Behavior-based robotics. Cambridge, MA: MIT Press.
Bakker, P., & Kinuyoshi, Y. (1996). Robot see, robot do: An overview of robot imitation. Proceedings of the AISB96 Workshop on Learning in Robots and Animals, 3-11.
Barto, A.G. (1992) Reinforcement learning and adaptive critic methods. In D.A.White & D.A. Sofge (Eds.), Handbook of intelligent control, 469-491. New York: Van Nostrand Reinhold.
Bellman, R. (1954). The theory of dynamic programming. Bulletin of the American Mathematical Society, 60, 503-516.
Brooks, R. A. (1986). A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, 2(1), 14-23.
Brooks, R. A. (1990). Elephants don't play chess. Robotics and Autonomous Systems, 6, 3-15.
Catania, A. C., & Brigham, T. A. (Eds.). (1978). Handbook of applied behavior analysis: Social and instructional processes. New York: Irvington.
Dawes, R.M. (1979). The robust beauty of improper linear models in decision making. American Psychologist, 34, 571-582.
Donahoe, J. W., Burgos, J. E., & Palmer, D. C. (1993). A selectionist approach to reinforcement. Journal of the Experimental Analysis of Behavior, 60, 17-40.
Donahoe, J.W. & Palmer, D.C. (1994) Learning and complex behavior. Boston: Allyn & Bacon.
Donahoe, J.W., Palmer, D.C. & Burgos, J.E. (1997) The S-R Issue: Its status in behavior analysis and in Donahoe and Palmer's Learning and Complex Behavior. Journal of the Experimental Analysis of Behavior, 67, 193-211.
Dorigo, M. & Colombetti, M. (1998). Robot shaping: An experiment in behavior engineering. Cambridge, MA: MIT Press.
Edelman, G. M. (1987). Neural darwinism: The theory of neuronal group selection. New York: Basic Books.
Garcia, J., Erwin, F.R., & Koelling, R.A. (1966) Learning with prolonged delay in reinforcement. Psychonomic Science, 5, 121-122.
Holland, J.H. (1985). Properties of the bucket brigade algorithm. In J. J. Grefenstette, (Ed.), Proceedings of the 1st international conference on genetic algorithms and their applications (pp. 1-7). L.E. Associates.
Hutchison, W.R. (1997a) We also need complete behavioral models. Journal of the experimental analysis
of behavior, 67, 224-228.
Hutchison, W.R. (1997b) Learned Emergence of Functional Symbol Systems in Adaptive Autonomous Agents. Proceedings of the Intelligent Systems and Semiotics Conference, September 23-25, 1997. Gaithersburg, MD: NIST.
Hutchison, W. R. (1998). Computer simulations of verbal behavior for research and persuasion. The analysis of verbal behavior, 15, 117-120.
Hutchison, W.R. (2000) Adaptive Autonomous Agent with Verbal Learning (divisional). U.S. Patent #6,038,556. Hutchison, W. R. (2002). Adaptive Agent. U.S. Patent #6,366,896.
Hutchison, W. R. & Constantine, B. J. (2003). Autonomous adaptive agent with grounded functional language. Proceedings of the Seventh International Conference on Cognitive and Neural Systems, May 29-31, 2003, 43.
Kandel, E. R., Schwartz, J. H., & Jessell, T. M. (2000). Principles of neural science. New York: McGraw-Hill.
Klopf, A.H. (1982) The hedonistic neuron. Washington: Hemisphere.
Kritchmar, J. L. & Edelman, G. M. (2002). Machine psychology: Autonomous behavior, perceptual categorization and conditioning in a brain-based device. Cerebral Cortex, 12, 818-830.
Lin, L. (1992). Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning, 8, 293-321.
Lowenkron, B. (2005) Meaning: A Verbal Behavior Account. The Analysis of Verbal Behavior, 22, nn-nn.
Malott, R. W., Whaley, D. L. & Malott, M. E. (1991). Elementary principles of behavior. Englewood Cliffs, NJ: Prentice Hall.
McBride, B., Longoria, R., & Krotkov, E. (2003). Measurement and prediction of the off-road mobility of small robotic ground vehicles. Performance Metrics for Intelligent Systems Workshop, September 16-18, 2003.
Miller, Sutton, and Werbos, Eds., Neural networks for control. Cambridge, MA: MIT Press (1990)
Minsky, M. L. & Papert, S. A. (1969). Perceptrons. Cambridge, MA: MIT Press.
Ng, A. Y., Coates, A., Diel, M., Ganapathi, V., Schulte, J., Tse, B., Berger, E., & Liang, E. Inverted autonomous helicopter flight via reinforcement learning. In International Symposium on Experimental Robotics, 2004.
Oah, S., & Dickinson, A.M. (1989), A review of empirical studies of verbal behavior. The analysis of verbal behavior, 7, 53-68.
Palmer, D. C. & Donahoe, J. W. (1992). Essentialism and selection in cognitive science and behavior analysis. American Psychologist, 47, 1344-1358.
Peterson, G. B. (2000). The discovery of shaping: B. F. Skinner's big surprise. The Clicker Journal: The Magazine for Animal Trainers, 43, 6-13. Reprinted at www.behavior.org/animals/animals_discovery_shaping.cfm.
Peterson, G. B. (2001). The world's first look at shaping: B. F. Skinner's gutsy gamble. The Clicker Journal: The Magazine for Animal Trainers, 49 & 50, 14-21. Reprinted at www.behavior.org/animals/animals_worlds_first.cfm.
Pryor, K. (1999). Don't shoot the dog: The new art of teaching and training. Revised edition. New York: Bantam.
Rescorla, R. A. & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. R. Prokasy (Eds.), Classical conditioning II: Current research and theory (pp. 64-99). New York: Appleton-Century-Crofts.
Roy, D., & Pentland, A. (2002). Learning words from sights and sounds. Cognitive Science, 26(1), 113-146.
Skinner, B.F. (1957) Verbal behavior. New York: Appleton-Century-Crofts. (available from B.F. Skinner Foundation)
Skinner, B. F. (1958). Reinforcement today. American Psychologist, 13, 94-99.
Skinner, B.F. (1966) The ontogeny and phylogeny of behavior. Science, 153, 1203-1213.
Skinner, B.F. (1969) Contingencies of reinforcement. New York: Appleton-Century-Crofts.
Smart, W. D. & Kaelbling, L. P. (2002). Effective reinforcement learning for mobile robots. International Conference on Robotics and Automation, May 11-15, 2002.
Stein, L., & Belluzzi, J.D. (1989) Cellular investigations of behavioral reinforcement. Neuroscience and Biobehavioral Reviews, 13, 69-80.
Sulzer-Azaroff, B. & Mayer, G. R. (1991). Behavior analysis for lasting change. Fort Worth, TX: Holt, Rinehart & Winston.
Sutton, R.S., & Barto, A.G. (1981) Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Review, 88, 135-171.
Sutton, R.S., and Barto, A.G. (1998) Reinforcement Learning: An introduction. Cambridge, MA: MIT Press.
Terrace, H. S. (1963). Errorless transfer of a discrimination across two continua. Journal of the Experimental Analysis of Behavior, 6, 223-232.
Tesauro, G. (1995) Temporal difference learning and TD-Gammon. Communications of the ACM, 38, 3.
Touretzky, D. S., Daw, N. D., & Tira-Thompson, E. J. (2002). Combining configural and TD learning on a robot. Proceedings of the Second International Conference on Development and Learning, Cambridge MA, June 12-15, 2002.
Touretzky, D. S. & Saksida, L. M. (1997). Operant conditioning in Skinnerbots, Adaptive Behavior, 5 (3/4), 219-247.
Ungless, M. A., Magill, P. J., & Bolam, J. P. (2004). Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli. Science, 303, 2040-2042.
Vihman, M.M. (1996) Phonological development: The origins of language in the child. Cambridge, MA: Blackwell.
Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., and Lang, K. (1989) Phoneme recognition using time-delay neural networks. IEEE Transactions on Acoustics, Speech and Signal Processing, 37(3), 328-339.
Wang, D. L. and Arbib, M. (1990). Complex temporal sequence learning based on short-term memory. Proceedings of the IEEE, 78, 1536-1542.
Weng, J. & Chen, S. (1996). Incremental learning for vision-based navigation. Proceedings of the international conference on pattern recognition, Vienna, Austria. Vol. IV, 45-49.
Werbos, P. J. (1974). Beyond regression: New tools for prediction and analysis in the behavioral sciences. PhD thesis, Harvard University.
Werbos, P. J. (1989). Backpropagation and neural control: A review and prospectus. IEEE international conference on neural networks, Vol. 1, 209-216.
Winston, P. (1992) Artificial intelligence (3rd ed.) Reading, MA: Addison-Wesley.
Yun, I. A., Wakabayashi, K. T., Fields, H. L., & Nicola, S. M. (2004). The ventral tegmental area is required for the behavioral and nucleus accumbens neuronal firing responses to incentive cues. Journal of Neuroscience, 24, 2923-2933.