Dr. Mark Humphrys

School of Computing. Dublin City University.

Home      Blog      Teaching      Research      Contact

Search:

CA249      CA318      CA425      CA651

w2mind.computing.dcu.ie      w2mind.org


Mark Humphrys - Teaching - CA425


Artificial Intelligence



Course Descriptor



How to contact me

See How to contact me.


Notes

My notes contain many hyperlinks to background material. Some students get confused about what is the core course. The core course is anything that is linked to directly on this front page. All other links are just background material.


  1. Background
    1. Introduction to AI
    2. Survey of AI
    3. History of AI

    4. AI Links
    5. Robotics Links


  2. State-space control
    1. Continuum of Autonomy
    2. State-space control
    3. RL as Pattern Classification


  3. Reinforcement Learning - Reference


  4. Reinforcement Learning
    1. PhD extract
      1. Notation
      2. Appendix A of PhD
      3. Appendix B of PhD
      4. Ch.7 - Rewarding on transitions or continuously

    2. Accompanying Notes
      1. Exercise
      2. How Q-learning works

    3. Program code (for practical)
      1. Coding the state-space as a lookup-table
      2. Sample code for lookup-table Q-learning


  5. Reinforcement Learning - More
    1. PhD extracts
      1. Appendix C - 2-reward reward functions
      2. Appendix D - 3-reward (or more) reward functions
      3. Ch.18 - Feudal Q-learning

    2. Accompanying notes
      1. Building a world model
      2. Convergence
      3. The control policy
      4. Boltzmann "soft max" distribution
      5. How to make a decision probabilistically


  6. Movie demo
    1. Movie demo of W-learning contains within it a demo of basic Q-learning.


  7. Reinforcement Learning with Neural Networks (Pre-requisite needed.) - NOT ON COURSE THIS YEAR

    1. Neural Networks (Revision)
    2. Using a Neural Network as a generalisation in RL
    3. Q-learning with a Neural Network
    4. Ch.4 - Using a Neural Network with RL


  8. Multiple Minds - NOT ON COURSE THIS YEAR

    1. Ch.3 - Multi-Module Reinforcement Learning
    2. Ch.4 - Multiple Minds in the same body - Test of Hierarchical Q-learning
    3. Ch.18 - The general form of a Society of Mind based on Reinforcement Learning
    4. Open Issues in AI
    5. Architectures of Autonomous Agents
    6. The World-Wide-Mind (my research project)



Notes on Assignment Notation

I often use   :=   for assignment to distinguish from   =   for equality.

Notes on Assignment Notation



Labs

If the practical is based on the WWM server, I will hold one or two hands-on labs. Dates will be announced.


Practical

Practical - Play "X's and O's" with RL

Deadline Wed 12 Dec 2012.




Reading

Experiments in Adaptive State-Space Robotics, Clocksin and Moore, 1989. - Online. - A simple introduction to the very idea of state-space robotic or agent control.

How to Make Software Agents Do the Right Thing: An Introduction to Reinforcement Learning, Singh et al, 1996. - Online. - A simple introduction to the idea of RL.

Action Selection methods using Reinforcement Learning, Humphrys, 1997 (my PhD thesis). - Online. - Chapter 2 is the more formal introduction to RL above.

Kaelbling et al (1996), "Reinforcement Learning: A Survey", Journal of Artificial Intelligence Research 4:237-285. - Online.

Reinforcement Learning: An Introduction, Sutton and Barto, 1998. - Bookshop, and Online (also here and here).



Library categories



Feeds      HumphrysFamilyTree.com

Bookmark and Share           On Internet since 1987.