Selective Bootstrap Adaptation
  • modified the Least-Mean-Square (LMS) algorithm to produce a reinforcement learning rule that could learn from success and failure signals instead of from training examples
  • a reinforcement learning rule that could learn from success and failure signals instead of from training examples
  • described it as “learning with a critic” instead of “learning with a teacher.”