Selective Bootstrap Adaptation
- modified the Least-Mean-Square (LMS) algorithm to produce a reinforcement learning rule that could learn from success and failure signals instead of from training examples
- a reinforcement learning rule that could learn from success and failure signals instead of from training examples
- described it as “learning with a critic” instead of “learning with a teacher.”