REINFORCEMENT LEARNING WITH GOAL-DIRECTED ELIGIBILITY TRACES
Abstract
The eligibility trace is the most important mechanism used so far in reinforcement learning to handle delayed reward. Here, we introduce a new kind of eligibility trace, the goal-directed trace, and show that it results in more reliable learning than the conventional trace. We also propose a new, efficient algorithm for solving the goal-directed reinforcement learning problem.