LEARNING, EXPLORATION AND CHAOTIC POLICIES
Abstract
We consider different versions of exploration in reinforcement learning, using navigation in a shortcut maze as the test problem. It is shown that a chaotic ∊-greedy policy may be as efficient as a random one. The best results were obtained with a model chaotic neuron. Therefore, an exploration strategy can be implemented in a deterministic learning system such as a neural network.
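The chaotic ∊-greedy idea described above can be sketched as follows. The paper's model chaotic neuron is not specified here, so a logistic map at r = 4 (a standard fully chaotic map) stands in as the deterministic source of exploration decisions; the class name, parameters, and map choice are illustrative assumptions, not the authors' implementation.

```python
class ChaoticEpsilonGreedy:
    """Epsilon-greedy action selection driven by a deterministic
    chaotic map instead of a pseudo-random number generator."""

    def __init__(self, n_actions, epsilon=0.1, x0=0.3, r=4.0):
        self.n_actions = n_actions
        self.epsilon = epsilon
        self.x = x0   # chaotic state; avoid fixed points of the map
        self.r = r

    def _next(self):
        # One iteration of the logistic map x <- r*x*(1-x).
        # For r = 4 the orbit is chaotic and dense in (0, 1)
        # for almost every initial condition.
        self.x = self.r * self.x * (1.0 - self.x)
        return self.x

    def select(self, q_values):
        # Explore when the chaotic variable falls below epsilon,
        # mirroring the usual comparison against a uniform draw.
        if self._next() < self.epsilon:
            # A second map iteration picks the exploratory action.
            return int(self._next() * self.n_actions) % self.n_actions
        # Otherwise act greedily with respect to the value estimates.
        return max(range(self.n_actions), key=lambda a: q_values[a])


policy = ChaoticEpsilonGreedy(n_actions=4, epsilon=0.2)
action = policy.select([0.0, 1.0, 0.5, 0.2])
```

One caveat worth noting: the logistic map's invariant density is not uniform (it concentrates near 0 and 1), so the effective exploration rate differs somewhat from ∊; a different chaotic map, or the paper's chaotic neuron, would change this behavior.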