Abstract
We propose a compressed kernelized least squares temporal difference learning (CKLSTD) algorithm for reinforcement learning in large state space by incorporate kernel trick and random projection into traditional LSTD. Our proposed method can be viewed as kernelized version of LSTD-RP and compressed version of KLSTD, wherein the proposed method preserves the merits of both of them. The experimental results suggest that our proposed method can gain better performance in high dimensional feature space.
Original language | English |
---|---|
Pages (from-to) | 8955-8962 |
Number of pages | 8 |
Journal | Journal of Computational Information Systems |
Volume | 9 |
Issue number | 22 |
DOIs | |
Publication status | Published - 15 Nov 2013 |
Externally published | Yes |
Keywords
- Compressive regression
- Kernel sparsification
- Least squares temporal difference learning
- Random projects
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications