The explanation of TD learning on scholarpedia was adequate for me to understand it, but I still can't find a good explanation of the generalized algorithm with eligibility traces.
[link][5 comments]
The explanation of TD learning on scholarpedia was adequate for me to understand it, but I still can't find a good explanation of the generalized algorithm with eligibility traces.