I'm a bit confused...is it reinforcement learning or something to help reinforcement learning methods like monte carlo/temporal difference?
[link][1 comment]
I'm a bit confused...is it reinforcement learning or something to help reinforcement learning methods like monte carlo/temporal difference?