When to use Monte Carlo over TD learning, and vice-versa

503 Views Asked by Ilyes Yamoun At 28 April 2019 at 16:27

When studying Reinforcement learning, and exactly when it comes to Model-Free RL, there are two methods we use generally:

TD learning
Monte Carlo

When is each one of them used over the other? In other words, how do we figure out what method is best for our problem?

Original Q&A

There are 1 best solutions below

Kris On 02 May 2019 at 02:00

Sections 6.1 and 6.2 of Sutton & Barto give a very nice intuitive understanding of the difference between Monte Carlo and TD learning.

Having said that, there's of course the obvious incompatibility of MC methods with non-episodic tasks. In that case, you will always need some kind of bootstrapping.

When to use Monte Carlo over TD learning, and vice-versa

There are 1 best solutions below

Related Questions in MACHINE-LEARNING

Related Questions in REINFORCEMENT-LEARNING

Related Questions in MONTECARLO

Related Questions in TEMPORAL-DIFFERENCE

Trending Questions

Popular # Hahtags

Popular Questions