Comparison between Monte Carlo methods and temporal difference learning