Temporal difference learning
id:
temporal-difference-learning-297-6563436
title:
Temporal difference learning
text:
Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function. These methods sample from the environment, like Monte Carlo methods, and perform updates based on current estimates, like dynamic programming methods. While Monte Carlo methods only adjust their estimates once the final outcome is known, TD methods adjust predictions to match later, more accurate, predictions about the futu
brand slug:
wiki
category slug:
encyclopedia
description:
Computer programming concept
original url:
https://en.wikipedia.org/wiki/Temporal_difference_learning
date created:
date modified:
2024-04-27T06:04:22Z
main entity:
{"identifier":"Q7698910","url":"https://www.wikidata.org/entity/Q7698910"}
image:
fields total:
13
integrity:
14