Markov decision process

id: markov-decision-process-230-115537
title: Markov decision process
text: Markov decision process (MDP), also called a stochastic dynamic program or stochastic control problem, is a model for sequential decision making when outcomes are uncertain. Originating from operations research in the 1950s, MDPs have since gained recognition in a variety of fields, including ecology, economics, healthcare, telecommunications and reinforcement learning. Reinforcement learning utilizes the MDP framework to model the interaction between a learning agent and its environment. In thi
brand slug: wiki
category slug: encyclopedia
description: Mathematical model
original url: https://en.wikipedia.org/wiki/Markov_decision_process
date created: 2004-11-02T02:35:34Z
date modified: 2024-09-15T17:19:30Z
main entity: {"identifier":"Q176789","url":"https://www.wikidata.org/entity/Q176789"}
image:
fields total: 13
integrity: 15

Related Entries

Explore Next Part