Multi-armed bandit

id: multi-armed-bandit-165-7152961
title: Multi-armed bandit
text: In probability theory and machine learning, the multi-armed bandit problem is one in which a decision maker repeatedly selects one of a fixed set of choices (arms) whose properties are only partially known at the time of allocation and may become better understood as time passes. A fundamental aspect of bandit problems is that choosing an arm does not affect the properties of that arm or of the other arms. Instances of the multi-armed bandit problem include the task of iteratively all
brand slug: wiki
category slug: encyclopedia
description: Resource problem in machine learning
original url: https://en.wikipedia.org/wiki/Multi-armed_bandit
date created: 2005-10-07T14:38:28Z
date modified: 2024-08-29T09:12:49Z
main entity: {"identifier":"Q2882343","url":"https://www.wikidata.org/entity/Q2882343"}
image: {"content_url":"https://upload.wikimedia.org/wikipedia/commons/8/82/Las_Vegas_slot_machines.jpg","width":1100,"height":768}
fields total: 13
integrity: 16
