Multi-armed bandit

id: multi-armed-bandit-165-7152961

title: Multi-armed bandit

text: In probability theory and machine learning, the multi-armed bandit problem is one in which a decision maker iteratively selects one of multiple fixed choices (arms) when the properties of each choice are only partially known at the time of allocation and may become better understood as time passes. A fundamental aspect of bandit problems is that choosing an arm does not affect the properties of that arm or of the other arms. Instances of the multi-armed bandit problem include the task of iteratively all

brand slug: wiki

category slug: encyclopedia

description: Resource problem in machine learning

original url: https://en.wikipedia.org/wiki/Multi-armed_bandit

date created: 2005-10-07T14:38:28Z

date modified: 2024-08-29T09:12:49Z

main entity: {"identifier":"Q2882343","url":"https://www.wikidata.org/entity/Q2882343"}

image: {"content_url":"https://upload.wikimedia.org/wikipedia/commons/8/82/Las_Vegas_slot_machines.jpg","width":1100,"height":768}

fields total: 13

integrity: 16
