Concept information
PREFERRED TERM
multi-armed bandit
DEFINITION
- Problem of identifying, among several actions whose rewards follow initially unknown probability distributions, the action or actions that yield the best reward. The name refers to choosing the most promising slot machine (a "one-armed bandit") in a casino. The central difficulty is striking a good balance between exploration (estimating the reward distributions) and exploitation (securing a good cumulative reward). (Source: adapted and translated from https://ia.gdria.fr/Glossaire/bandit-multi-bras/ )
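The exploration/exploitation trade-off in the definition can be illustrated with a minimal sketch. It assumes Bernoulli (win/lose) rewards and uses the epsilon-greedy strategy, which is only one of many possible approaches and is not prescribed by the source. The function name `epsilon_greedy` and its parameters `true_means`, `steps`, `epsilon`, and `seed` are hypothetical names chosen for this illustration.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Play a simulated Bernoulli bandit with an epsilon-greedy strategy.

    true_means: win probability of each arm (unknown to the player;
                used here only to simulate the rewards).
    Returns (estimates, counts, total_reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Exploration: try a random arm to refine its estimate.
            arm = rng.randrange(n_arms)
        else:
            # Exploitation: play the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Simulate the reward: 1 with probability true_means[arm], else 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


if __name__ == "__main__":
    estimates, counts, total = epsilon_greedy([0.2, 0.5, 0.8])
    print("estimated means:", [round(e, 2) for e in estimates])
    print("pulls per arm:", counts)
    print("cumulative reward:", total)
```

A larger `epsilon` spends more pulls learning the distributions (exploration). A smaller one commits sooner to the arm that currently looks best (exploitation), at the risk of settling on a suboptimal arm.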
BROADER CONCEPT
IN OTHER LANGUAGES
-
French
URI
http://data.loterre.fr/ark:/67375/23L-KK7DCNX6-Z