(Q7795822)

English

Thompson sampling

heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem

Statements

Identifiers

 