std::map< T, float > probabilities
void update(T arm, float reward, VectorXf &context)
Updates the reward for a specific arm.
T choose(const VectorXf &context)
Chooses an arm based on the given tree and fitness. Should call sample_probs internally.
std::map< T, float > sample_probs(bool update)
Samples the probabilities of the arms.
< nsga2 selection operator for getting the front