Upper Confidence Bound (UCB) - The Multi-Armed Bandit Problem