On Optimistic versus Randomized Exploration in Reinforcement Learning