Temporal logic control of general Markov decision processes by approximate policy refinement