Reordering GPU Kernel Launches to Enable Efficient Concurrent Execution