CCVS: Context-aware Controllable Video Synthesis