A-PSRO: A Unified Strategy Learning Method with Advantage Function for Normal-form Games