TOCO: A Framework for Compressing Neural Network Models Based on Tolerance Analysis