Sign in

Snapshot Distillation: Teacher-Student Optimization in One Generation

By Chenglin Yang and others
Optimizing a deep neural network is a fundamental task in computer vision, yet direct training methods often suffer from over-fitting. Teacher-student optimization aims at providing complementary cues from a model trained previously, but these approaches are often considerably slow due to the pipeline of training a few generations in sequence,... Show more
December 1, 2018
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
Summary