Fundamental technique lets researchers use a big, expensive “teacher” model to train a “student” model for less.
The story How Distillation Makes AI Models Smaller and Cheaper first appeared on Quanta Magazine.