......@@ -212,9 +212,26 @@ Object categorization methods are trained to recognize \textbf{1000s of classes}
\begin{block}{\Large Example: Multi-output Distillation}
\textbf{Multi-exit architectures}
\begin{itemize}
\item a crude initial prediction that is gradually refined
\item can be stopped at any time and still provide a valid prediction
\end{itemize}
\textbf{Standard training}
\begin{itemize}
\item sum of exit-wise losses
\end{itemize}
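As a sketch in assumed notation (not taken from the poster): with $K$ exits, $p_k$ the prediction of exit $k$, $y$ the label, and $\ell_{\mathrm{CE}}$ the cross-entropy, the standard objective could be written as
\[
\mathcal{L}_{\mathrm{std}} = \sum_{k=1}^{K} \ell_{\mathrm{CE}}(y, p_k).
\]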
\textbf{Proposed training}
\begin{itemize}
\item distillation from later (more accurate) to earlier exits
\end{itemize}
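One possible form of such an objective, given as a sketch under assumptions rather than the exact loss used here: for $K$ exits with predictions $p_1, \dots, p_K$ and label $y$, each earlier exit $k$ is also trained to match the predictions of the later exits $t > k$,
\[
\mathcal{L}_{\mathrm{dist}} = \sum_{k=1}^{K} \ell_{\mathrm{CE}}(y, p_k) + \alpha \sum_{k=1}^{K-1} \sum_{t=k+1}^{K} \ell_{\mathrm{KD}}(p_t, p_k),
\]
where $\ell_{\mathrm{CE}}$ is the cross-entropy, and the weight $\alpha \ge 0$ and the distillation term $\ell_{\mathrm{KD}}$ (e.g.\ a KL divergence from the teacher prediction $p_t$ to the student prediction $p_k$) are hypothetical choices.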
