Runtime Configurable Deep Neural Networks for Energy-Accuracy Trade-off

07/19/2016
by   Hokchhay Tann, et al.
0

We present a novel dynamic configuration technique for deep neural networks that permits step-wise energy-accuracy trade-offs during runtime. Our configuration technique adjusts the number of channels in the network dynamically depending on response time, power, and accuracy targets. To enable this dynamic configuration technique, we co-design a new training algorithm, where the network is incrementally trained such that the weights in channels trained in earlier steps are fixed. Our technique provides the flexibility of multiple networks while storing and utilizing one set of weights. We evaluate our techniques using both an ASIC-based hardware accelerator as well as a low-power embedded GPGPU and show that our approach leads to only a small or negligible loss in the final network accuracy. We analyze the performance of our proposed methodology using three well-known networks for MNIST, CIFAR-10, and SVHN datasets, and we show that we are able to achieve up to 95 reduction with less than 1 addition, compared to prior work on dynamic network reconfiguration, we show that our approach leads to approximately 50 while achieving similar accuracy.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset