Visible to the public Dynamic Adaptation of Approximate Bit-width for CNNs based on Quantitative Error Resilience

TitleDynamic Adaptation of Approximate Bit-width for CNNs based on Quantitative Error Resilience
Publication TypeConference Paper
Year of Publication2019
AuthorsWu, Chengjun, Shan, Weiwei, Xu, Jiaming
Conference Name2019 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH)
Date PublishedJuly 2019
PublisherIEEE
ISBN Number978-1-7281-5520-3
Keywordsadders, approximate bit-width, approximate computing, approximate computing technology, approximation theory, CNNs, Computing Theory, configurable adder, convolution, convolutional neural nets, convolutional neural network, convolutional neural networks, dynamic adaptation, dynamic adaptation of approximate bit-width, error resilience, Hardware, logic circuits, Neurons, power aware computing, power consumption, Power demand, pubcrawl, quantitative error resilience, resilience, Resiliency
Abstract

As an emerging paradigm for energy-efficiency design, approximate computing can reduce power consumption through simplification of logic circuits. Although calculation errors are caused by approximate computing, their impacts on the final results can be negligible in some error resilient applications, such as Convolutional Neural Networks (CNNs). Therefore, approximate computing has been applied to CNNs to reduce the high demand for computing resources and energy. Compared with the traditional method such as reducing data precision, this paper investigates the effect of approximate computing on the accuracy and power consumption of CNNs. To optimize the approximate computing technology applied to CNNs, we propose a method for quantifying the error resilience of each neuron by theoretical analysis and observe that error resilience varies widely across different neurons. On the basic of quantitative error resilience, dynamic adaptation of approximate bit-width and the corresponding configurable adder are proposed to fully exploit the error resilience of CNNs. Experimental results show that the proposed method further improves the performance of power consumption while maintaining high accuracy. By adopting the optimal approximate bit-width for each layer found by our proposed algorithm, dynamic adaptation of approximate bit-width reduces power consumption by more than 30% and causes less than 1% loss of the accuracy for LeNet-5.

URLhttps://ieeexplore.ieee.org/document/9073195/
DOI10.1109/NANOARCH47378.2019.181283
Citation Keywu_ynamic_2019