TY - JOUR
T1 - On-sensor binarized CNN inference with dynamic model swapping in pixel processor arrays
AU - Liu, Yanan
AU - Bose, Laurie
AU - Fan, Rui
AU - Dudek, Piotr
AU - Mayol-Cuevas, Walterio
N1 - Funding Information:
This work was supported by National Key R&D Program of China (Grant No. 2020AAA0108100), UK EPSRC EP/M019454/1, EPSRC Centre for Doctoral Training in Future Autonomous and Robotic Systems: FARSCOPE, and China Scholarship Council (CSC, No. 201700260083).
Publisher Copyright:
Copyright © 2022 Liu, Bose, Fan, Dudek and Mayol-Cuevas.
PY - 2022/8/15
Y1 - 2022/8/15
N2 - Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor.
AB - Many types of Convolutional Neural Network (CNN) models and training methods have been proposed in recent years aiming to provide efficiency for embedded and edge devices with limited computation and memory resources. The wide variety of architectures makes this a complex task that has to balance generality with efficiency. Among the most interesting camera-sensor architectures are Pixel Processor Arrays (PPAs). This study presents two methods that are useful for embedded CNNs in general but particularly suitable for PPAs. The first is for training purely binarized CNNs, the second is for deploying larger models with a model swapping paradigm that loads model components dynamically. Specifically, this study trains and implements networks with batch normalization and adaptive threshold for binary activations. Then, we convert batch normalization and binary activations into a bias matrix which can be parallelly implemented by an add/sub operation. For dynamic model swapping, we propose to decompose applications that are beyond the capacity of a PPA into sub-tasks that can be solved by tree networks that can be loaded dynamically as needed. We demonstrate our approaches to various tasks including classification, localization, and coarse segmentation on a highly resource constrained PPA sensor-processor.
KW - convolutional neural network
KW - embedded computer vision
KW - on-sensor computing
KW - pixel processor array
KW - SCAMP vision system
UR - http://www.scopus.com/inward/record.url?scp=85136869033&partnerID=8YFLogxK
U2 - 10.3389/fnins.2022.909448
DO - 10.3389/fnins.2022.909448
M3 - Article
AN - SCOPUS:85136869033
VL - 16
JO - Frontiers in Neuroscience
JF - Frontiers in Neuroscience
SN - 1662-4548
M1 - 909448
ER -