About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
Science and Technologies for Smart Cities. 6th EAI International Conference, SmartCity360°, Virtual Event, December 2-4, 2020, Proceedings

Research Article

Inference Performance Comparison of Convolutional Neural Networks on Edge Devices

Download(Requires a free EAI acccount)
4 downloads
Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1007/978-3-030-76063-2_23,
        author={Sheikh Rufsan Reza and Yuzhong Yan and Xishuang Dong and Lijun Qian},
        title={Inference Performance Comparison of Convolutional Neural Networks on Edge Devices},
        proceedings={Science and Technologies for Smart Cities. 6th EAI International Conference, SmartCity360°, Virtual Event, December 2-4, 2020, Proceedings},
        proceedings_a={SMARTCITY},
        year={2021},
        month={5},
        keywords={Model compression Edge device Deep learning Internet of Things},
        doi={10.1007/978-3-030-76063-2_23}
    }
    
  • Sheikh Rufsan Reza
    Yuzhong Yan
    Xishuang Dong
    Lijun Qian
    Year: 2021
    Inference Performance Comparison of Convolutional Neural Networks on Edge Devices
    SMARTCITY
    Springer
    DOI: 10.1007/978-3-030-76063-2_23
Sheikh Rufsan Reza1,*, Yuzhong Yan1, Xishuang Dong1, Lijun Qian1
  • 1: Center of Excellence in Research and Education for Big Military Data Intelligence (CREDIT Center), Prairie View A&M University, Texas A&M University System, Prairie View
*Contact email: sreza@student.pvamu.edu

Abstract

With the proliferation of Internet of Things (IoT), large amount of data are generated at edge devices with an unprecedented speed. In order to protect the privacy and security of big edge data, as well as reduce the communications cost, it is desirable to process the data locally at the edge devices. In this study, the inference performance of several popular pre-trained convolutional neural networks on three edge computing devices are evaluated. Specifically, MobileNetV1 & V2 and InceptionV3 models have been tested on NVIDIA Jetson TX2, Jetson Nano, and Google Edge TPU for image classification. Furthermore, various compression techniques including pruning, quantization, binarized neural network, and tensor decomposition are applied to reduce the model complexity. The results will provide a guidance for practitioners when deploying deep learning models on resource constrained edge devices for near real-time and on-site learning.

Keywords
Model compression Edge device Deep learning Internet of Things
Published
2021-05-22
Appears in
SpringerLink
http://dx.doi.org/10.1007/978-3-030-76063-2_23
Copyright © 2020–2025 ICST
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL