Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks
Abstract
Low-precision neural network models are crucial for reducing memory footprint and computational cost. However, existing methods typically rely on 32-bit floating-point (FP32) arithmetic to maintain accuracy. Floating-point numbers impose heavy memory requirements in convolutional and deep neural network models, and large bit-widths place a heavy computational load on hardware architectures. Moreover, solving today's problems requires ever-deeper network models with millions or billions of parameters. This large number of parameters increases computational complexity and causes memory allocation problems, so existing hardware accelerators become insufficient. In applications where accuracy can be traded off against hardware complexity, model quantization enables neural networks to be implemented with limited hardware resources. From a hardware design point of view, quantized models offer advantages in speed, memory, and power consumption over FP32 models. In this study, we compare the training and test accuracy of quantized versions of the LeNet model and our own ConvNet model at different epoch counts, quantizing the models to low-precision int-4, int-8, and int-16. Our tests show that the LeNet model reached only 63.59% test accuracy after 400 epochs with int-16, whereas the ConvNet model achieved 76.78% test accuracy after only 40 epochs with low-precision int-8 quantization.
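
The abstract does not specify the quantization scheme used; as a minimal sketch, assuming symmetric uniform per-tensor quantization, the following Python/NumPy snippet illustrates how FP32 weights can be mapped to int-8 (int-4 or int-16 follow by changing num_bits). The function names quantize and dequantize are illustrative, not taken from the paper.

import numpy as np

def quantize(x, num_bits):
    # Symmetric uniform quantization: map FP32 values to signed integers
    # in [-(2^(b-1) - 1), 2^(b-1) - 1] using a single per-tensor scale.
    qmax = 2 ** (num_bits - 1) - 1
    scale = max(np.max(np.abs(x)) / qmax, 1e-12)  # guard against all-zero input
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int32)
    return q, scale

def dequantize(q, scale):
    # Recover an FP32 approximation of the original tensor.
    return q.astype(np.float32) * scale

# Example: quantize a random weight tensor to int-8 and measure the error.
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize(w, num_bits=8)
w_hat = dequantize(q, s)
print("max abs quantization error:", np.max(np.abs(w - w_hat)))

Lower bit-widths shrink the integer range (e.g. int-4 spans only -7 to 7), which coarsens the quantization grid and increases the reconstruction error, matching the accuracy trade-off the study reports.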










