Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks

Tatar, Güner; Bayar, Salih; Çiçek, İhsan

dc.contributor.author	Tatar, Güner
dc.contributor.author	Bayar, Salih
dc.contributor.author	Çiçek, İhsan
dc.date.accessioned	2022-10-21T09:46:00Z
dc.date.available	2022-10-21T09:46:00Z
dc.date.issued	2022	en_US
dc.identifier.citation	TATAR, Güner, Salih BAYAR & İhsan ÇİÇEK. "Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks". 2022 International Conference on INnovations in Intelligent SysTems and Applications (INISTA), (2022): 1-6.	en_US
dc.identifier.uri	https://hdl.handle.net/11352/4188
dc.description.abstract	Low-precision neural network models are crucial for reducing the memory footprint and computational density. However, existing methods must have an average of 32-bit floatingpoint (FP32) arithmetic to maintain the accuracy. Floating-point numbers need grave memory requirements in convolutional and deep neural network models. Also, large bit-widths cause too much computational density in hardware architectures. Moreover, existing models must evolve into deeper network models with millions or billions of parameters to solve today’s problems. The large number of model parameters increase the computational complexity and cause memory allocation problems, hence existing hardware accelerators become insufficient to address these problems. In applications where accuracy can be tradedoff for the sake of hardware complexity, quantization of models enable the use of limited hardware resources to implement neural networks. From hardware design point of view, quantized models are more advantageous in terms of speed, memory and power consumption than using FP32. In this study, we compared the training and testing accuracy of the quantized LeNet and our own ConvNet neural network models at different epochs. We quantized the models using low precision int-4, int-8 and int-16. As a result of the tests, we observed that the LeNet model could only reach 63.59% test accuracy at 400 epochs with int-16. On the other hand, the ConvNet model achieved a test accuracy of 76.78% at only 40 epochs with low precision int-8 quantization.	en_US
dc.language.iso	eng	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	10.1109/INISTA55318.2022.9894261	en_US
dc.rights	info:eu-repo/semantics/embargoedAccess	en_US
dc.subject	Convolutional Neural Networks	en_US
dc.subject	Quantized Neural Networks	en_US
dc.subject	FPGA	en_US
dc.subject	Hardware Accelerators	en_US
dc.subject	Floating Point Arithmetic	en_US
dc.subject	Fixed Point Arithmetic	en_US
dc.subject	LeNet	en_US
dc.subject	ConvNet	en_US
dc.title	Performance Evaluation of Low-Precision Quantized LeNet and ConvNet Neural Networks	en_US
dc.type	conferenceObject	en_US
dc.relation.journal	2022 International Conference on INnovations in Intelligent SysTems and Applications (INISTA)	en_US
dc.contributor.department	FSM Vakıf Üniversitesi, Mühendislik Fakültesi, Elektrik-Elektronik Mühendisliği Bölümü	en_US
dc.identifier.startpage	16	en_US
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı	en_US
dc.contributor.institutionauthor	Tatar, Güner

Bu öğenin dosyaları:

Ad:: Tatar.pdf
Boyut:: 4.619Mb
Biçim:: PDF
Açıklama:: Konferans Öğesi

Göster/Aç

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Elektrik-Elektronik Mühendisliği Bölümü [59]
Elektrik-Elektronik Mühendisliği'ne ait yayınları içerir.
Scopus İndeksli Yayınlar / Scopus Indexed Publications [536]
Scopus İndeksli Yayınlar koleksiyonuna ait yayınları içerir.

Basit öğe kaydını göster