Real-Time Hardware Acceleration of Low Precision Quantized Custom Neural Network Model on ZYNQ SoC
Abstract
Reducing the memory footprint and computational density of neural network models requires low-precision arithmetic. However, existing techniques typically rely on floating-point arithmetic to preserve accuracy, which is problematic for convolutional neural networks (CNNs), whose memory requirements grow substantially when parameters are stored as floating-point numbers. Additionally, larger bit widths lead to higher computational density in hardware architectures. Meanwhile, contemporary problems have pushed models to become deeper, sometimes with billions of parameters, increasing computational complexity and straining memory allocation. These challenges render existing hardware accelerators insufficient. Where a small loss of accuracy is acceptable in exchange for reduced hardware complexity, model quantization enables neural networks to be implemented with limited hardware resources. From a hardware design standpoint, quantized models offer notable advantages in speed, memory utilization, and power consumption compared to traditional floating-point arithmetic. To this end, we propose a network intrusion detection method in which the weights and activations of a custom multi-layer detector are quantized using the Brevitas library. We evaluated the technique in real time on a ZYNQ System-on-Chip (SoC) using the FINN framework, which compiles quantized deep neural networks into Field Programmable Gate Array (FPGA) implementations, achieving an accuracy of approximately 92%. The experiments use the UNSW-NB15 dataset, generated by the Australian Centre for Cyber Security (ACCS).
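For illustration, a minimal Brevitas sketch of the kind of quantized multi-layer detector described above is shown below. The bit width, layer sizes, and input dimension are assumptions made for the example (the 593-feature input width follows the public FINN cybersecurity tutorial for UNSW-NB15), not the exact architecture evaluated in this work.

import torch.nn as nn
from brevitas.nn import QuantLinear, QuantReLU

BIT_WIDTH = 2      # assumed precision for both weights and activations
IN_FEATURES = 593  # assumed one-hot-encoded UNSW-NB15 feature width
HIDDEN = 64        # assumed hidden-layer size

# Three QuantLinear layers hold low-precision weights; QuantReLU quantizes
# the activations to the same bit width. BatchNorm helps keep training
# stable at such low precision.
model = nn.Sequential(
    QuantLinear(IN_FEATURES, HIDDEN, bias=True, weight_bit_width=BIT_WIDTH),
    nn.BatchNorm1d(HIDDEN),
    QuantReLU(bit_width=BIT_WIDTH),
    QuantLinear(HIDDEN, HIDDEN, bias=True, weight_bit_width=BIT_WIDTH),
    nn.BatchNorm1d(HIDDEN),
    QuantReLU(bit_width=BIT_WIDTH),
    QuantLinear(HIDDEN, 1, bias=True, weight_bit_width=BIT_WIDTH),  # binary attack/normal score
)
# After training, the model can be exported in a FINN-compatible ONNX form
# (e.g. via brevitas.export) and compiled by FINN into an FPGA dataflow
# accelerator targeting the ZYNQ SoC.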