Bit-serial computing technique based  efficient deep neural network accelerator

Chhajed, Harsh

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/3095

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Vishvakarma, Santosh Kumar	-
dc.contributor.author	Chhajed, Harsh	-
dc.date.accessioned	2021-09-14T18:51:56Z	-
dc.date.available	2021-09-14T18:51:56Z	-
dc.date.issued	2021-08-27	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/3095	-
dc.description.abstract	Contemporary hardware implementations of deep neural networks face the burden of excess area requirement due to resource-intensive elements such as a multiplier. A semi-custom ASIC approach-based VLSI circuit design of the multiply-accumulate unit in a deep neural network faces the chip area limitation. Therefore an area and power-efficient architecture for the multiply-accumulate unit is imperative to down the burden of excess area requirement for digital design exploration. The present work addresses this challenge by proposing an efficient processing and bit-serial computation based multiply-accumulate unit implementation. The proposed architecture is verified using simulation output and synthesized using Synopsys design vision at 180nm and 45nm technology and extracted all physical parameters using Cadence Virtuoso. At 45nm, design shows 34.35% less area-delay-product (ADP). It shows improvement by 25.94% in area, 35.65% in power dissipation, and 14.30% in latency with respect to the state-of-the-art multiply-accumulate unit design. Furthermore, at lower technology node gets higher leakage power dissipation. In order to save leakage power, we exploit the power-gated design for the proposed architecture. The used coarse-grain power gating technique saves 52.79% leakage/static power with minimal area overhead. The system-level performance of MAC has evaluated using LeNet architecture which is implemented on the FPGA board to validate the performance parameters impact of the proposed multiply-accumulate unit. The architecture with the proposed multiply accumulate unit saved 49.39% look-up table utilization for multiply-accumulate unit and 20% overall look-up table utilization compared to MAC with Xilinx multiplier and adder IP. The proposed architecture over performed for critical time delay compared to the state-of-the-art.	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Electrical Engineering, IIT Indore	en_US
dc.relation.ispartofseries	MSR008	-
dc.subject	Electrical Engineering	en_US
dc.title	Bit-serial computing technique based efficient deep neural network accelerator	en_US
dc.type	Thesis_MS Research	en_US
Appears in Collections:	Department of Electrical Engineering_ETD

Files in This Item:

File	Description	Size	Format
MSR008_Harsh_Chhajed_1904102006.pdf		3.76 MB	Adobe PDF	View/Open

Show simple item record

Altmetric Badge: