Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/16303
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSankhe, Akashen_US
dc.contributor.authorLokhande, Mukulen_US
dc.contributor.authorSharma, Radheshyamen_US
dc.contributor.authorVishvakarma, Santosh Kumaren_US
dc.date.accessioned2025-06-20T06:39:35Z-
dc.date.available2025-06-20T06:39:35Z-
dc.date.issued2025-
dc.identifier.citationSankhe, A., Lokhande, M., Sharma, R., & Vishvakarma, S. K. (2025). Area-Optimized 2D Interleaved Adder Tree Design for Sparse DCIM Edge Processing. Proceedings International Symposium on Quality Electronic Design Isqed. https://doi.org/10.1109/ISQED65160.2025.11014431en_US
dc.identifier.issn1948-3287-
dc.identifier.otherEID(2-s2.0-105007524117)-
dc.identifier.urihttps://dx.doi.org/10.1109/ISQED65160.2025.11014431-
dc.identifier.urihttps://dspace.iiti.ac.in:8080/jspui/handle/123456789/16303-
dc.description.abstractRecently, SRAM-embedded compute-in-memory (CIM) hardware has emerged as a promising solution to mitigate von-Neumann bottlenecks and has shown noteworthy improvements in energy efficiency and throughput for matrix-vector multiplication, a significant portion of neural networks. While PVT variations significantly impact traditional analog/mixed-signal (AMS) macros, the DCIM macros are more robust. This article proposes a DCIM macro that incorporates an 8-transistor SRAM bitcell capable of performing 1-bit multiplications and addressing the bit-flip issue arising from the simultaneous activation of multiple array rows. The macro also includes a 2D interleaved adder tree constructed using a novel 7T-based ripple carry adder (RCA), significantly reducing the adder tree's area. The proposed 16Kb DCIM macro computes 64 parallel products in a single clock cycle. It demonstrates 2 × higher energy efficiency than recent state-of-the-art works at 65nm CMOS. The macro is validated at 250MHz and achieves the classification accuracy of 98.7%, 98.8% for 1A4W precision, and 99.1%, 97.8% for 4A4W precision, for LeNet-5 architecture using MNIST and A-Z alphabet datasets respectively. © 2025 IEEE.en_US
dc.language.isoenen_US
dc.publisherIEEE Computer Societyen_US
dc.sourceProceedings - International Symposium on Quality Electronic Design, ISQEDen_US
dc.subjectCompute-in-memory (CIM)en_US
dc.subjectDNN (deep neural networks)en_US
dc.subjectEdge-AI acceleratorsen_US
dc.subjectmultiply-and-accumulateen_US
dc.subjectreconfigurable precisionen_US
dc.titleArea-Optimized 2D Interleaved Adder Tree Design for Sparse DCIM Edge Processingen_US
dc.typeConference Paperen_US
Appears in Collections:Department of Electrical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: