no code implementations • 26 Dec 2023 • Tanvi Sharma, Mustafa Ali, Indranil Chakraborty, Kaushik Roy
The proposed work provides insights into what type of CiM to use, and when and where to optimally integrate it in the cache hierarchy for GEMM acceleration.
no code implementations • 26 Jun 2021 • Jiawei Zhao, Steve Dai, Rangharajan Venkatesan, Brian Zimmer, Mustafa Ali, Ming-Yu Liu, Brucek Khailany, Bill Dally, Anima Anandkumar
Representing deep neural networks (DNNs) in low-precision is a promising approach to enable efficient acceleration and memory reduction.
no code implementations • 8 May 2021 • Sourjya Roy, Mustafa Ali, Anand Raghunathan
Processing in memory has been proposed as a promising solution for the memory wall bottleneck for ML workloads.
no code implementations • 27 Mar 2020 • Mustafa Ali, Akhilesh Jaiswal, Sangamesh Kodge, Amogh Agrawal, Indranil Chakraborty, Kaushik Roy
`In-memory computing' is being widely explored as a novel computing paradigm to mitigate the well known memory bottleneck.