no code implementations • 9 Nov 2021 • Mahmood Azhar Qureshi, Arslan Munir
We also generate a two-dimensional (2D) mesh architecture of Phantom neural computational cores, which we refer to as Phantom-2D accelerator, and propose a novel dataflow that supports all layers of a CNN, including unit and non-unit stride convolutions, and FC layers.
no code implementations • 19 Jul 2020 • Mahmood Azhar Qureshi, Arslan Munir
The designed core provides a 200% increase in peak throughput per PE count while only incurring a 6% increase in area overhead compared to a single, linear multiplier PE core with same output bit precision.