Combined with high-level semantics, Sem-LS is more robust under cluttered environment compared with existing line-shaped representations.
Under limited DRAM bandwidth, a system throughput of 1244GFlop/s is achieved at the Vertex UltraScale platform, which is 5. 48 times higher than the state-of-the-art FPGA implementations.
This achieves a peak throughput of 806. 4GOPS with 567. 5mW and is able to accelerate the five convolutional layers in AlexNet at a frame rate of 326. 2fps.