no code implementations • 6 Dec 2022 • Kavya Sreedhar, Jason Clemons, Rangharajan Venkatesan, Stephen W. Keckler, Mark Horowitz
To create dynamic models, we leverage the resilience of vision transformers to pruning and switch between different scaled versions of a model.
3 code implementations • 1 Feb 2019 • Kentaro Yoshioka, Edward Lee, Simon Wong, Mark Horowitz
We develop fixed-angle, long-duration video datasets across several domains, and we show that the dataset size can be culled by a factor of 300x to reduce the total training time by 47x with no accuracy loss or even with slight improvement.
4 code implementations • 6 Nov 2018 • Kentaro Yoshioka, Edward Lee, Mark Horowitz
For the limited domain, we observed that compact DSMs significantly surpass the accuracy of COCO trained models of the same size.
no code implementations • 10 Sep 2018 • Xuan Yang, Mingyu Gao, Jing Pu, Ankita Nayak, Qiaoyi Liu, Steven Emberton Bell, Jeff Ou Setter, Kaidi Cao, Heonjae Ha, Christos Kozyrakis, Mark Horowitz
Many DNN accelerators have been proposed and built using different microarchitectures and program mappings.
Distributed, Parallel, and Cluster Computing
3 code implementations • 28 Oct 2016 • Jing Pu, Steven Bell, Xuan Yang, Jeff Setter, Stephen Richardson, Jonathan Ragan-Kelley, Mark Horowitz
We address this problem by extending the image processing language, Halide, so users can specify which portions of their applications should become hardware accelerators, and then we provide a compiler that uses this code to automatically create the accelerator along with the "glue" code needed for the user's application to access this hardware.
Software Engineering
1 code implementation • 14 Jun 2016 • Xuan Yang, Jing Pu, Blaine Burton Rister, Nikhil Bhagdikar, Stephen Richardson, Shahar Kvatinsky, Jonathan Ragan-Kelley, Ardavan Pedram, Mark Horowitz
Convolutional Neural Networks (CNNs) are the state of the art solution for many computer vision problems, and many researchers have explored optimized implementations.