no code implementations • 11 Apr 2023 • Venkat Srinivasan, Darshan Gandhi, Urmish Thakker, Raghu Prabhakar
We show that we can successfully train GPT 13B to the same quality as the dense GPT 13B model, while achieving an end-end speedup of 4. 5x over dense A100 baseline.
no code implementations • 1 Aug 2017 • Vishaal Jatav, Ravi Teja, Srini Bharadwaj, Venkat Srinivasan
This paper outlines the results of sentence level linguistics based rules for improving part-of-speech tagging.