no code implementations • 9 Oct 2020 • Jiarui Fang, Yang Yu, Chengduo Zhao, Jie Zhou
This paper presents TurboTransformers, a transformer serving system that consists of a computing runtime and a serving framework and is designed to address the above challenges.