no code implementations • 26 Mar 2024 • Yingtao Shen, Minqing Sun, Jie Zhao, An Zou
Convolutional neural networks (CNNs) have achieved significant popularity, but their computational and memory intensity poses challenges for resource-constrained computing systems, particularly with the prerequisite of real-time performance.
no code implementations • 9 Jun 2022 • Xiangjie Li, Chenfei Lou, Zhengping Zhu, Yuchi Chen, Yingtao Shen, Yehan Ma, An Zou
Predictive Exit can forecast where the network will exit (i.e., establish the number of remaining layers to finish the inference), which effectively reduces the network computation cost by exiting on time without running every pre-placed exiting layer.
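
The idea of forecasting the exit point can be illustrated with a small toy sketch. This is not the authors' implementation: the function `run_with_predicted_exit`, the `confidence` and `predict_remaining` callables, and the threshold value are all hypothetical names and choices made up for illustration. The sketch only shows how predicting the number of remaining layers lets inference skip intermediate exit checks.

```python
from typing import Callable, List, Tuple


def run_with_predicted_exit(
    x: float,
    layers: List[Callable[[float], float]],
    confidence: Callable[[float], float],
    predict_remaining: Callable[[float], int],
    threshold: float = 0.9,
) -> Tuple[float, int, int]:
    """Toy early-exit loop (hypothetical, for illustration only).

    Returns (output, layers_run, exit_checks_evaluated).
    """
    next_check = 0  # index of the next layer after which an exit check runs
    checks = 0
    for i, layer in enumerate(layers):
        x = layer(x)
        if i < next_check:
            continue  # exit check skipped: the predictor said "not yet"
        checks += 1
        if confidence(x) >= threshold:
            return x, i + 1, checks  # exit early
        # Assumed predictor: estimated number of layers until the exit point.
        next_check = i + max(1, predict_remaining(x))
    return x, len(layers), checks


# Toy setup: each "layer" adds 1, confidence grows with the activation.
toy_layers = [lambda v: v + 1] * 10
toy_confidence = lambda v: v / 10

# Predictor that forecasts the exit point vs. one that checks every layer.
predicted = run_with_predicted_exit(
    0, toy_layers, toy_confidence, lambda v: 8 - v, threshold=0.75
)
every_layer = run_with_predicted_exit(
    0, toy_layers, toy_confidence, lambda v: 1, threshold=0.75
)
```

In this toy setup both runs exit after the same layer, but the run with the forecasting predictor evaluates far fewer exit checks, which is the kind of saving the abstract describes.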