2 code implementations • 15 Mar 2024 • Xuanlei Zhao, Shenggan Cheng, Chang Chen, Zangwei Zheng, Ziming Liu, Zheming Yang, Yang You
Scaling multi-dimensional transformers to long sequences is indispensable across various domains.
Text Generation Video Generation