no code implementations • 16 Feb 2024 • Hossein Rajabzadeh, Mojtaba Valipour, Tianshu Zhu, Marzieh Tahaei, Hyock Ju Kwon, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh
Finetuning large language models requires huge GPU memory, restricting the choice to acquire Larger models.
1 code implementation • 3 Aug 2022 • Weiming Ren, Ruijing Zeng, Tongzi Wu, Tianshu Zhu, Rahul G. Krishnan
One of the challenges in curriculum learning is the design of curricula -- i. e., in the sequential design of tasks that gradually increase in difficulty.
no code implementations • 3 Jun 2022 • Runqing Zhang, Tianshu Zhu
Attention Augmented Convolutional Network (AANet) is a mixture of convolution and self-attention, which increases the accuracy of a typical ResNet.