2 code implementations • 11 Jul 2023 • Changshang Xue
It discusses local encoders such as ASCII and GB-2312, which encode specific characters into shorter bytes, and universal encoders like UTF-8 and UTF-16, which can encode the complete Unicode set with greater space requirements and are gaining widespread acceptance.
1 code implementation • 14 Jun 2023 • Changshang Xue, Xiande Zhong, Xiaoqing Liu
In recent years, a significant number of high-quality pretrained models have emerged, greatly impacting Natural Language Understanding (NLU), Natural Language Generation (NLG), and Text Representation tasks.