Search Results for author: Guo Yang

Found 2 papers, 0 papers with code

Dynamic Stashing Quantization for Efficient Transformer Training

no code implementations9 Mar 2023 Guo Yang, Daniel Lo, Robert Mullins, Yiren Zhao

Large Language Models (LLMs) have demonstrated impressive performance on a range of Natural Language Processing (NLP) tasks.

Quantization

Robust Continual Learning through a Comprehensively Progressive Bayesian Neural Network

no code implementations27 Feb 2022 Guo Yang, Cheryl Sze Yin Wong, Ramasamy Savitha

Thus, as the data for new task streams in, sufficient neurons are added to the network such that the total number of neurons in each layer of the network, including the shared representations with previous tasks and individual task related representation, are equal for all tasks.

Continual Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.