no code implementations • 22 Nov 2021 • Jing Fan, Xin Zhang, Sheng Zhang, Yan Pan, Lixiang Guo
In light of the success of transferring language models into NLP tasks, we ask whether the full BERT model is always the best and does it exist a simple but effective method to find the winning ticket in state-of-the-art deep neural networks without complex calculations.