no code implementations • 20 Apr 2023 • Hao Zhang, Dan Qu, Keji Shao, Xukui Yang
In contrast to the general dropout method, which randomly drops neurons, DropDim drops part of the embedding dimensions.