no code implementations • 29 Apr 2018 • Haiyang Wang, Yong Tang, Ziyang Jia, Fei Ye
Second, our model connects each layer to the subsequent ones in a feed-forward fashion, which enhances the capability of the model to resist performance degeneration.