no code implementations • 29 Apr 2019 • Assaf Hoogi, Brian Wilcox, Yachee Gupta, Daniel L. Rubin
Then, the Self-Attention layer learns to suppress irrelevant regions based on features analysis and highlights salient features useful for a specific task.