no code implementations • Findings (EMNLP) 2021 • Akhil Kedia, Sai Chetan Chinthakindi, WonHo Ryu
Inspired by these approaches for a single task setting, this paper proposes to use the finite differences first-order algorithm to calculate this gradient from dot-product of gradients, allowing explicit control on the weightage of this component relative to standard gradients.
Ranked #1 on Text Summarization on GigaWord (using extra training data)