1 code implementation • 12 Jun 2019 • Sam Shleifer, Eric Prokop
These "easy" proxies are higher quality than training on the full dataset for a reduced number of epochs (at equivalent computational cost) and, unexpectedly, higher quality than proxies constructed from the hardest examples.