no code implementations • 22 Nov 2017 • John P. Dickerson, Karthik A. Sankararaman, Aravind Srinivasan, Pan Xu
Prior work addresses online bipartite matching markets, where agents arrive over time and are dynamically matched to a known set of disposable resources.
no code implementations • 15 Apr 2019 • Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein
Our results show that, for popular initialization techniques, increasing the width of neural networks leads to lower gradient confusion, and thus faster model training.
no code implementations • 25 Sep 2019 • Karthik A. Sankararaman, Soham De, Zheng Xu, W. Ronny Huang, Tom Goldstein
Through novel theoretical and experimental results, we show how the neural net architecture affects gradient confusion, and thus the efficiency of training.