no code implementations • 13 Dec 2023 • Lionel Wong, Jiayuan Mao, Pratyusha Sharma, Zachary S. Siegel, Jiahai Feng, Noa Korneev, Joshua B. Tenenbaum, Jacob Andreas
Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand.
no code implementations • 12 Jun 2022 • Tomer Galanti, Zachary S. Siegel, Aparna Gupte, Tomaso Poggio
We study the bias of Stochastic Gradient Descent (SGD) to learn low-rank weight matrices when training deep neural networks.