1 code implementation • NeurIPS 2020 • Yuandong Tian, Qucheng Gong, Tina Jiang
Based on this, we propose Joint Policy Search (JPS), which iteratively improves joint policies of collaborative agents in imperfect-information games without re-evaluating the entire game.
1 code implementation • 31 May 2019 • Yuandong Tian, Tina Jiang, Qucheng Gong, Ari Morcos
We analyze the dynamics of training deep ReLU networks and their implications for generalization capability.