no code implementations • 30 Jun 2020 • Duo Xu, Mohit Agarwal, Ekansh Gupta, Faramarz Fekri, Raghupathy Sivakumar
Providing Reinforcement Learning (RL) agents with human feedback can dramatically improve various aspects of learning.
no code implementations • 17 Jun 2020 • Bidisha Samanta, Mohit Agarwal, Niloy Ganguly
DE-VAE achieves better control of sentiment as an attribute while preserving the content by learning a suitable lossless transformation network from the disentangled sentiment space to the desired entangled representation.
no code implementations • ICLR 2020 • Duo Xu, Mohit Agarwal, Raghupathy Sivakumar, Faramarz Fekri
Building atop the baseline, we then make the following novel contributions in our work: (i) We argue that the definition of error-potentials is generalizable across different environments; specifically we show that error-potentials of an observer can be learned for a specific game, and the definition used as-is for another game without requiring re-learning of the error-potentials.