A core challenge for an agent learning to interact with the world is to predict how its actions affect objects in its environment.
We study the problem of synthesizing a number of likely future frames from a single input image.
Several mechanisms to focus attention of a neural network on selected parts of its input or memory have been used successfully in deep learning models in recent years.
#26 best model for Machine Translation on WMT2014 English-French
The move from hand-designed features to learned features in machine learning has been wildly successful.
We study the problem of 3D object generation.
This metric better reflects perceptually similarity of images and thus leads to better results.