FlowGAN generates optical flow, which contains only the edges and motion of the videos to be generated.
Automatic melody generation for pop music has been a long-time aspiration for both AI researchers and musicians.
Sound; Multimedia; Audio and Speech Processing
Recent advances in image captioning have led to increasing interest in video captioning.
This new descriptor is calculated by applying discriminative weights learned from one network to a convolutional layer of the other network.
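One way to picture this kind of cross-network weighting is the minimal sketch below. It is an illustration under stated assumptions, not the authors' method: the function name `cross_weighted_descriptor`, the per-channel form of the weights, and the use of spatial average pooling are all hypothetical choices made for the example.

```python
def cross_weighted_descriptor(feature_map, weights):
    """Weight each channel of a convolutional feature map and pool it.

    feature_map: list of C channels, each an H x W nested list of floats
                 (assumed to come from a convolutional layer of network B).
    weights:     list of C floats (assumed to be learned by network A).
    Returns a list of C floats: one weighted, spatially averaged value
    per channel.
    """
    if len(feature_map) != len(weights):
        raise ValueError("need exactly one weight per channel")

    descriptor = []
    for channel, weight in zip(feature_map, weights):
        # Flatten the H x W grid, then average over all spatial positions.
        values = [value for row in channel for value in row]
        descriptor.append(weight * sum(values) / len(values))
    return descriptor


# Toy input: 2 channels of size 2 x 2, with made-up weights.
toy_features = [
    [[1.0, 3.0], [5.0, 7.0]],
    [[2.0, 2.0], [2.0, 2.0]],
]
toy_weights = [0.5, 2.0]
toy_descriptor = cross_weighted_descriptor(toy_features, toy_weights)
```

In this toy setup, a channel that the first network considers discriminative contributes more to the pooled descriptor than one with a small weight.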
Ranked #10 on Action Classification on Toyota Smarthome dataset
The workflow of extracting features from images using convolutional neural networks (CNNs) and generating captions with recurrent neural networks (RNNs) has become a de facto standard for image captioning.
We present a novel dataset and a novel algorithm for recognizing activities of daily living (ADL) from a first-person wearable camera.