3 code implementations • 27 Nov 2017 • Katsunori Ohnishi, Shohei Yamamoto, Yoshitaka Ushiku, Tatsuya Harada
FlowGAN generates optical flow, which contains only the edge and motion of the videos to be begerated.
1 code implementation • 31 Oct 2017 • Andrew Shin, Leopold Crestel, Hiroharu Kato, Kuniaki Saito, Katsunori Ohnishi, Masataka Yamaguchi, Masahiro Nakawaki, Yoshitaka Ushiku, Tatsuya Harada
Automatic melody generation for pop music has been a long-time aspiration for both AI researchers and musicians.
Sound Multimedia Audio and Speech Processing
no code implementations • 18 May 2016 • Andrew Shin, Katsunori Ohnishi, Tatsuya Harada
Recent advances in image captioning task have led to increasing interests in video captioning task.
no code implementations • 29 Apr 2016 • Katsunori Ohnishi, Masatoshi Hidaka, Tatsuya Harada
This new descriptor is calculated by applying discriminative weights learned from one network to a convolutional layer of the other network.
Ranked #10 on
Action Classification
on Toyota Smarthome dataset
no code implementations • 30 Mar 2016 • Andrew Shin, Masataka Yamaguchi, Katsunori Ohnishi, Tatsuya Harada
The workflow of extracting features from images using convolutional neural networks (CNN) and generating captions with recurrent neural networks (RNN) has become a de-facto standard for image captioning task.
no code implementations • CVPR 2016 • Katsunori Ohnishi, Atsushi Kanehira, Asako Kanezaki, Tatsuya Harada
We present a novel dataset and a novel algorithm for recognizing activities of daily living (ADL) from a first-person wearable camera.