1 code implementation • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2022 • Hiromu Yakura, Kento Watanabe, Masataka Goto
To acquire robust representations in an unsupervised manner, regular self-supervised contrastive learning trains neural networks to make the feature representation of a sample close to those of its computationally transformed versions.
no code implementations • 19 May 2021 • Hiromu Yakura, Yuki Koyama, Masataka Goto
To enable this, we introduce a framework that utilizes an existing pretrained model for style transfer to calculate a perceptual style distance to the reference sample and uses black-box optimization to find the parameters that minimize this distance.
no code implementations • 21 Jun 2020 • Shota Sakaguchi, Jun Kato, Masataka Goto, Seiichi Uchida
In order to analyze the motion of lyric words, we first apply a state-of-the-art scene text detector and recognizer to each video frame.
no code implementations • 8 May 2020 • Yuki Koyama, Issei Sato, Masataka Goto
To help users respond to plane-search queries, we also propose using a gallery-based interface that provides options in the two-dimensional subspace arranged in an adaptive grid view.
no code implementations • 8 Apr 2020 • Takayuki Nakatsuka, Kazuyoshi Yoshii, Yuki Koyama, Satoru Fukayama, Masataka Goto, Shigeo Morishima
Specifically, we formulate a hierarchical generative model of poses and images by integrating a deep generative model of poses from pose features with that of images from poses and image features.
1 code implementation • NAACL 2018 • Kento Watanabe, Yuichiroh Matsubayashi, Satoru Fukayama, Masataka Goto, Kentaro Inui, Tomoyasu Nakano
This paper presents a novel, data-driven language model that produces entire lyrics for a given input melody.
no code implementations • 26 May 2017 • Kosetsu Tsukuda, Masataka Goto
Third, we carried out qualitative experiments and showed that taking addiction into account enables us to analyze music listening behavior from a new viewpoint in terms of how people listen to music according to the time of day, how an artist's songs are listened to by people, etc.
no code implementations • COLING 2016 • Kento Watanabe, Yuichiroh Matsubayashi, Naho Orita, Naoaki Okazaki, Kentaro Inui, Satoru Fukayama, Tomoyasu Nakano, Jordan Smith, Masataka Goto
This study proposes a computational model of the discourse segments in lyrics to understand and to model the structure of lyrics.