Search Results for author: Masataka Goto

Found 14 papers, 2 papers with code

Self-Supervised Contrastive Learning for Singing Voices

1 code implementation IEEE/ACM Transactions on Audio, Speech, and Language Processing 2022 Hiromu Yakura, Kento Watanabe, Masataka Goto

To acquire robust representations in an unsupervised manner, regular self-supervised contrastive learning trains neural networks to make the feature representation of a sample close to those of its computationally transformed versions.

Contrastive Learning Singer Identification +1

Taste or Addiction?: Using Play Logs to Infer Song Selection Motivation

no code implementations26 May 2017 Kosetsu Tsukuda, Masataka Goto

Third, we carried out qualitative experiments and showed that taking addiction into account enables us to analyze music listening behavior from a new viewpoint in terms of how people listen to music according to the time of day, how an artist's songs are listened to by people, etc.

MirrorNet: A Deep Bayesian Approach to Reflective 2D Pose Estimation from Human Images

no code implementations8 Apr 2020 Takayuki Nakatsuka, Kazuyoshi Yoshii, Yuki Koyama, Satoru Fukayama, Masataka Goto, Shigeo Morishima

Specifically, we formulate a hierarchical generative model of poses and images by integrating a deep generative model of poses from pose features with that of images from poses and image features.

2D Pose Estimation Pose Estimation

Sequential Gallery for Interactive Visual Design Optimization

no code implementations8 May 2020 Yuki Koyama, Issei Sato, Masataka Goto

To help users respond to plane-search queries, we also propose using a gallery-based interface that provides options in the two-dimensional subspace arranged in an adaptive grid view.

Bayesian Optimization

Lyric Video Analysis Using Text Detection and Tracking

no code implementations21 Jun 2020 Shota Sakaguchi, Jun Kato, Masataka Goto, Seiichi Uchida

In order to analyze the motion of lyric words, we first apply a state-of-the-art scene text detector and recognizer to each video frame.

Clustering Dynamic Time Warping +2

Tool- and Domain-Agnostic Parameterization of Style Transfer Effects Leveraging Pretrained Perceptual Metrics

no code implementations19 May 2021 Hiromu Yakura, Yuki Koyama, Masataka Goto

To enable this, we introduce a framework that utilizes an existing pretrained model for style transfer to calculate a perceptual style distance to the reference sample and uses black-box optimization to find the parameters that minimize this distance.

Style Transfer

CatAlyst: Domain-Extensible Intervention for Preventing Task Procrastination Using Large Generative Models

no code implementations11 Feb 2023 Riku Arakawa, Hiromu Yakura, Masataka Goto

The prompt can function by drawing their interest and lowering the hurdle for resumption even when the generated continuation is insufficient to substitute their work, while recent human-AI collaboration research aiming at work substitution depends on a stable high accuracy.

IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models

no code implementations24 Jul 2023 Hiromu Yakura, Masataka Goto

However, compared to the image domain, gaining a clear understanding of the space of possible music audios is difficult because users cannot listen to the variations of the generated audios simultaneously.

Audio Generation Music Generation

A Computational Evaluation Framework for Singable Lyric Translation

no code implementations26 Aug 2023 Haven Kim, Kento Watanabe, Masataka Goto, Juhan Nam

Lyric translation plays a pivotal role in amplifying the global resonance of music, bridging cultural divides, and fostering universal connections.

Semantic Similarity Semantic Textual Similarity +1

Cannot find the paper you are looking for? You can Submit a new open access paper.