Search Results for author: Minz Won

Found 15 papers, 10 papers with code

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

1 code implementation16 Nov 2023 Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam

We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.

Music Captioning Music Generation +2

A Foundation Model for Music Informatics

1 code implementation6 Nov 2023 Minz Won, Yun-Ning Hung, Duc Le

This paper investigates foundation models tailored for music informatics, a domain currently challenged by the scarcity of labeled data and generalization issues.

Information Retrieval Music Information Retrieval +2

Scaling Up Music Information Retrieval Training with Semi-Supervised Learning

no code implementations2 Oct 2023 Yun-Ning Hung, Ju-Chiang Wang, Minz Won, Duc Le

To our knowledge, this is the first attempt to study the effects of scaling up both model and training data for a variety of MIR tasks.

Information Retrieval Music Information Retrieval +1

Textless Speech-to-Music Retrieval Using Emotion Similarity

no code implementations19 Mar 2023 Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam

We introduce a framework that recommends music based on the emotions of speech.

Retrieval

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training

no code implementations1 Feb 2023 Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Ju-Chiang Wang, Yun-Ning Hung, Dorien Herremans

Jointist consists of an instrument recognition module that conditions the other two modules: a transcription module that outputs instrument-specific piano rolls, and a source separation module that utilizes instrument information and transcription results.

Chord Recognition Instrument Recognition +1

Toward Universal Text-to-Music Retrieval

3 code implementations26 Nov 2022 Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam

This paper introduces effective design choices for text-to-music retrieval systems.

Music Classification Retrieval +2

Emotion Embedding Spaces for Matching Music to Stories

1 code implementation26 Nov 2021 Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra

Content creators often use music to enhance their stories, as it can be a powerful tool to convey emotion.

Cross-Modal Retrieval Metric Learning +1

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

1 code implementation23 Nov 2021 Minz Won, Janne Spijkervet, Keunwoo Choi

The target audience for this web book is researchers and practitioners who are interested in state-of-the-art music classification research and building real-world applications.

Classification Information Retrieval +4

Multimodal Metric Learning for Tag-based Music Retrieval

1 code implementation30 Oct 2020 Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra

In this paper, we investigate three ideas to successfully introduce multimodal metric learning for tag-based music retrieval: elaborate triplet sampling, acoustic and cultural music information, and domain-specific word embeddings.

Cross-Modal Retrieval Metric Learning +4

Mood Classification Using Listening Data

1 code implementation22 Oct 2020 Filip Korzeniowski, Oriol Nieto, Matthew McCallum, Minz Won, Sergio Oramas, Erik Schmidt

The mood of a song is a highly relevant feature for exploration and recommendation in large collections of music.

Classification General Classification

Evaluation of CNN-based Automatic Music Tagging Models

7 code implementations1 Jun 2020 Minz Won, Andres Ferraro, Dmitry Bogdanov, Xavier Serra

Recent advances in deep learning accelerated the development of content-based automatic music tagging systems.

Music Auto-Tagging Audio and Speech Processing Sound

Visualizing and Understanding Self-attention based Music Tagging

no code implementations11 Nov 2019 Minz Won, Sanghyuk Chun, Xavier Serra

Recently, we proposed a self-attention based music tagging model.

Sound Audio and Speech Processing

Toward Interpretable Music Tagging with Self-Attention

2 code implementations12 Jun 2019 Minz Won, Sanghyuk Chun, Xavier Serra

In addition, we demonstrate the interpretability of the proposed architecture with a heat map visualization.

Sound Audio and Speech Processing

Transfer Learning of Artist Group Factors to Musical Genre Classification

1 code implementation5 May 2018 Jaehun Kim, Minz Won, Xavier Serra, Cynthia C. S. Liem

The automated recognition of music genres from audio information is a challenging problem, as genre labels are subjective and noisy.

Classification General Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.