Search Results for author: Minz Won

Found 15 papers, 10 papers with code

The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation

1 code implementation • 16 Nov 2023 • Ilaria Manco, Benno Weck, Seungheon Doh, Minz Won, Yixiao Zhang, Dmitry Bogdanov, Yusong Wu, Ke Chen, Philip Tovstogan, Emmanouil Benetos, Elio Quinton, György Fazekas, Juhan Nam

We introduce the Song Describer dataset (SDD), a new crowdsourced corpus of high-quality audio-caption pairs, designed for the evaluation of music-and-language models.

Music Captioning Music Generation +2

112

Paper
Code

A Foundation Model for Music Informatics

1 code implementation • 6 Nov 2023 • Minz Won, Yun-Ning Hung, Duc Le

This paper investigates foundation models tailored for music informatics, a domain currently challenged by the scarcity of labeled data and generalization issues.

Information Retrieval Music Information Retrieval +2

123

Paper
Code

Scaling Up Music Information Retrieval Training with Semi-Supervised Learning

no code implementations • 2 Oct 2023 • Yun-Ning Hung, Ju-Chiang Wang, Minz Won, Duc Le

To our knowledge, this is the first attempt to study the effects of scaling up both model and training data for a variety of MIR tasks.

Information Retrieval Music Information Retrieval +1

Paper
Add Code

Textless Speech-to-Music Retrieval Using Emotion Similarity

no code implementations • 19 Mar 2023 • Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam

We introduce a framework that recommends music based on the emotions of speech.

Retrieval

Paper
Add Code

Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training

no code implementations • 1 Feb 2023 • Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Ju-Chiang Wang, Yun-Ning Hung, Dorien Herremans

Jointist consists of an instrument recognition module that conditions the other two modules: a transcription module that outputs instrument-specific piano rolls, and a source separation module that utilizes instrument information and transcription results.

Chord Recognition Instrument Recognition +1

Paper
Add Code

Toward Universal Text-to-Music Retrieval

3 code implementations • 26 Nov 2022 • Seungheon Doh, Minz Won, Keunwoo Choi, Juhan Nam

This paper introduces effective design choices for text-to-music retrieval systems.

Music Classification Retrieval +2

104

Paper
Code

Jointist: Joint Learning for Multi-instrument Transcription and Its Applications

no code implementations • 22 Jun 2022 • Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Amy Hung, Ju-Chiang Wang, Dorien Herremans

However, its novelty necessitates a new perspective on how to evaluate such a model.

Ranked #1 on Music Transcription on Slakh2100

Chord Recognition Instrument Recognition +1

Paper
Add Code

Emotion Embedding Spaces for Matching Music to Stories

1 code implementation • 26 Nov 2021 • Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra

Content creators often use music to enhance their stories, as it can be a powerful tool to convey emotion.

Cross-Modal Retrieval Metric Learning +1

Paper
Code

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

1 code implementation • 23 Nov 2021 • Minz Won, Janne Spijkervet, Keunwoo Choi

The target audience for this web book is researchers and practitioners who are interested in state-of-the-art music classification research and building real-world applications.

Classification Information Retrieval +4

133

Paper
Code

Multimodal Metric Learning for Tag-based Music Retrieval

1 code implementation • 30 Oct 2020 • Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra

In this paper, we investigate three ideas to successfully introduce multimodal metric learning for tag-based music retrieval: elaborate triplet sampling, acoustic and cultural music information, and domain-specific word embeddings.

Cross-Modal Retrieval Metric Learning +4