Search Results for author: Gus Xia

Found 50 papers, 35 papers with code

Language Model Mapping in Multimodal Music Learning: A Grand Challenge Proposal

no code implementations1 Mar 2025 Daniel Chin, Gus Xia

We have seen remarkable success in representation learning and language models (LMs) using deep neural networks.

cross-modal alignment Language Modeling +2

Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models

1 code implementation11 Feb 2025 Atharva Mehta, Shivam Chauhan, Amirbek Djanibekov, Atharva Kulkarni, Gus Xia, Monojit Choudhury

The advent of Music-Language Models has greatly enhanced the automatic music generation capability of AI systems, but they are also limited in their coverage of the musical genres and cultures of the world.

All Music Generation +2

Exploring GPT's Ability as a Judge in Music Understanding

1 code implementation22 Jan 2025 Kun Fang, Ziyu Wang, Gus Xia, Ichiro Fujinaga

We convert the music data to symbolic inputs and evaluate LLMs' ability in detecting annotation errors in three key MIR tasks: beat tracking, chord extraction, and key estimation.

Beat Tracking Information Retrieval +2

CalliffusionV2: Personalized Natural Calligraphy Generation with Flexible Multi-modal Control

no code implementations3 Oct 2024 Qisheng Liao, Liang Li, Yulang Fei, Gus Xia

In this paper, we introduce CalliffusionV2, a novel system designed to produce natural Chinese calligraphy with flexible multi-modal control.

Few-Shot Learning

Unifying Multitrack Music Arrangement via Reconstruction Fine-Tuning and Efficient Tokenization

no code implementations27 Aug 2024 Longshen Ou, Jingwei Zhao, Ziyu Wang, Gus Xia, Ye Wang

Automatic music arrangement streamlines the creation of musical variants for composers and arrangers, reducing reliance on extensive music expertise.

Language Modeling Language Modelling +1

Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints

no code implementations4 Jul 2024 Yuxuan Wu, Ziyu Wang, Bhiksha Raj, Gus Xia

We contribute an unsupervised method that effectively learns from raw observation and disentangles its latent space into content and style representations.

Decoder Disentanglement +1

Proceedings of The second international workshop on eXplainable AI for the Arts (XAIxArts)

no code implementations20 Jun 2024 Nick Bryan-Kinns, Corey Ford, Shuoyang Zheng, Helen Kennedy, Alan Chamberlain, Makayla Lewis, Drew Hemment, Zijin Li, Qiong Wu, Lanxi Xiao, Gus Xia, Jeba Rezwana, Michael Clemens, Gabriel Vigliensoni

This second international workshop on explainable AI for the Arts (XAIxArts) brought together a community of researchers in HCI, Interaction Design, AI, explainable AI (XAI), and digital arts to explore the role of XAI for the Arts.

Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning

2 code implementations28 May 2024 Yixiao Zhang, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco A. Martínez-Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

Recent advances in text-to-music editing, which employ text queries to modify music (e. g.\ by changing its style or adjusting instrumental components), present unique challenges and opportunities for AI-assisted music creation.

Human-Centered LLM-Agent User Interface: A Position Paper

1 code implementation19 May 2024 Daniel Chin, Yuxuan Wang, Gus Xia

Large Language Model (LLM) -in-the-loop applications have been shown to effectively interpret the human user's commands, make plans, and operate external tools/systems accordingly.

Language Modeling Language Modelling +2

Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models

2 code implementations16 May 2024 Ziyu Wang, Lejun Min, Gus Xia

A cascaded diffusion model is trained to model the hierarchical language, where each level is conditioned on its upper levels.

Music Generation

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models

1 code implementation9 Feb 2024 Yixiao Zhang, Yukara Ikemiya, Gus Xia, Naoki Murata, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon

This paper introduces a novel approach to the editing of music generated by such models, enabling the modification of specific attributes, such as genre, mood and instrument, while maintaining other aspects unchanged.

Music Generation Text-to-Music Generation

Content-based Controls For Music Large Language Modeling

1 code implementation26 Oct 2023 Liwei Lin, Gus Xia, Junyan Jiang, Yixiao Zhang

We aim to further equip the models with direct and content-based controls on innate music languages such as pitch, chords and drum track.

Language Modeling Language Modelling +3

Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling

1 code implementation25 Oct 2023 Jingwei Zhao, Gus Xia, Ziyu Wang, Ye Wang

In the realm of music AI, arranging rich and structured multi-track accompaniments from a simple lead sheet presents significant challenges.

Computational Efficiency Disentanglement +4

Motif-Centric Representation Learning for Symbolic Music

1 code implementation19 Sep 2023 Yuxuan Wu, Roger B. Dannenberg, Gus Xia

Music motif, as a conceptual building block of composition, is crucial for music structure analysis and automatic composition.

Contrastive Learning Diversity +4

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

1 code implementation19 Jul 2023 Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao

We propose Polyffusion, a diffusion model that generates polyphonic music scores by regarding music as image-like piano roll representations.

Music Generation

On the Effectiveness of Speech Self-supervised Learning for Music

no code implementations11 Jul 2023 Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger Dannenberg, Yike Guo, Jie Fu

Our findings suggest that training with music data can generally improve performance on MIR tasks, even when models are trained using paradigms designed for speech.

Information Retrieval Music Information Retrieval +2

Calliffusion: Chinese Calligraphy Generation and Style Transfer with Diffusion Modeling

no code implementations30 May 2023 Qisheng Liao, Gus Xia, Zhinuo Wang

In this paper, we propose Calliffusion, a system for generating high-quality Chinese calligraphy using diffusion models.

Denoising Style Transfer +1

Learning Interpretable Low-dimensional Representation via Physical Symmetry

1 code implementation NeurIPS 2023 Xuanjie Liu, Daniel Chin, Yichen Huang, Gus Xia

We have recently seen great progress in learning interpretable music representations, ranging from basic factors, such as pitch and timbre, to high-level concepts, such as chord and texture.

counterfactual Time Series

Vis2Mus: Exploring Multimodal Representation Mapping for Controllable Music Generation

1 code implementation10 Nov 2022 Runbang Zhang, Yixiao Zhang, Kai Shao, Ying Shan, Gus Xia

In this study, we explore the representation mapping from the domain of visual arts to the domain of music, with which we can use visual arts as an effective handle to control music generation.

Music Generation Representation Learning +1

Self-Supervised Hierarchical Metrical Structure Modeling

1 code implementation31 Oct 2022 Junyan Jiang, Gus Xia

We propose a novel method to model hierarchical metrical structures for both symbolic music and audio signals in a self-supervised manner with minimal domain knowledge.

Modeling Perceptual Loudness of Piano Tone: Theory and Applications

1 code implementation21 Sep 2022 Yang Qu, Yutian Qin, Lecheng Chao, Hangkai Qian, Ziyu Wang, Gus Xia

The relationship between perceptual loudness and physical attributes of sound is an important subject in both computer music and psychoacoustics.

Learning Hierarchical Metrical Structure Beyond Measures

1 code implementation21 Sep 2022 Junyan Jiang, Daniel Chin, Yixiao Zhang, Gus Xia

In this paper, we explore a data-driven approach to automatically extract hierarchical metrical structures from scores.

Information Retrieval Music Information Retrieval +1

AccoMontage2: A Complete Harmonization and Accompaniment Arrangement System

1 code implementation1 Sep 2022 Li Yi, Haochen Hu, Jingwei Zhao, Gus Xia

We propose AccoMontage2, a system capable of doing full-length song harmonization and accompaniment arrangement based on a lead melody.

Retrieval Template Matching

Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model

1 code implementation24 Aug 2022 Yixiao Zhang, Junyan Jiang, Gus Xia, Simon Dixon

Lyric interpretations can help people understand songs and their lyrics quickly, and can also make it easier to manage, retrieve and discover songs efficiently from the growing mass of music archives.

Language Modeling Language Modelling +1

Learning long-term music representations via hierarchical contextual constraints

no code implementations13 Feb 2022 Shiqi Wei, Gus Xia

Learning symbolic music representations, especially disentangled representations with probabilistic interpretations, has been shown to benefit both music understanding and generation.

Contrastive Learning Disentanglement

AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer

1 code implementation25 Aug 2021 Jingwei Zhao, Gus Xia

Accompaniment arrangement is a difficult music generation task involving intertwined constraints of melody, harmony, texture, and music structure.

Music Generation Style Transfer

A Unified Model for Zero-shot Music Source Separation, Transcription and Synthesis

1 code implementation7 Aug 2021 Liwei Lin, Qiuqiang Kong, Junyan Jiang, Gus Xia

We propose a unified model for three inter-related tasks: 1) to \textit{separate} individual sound sources from a mixed music audio, 2) to \textit{transcribe} each sound source to MIDI notes, and 3) to\textit{ synthesize} new pieces based on the timbre of separated sources.

Decoder Disentanglement +3

Learning Interpretable Representation for Controllable Polyphonic Music Generation

2 code implementations17 Aug 2020 Ziyu Wang, Dingsu Wang, Yixiao Zhang, Gus Xia

While deep generative models have become the leading methods for algorithmic composition, it remains a challenging problem to control the generation process because the latent variables of most deep-learning models lack good interpretability.

Disentanglement Music Generation +1

PIANOTREE VAE: Structured Representation Learning for Polyphonic Music

2 code implementations17 Aug 2020 Ziyu Wang, Yiyi Zhang, Yixiao Zhang, Junyan Jiang, Ruihan Yang, Junbo Zhao, Gus Xia

The dominant approach for music representation learning involves the deep unsupervised model family variational autoencoder (VAE).

Music Generation Representation Learning

POP909: A Pop-song Dataset for Music Arrangement Generation

1 code implementation17 Aug 2020 Ziyu Wang, Ke Chen, Junyan Jiang, Yiyi Zhang, Maoran Xu, Shuqi Dai, Xianbin Gu, Gus Xia

The main body of the dataset contains the vocal melody, the lead instrument melody, and the piano accompaniment for each song in MIDI format, which are aligned to the original audio files.

Music Generation

Word Representation for Rhythms

1 code implementation21 Jul 2020 Tongyu Lu, Lyucheng Yan, Gus Xia

This paper proposes a word representation strategy for rhythm patterns.

Rhythm

Continuous Melody Generation via Disentangled Short-Term Representations and Structural Conditions

1 code implementation5 Feb 2020 Ke Chen, Gus Xia, Shlomo Dubnov

Automatic music generation is an interdisciplinary research topic that combines computational creativity and semantic analysis of music to create automatic machine improvisations.

Disentanglement Music Generation +1

Deep Music Analogy Via Latent Representation Disentanglement

3 code implementations9 Jun 2019 Ruihan Yang, Dingsu Wang, Ziyu Wang, Tianyao Chen, Junyan Jiang, Gus Xia

Analogy-making is a key method for computer algorithms to generate both natural and creative music pieces.

Disentanglement Rhythm

Inspecting and Interacting with Meaningful Music Representations using VAE

no code implementations18 Apr 2019 Ruihan Yang, Tianyao Chen, Yiyi Zhang, Gus Xia

Variational Autoencoders(VAEs) have already achieved great results on image generation and recently made promising progress on music generation.

Disentanglement Image Generation +2

A Framework for Automated Pop-song Melody Generation with Piano Accompaniment Arrangement

no code implementations28 Dec 2018 Ziyu Wang, Gus Xia

Second, the melody generation model generates the lead melody and other voices (melody lines) of the accompaniment using seasonal ARMA (Autoregressive Moving Average) processes.

The Effect of Explicit Structure Encoding of Deep Neural Networks for Symbolic Music Generation

1 code implementation20 Nov 2018 Ke Chen, Weilin Zhang, Shlomo Dubnov, Gus Xia, Wei Li

With recent breakthroughs in artificial neural networks, deep generative models have become one of the leading techniques for computational creativity.

Music Generation

Melodic Phrase Segmentation By Deep Neural Networks

no code implementations14 Nov 2018 Yixing Guan, Jinyu Zhao, Yiqin Qiu, Zheng Zhang, Gus Xia

Automated melodic phrase detection and segmentation is a classical task in content-based music information retrieval and also the key towards automated music structure analysis.

Information Retrieval Music Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.