no code implementations • 1 Jul 2024 • Scott H. Hawley
Recent years have witnessed significant progress in generative models for music, featuring diverse architectures that balance output quality, diversity, speed, and user control.
1 code implementation • 4 Jun 2024 • Scott H. Hawley, Austin R. Tackett
We investigate the construction of latent spaces through self-supervised learning to support semantically meaningful operations.
2 code implementations • 7 Feb 2024 • Zach Evans, CJ Carr, Josiah Taylor, Scott H. Hawley, Jordi Pons
Generating long-form 44. 1kHz stereo audio from text prompts can be computationally demanding.
Ranked #1 on
Text-to-Music Generation
on MusicCaps
(KL_passt metric)
no code implementations • 10 Apr 2023 • Scott H. Hawley, Christian J. Steinmetz
We investigate applying audio manipulations using pretrained neural network-based autoencoders as an alternative to traditional signal processing methods, since the former may provide greater semantic or perceptual organization.
no code implementations • 23 Oct 2021 • Scott H. Hawley, Andrew C. Morrison, Grant S. Morgan
Besides improvements on previous metric scores by 10% or more, noteworthy in this project are the introduction of a segmentation-regression map for the entire drum surface yielding interference fringe counts comparable to those obtained via object detection, as well as the accelerated workflow for coordinating the data-cleaning-and-model-training feedback loop for rapid iteration allowing this project to be conducted on a timescale of only 18 days.
1 code implementation • 1 Feb 2021 • Scott H. Hawley, Andrew C. Morrison
We train an object detector built from convolutional neural networks to count interference fringes in elliptical antinode regions in frames of high-speed video recordings of transient oscillations in Caribbean steelpan drums illuminated by electronic speckle pattern interferometry (ESPI).
1 code implementation • 10 Jun 2020 • William Mitchell, Scott H. Hawley
Deep neural networks have shown promise for music audio signal processing applications, often surpassing prior approaches, particularly as end-to-end models in the waveform domain.
1 code implementation • 28 May 2019 • Scott H. Hawley, Benjamin Colburn, Stylianos I. Mimilakis
In this work we present a data-driven approach for predicting the behavior of (i. e., profiling) a given non-linear audio signal processing effect (henceforth "audio effect").
no code implementations • 25 Feb 2019 • Scott H. Hawley
Each of these factors is seen to present a 'moving target' for discussion, which poses a challenge for both technical specialists and non-practitioners of AI systems development (e. g., philosophers and theologians) to speak meaningfully given that the corpus of AI structures and capabilities evolves at a rapid pace.