Diffusion Models for Automatic Music Mixing

Xinyang Wu*, Andrew Horner

*Corresponding author for this work

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

Abstract

Music mixing is a process that involves fine-tuning the levels, dynamics, and frequency content of musical elements to ensure clarity and harmony in the final music production. In this paper, we present an automatic mixing system based on diffusion models to correct imbalances in music mixes. We manipulate the well mixed stems' short-time Fourier transform randomly to simulate the frequency and level imbalances commonly encountered in real-world scenarios. The difference between the imbalanced mix and the original mix is treated as noise for the diffusion model to predict, enabling the reverse denoising process to generate an automated mix. We evaluate our model's performance by calculating the signal-to-distortion ratio between the original and predicted mixes. These results are compared with baseline automatic mixing models, demonstrating significant improvements. Test set results in audio: https://aimg2025submission.github.io/diffmusicmix/

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Big Data, BigData 2024
EditorsWei Ding, Chang-Tien Lu, Fusheng Wang, Liping Di, Kesheng Wu, Jun Huan, Raghu Nambiar, Jundong Li, Filip Ilievski, Ricardo Baeza-Yates, Xiaohua Hu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3242-3247
Number of pages6
ISBN (Electronic)9798350362480
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Conference on Big Data, BigData 2024 - Washington, United States
Duration: 15 Dec 202418 Dec 2024

Publication series

NameProceedings - 2024 IEEE International Conference on Big Data, BigData 2024

Conference

Conference2024 IEEE International Conference on Big Data, BigData 2024
Country/TerritoryUnited States
CityWashington
Period15/12/2418/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • automatic music mixing
  • diffusion model
  • signal enhancing

Fingerprint

Dive into the research topics of 'Diffusion Models for Automatic Music Mixing'. Together they form a unique fingerprint.

Cite this