Benchmarking Music Demixing Models for Deep Drum Source Separation

Audio Samples

The following audio samples are provided as supplementary material for the article Benchmarking Music Demixing Models for Deep Drum Source Separation, authored by A. I. Mezza, R. Giampiccolo, A. Bernardini, and A. Sarti, and presented at the IEEE 5th International Symposium on the Internet of Sounds.

The first three examples are taken from the StemGMD dataset, which is composed of both mixtures and ground-truth stems. The last, instead, is taken from the musdb18hq dataset, which provides only mixtures, and thus no ground-truth tracks are present.



Audio Samples


1_funk-groove1_138_beat_4-4

Kick

Snare

Toms

Hi-Hat

Cymbals

Ground-truth

LarsNet

MDX23C (4096)

MDX23C (8192)

HT-Demucs

BS-HT-Demucs (4 bands)

BS-HT-Demucs (8 bands)

BS-RoFormer




2_funk-groove2_105_beat_4-4

Kick

Snare

Toms

Hi-Hat

Cymbals

Ground-truth

LarsNet

MDX23C (4096)

MDX23C (8192)

HT-Demucs

BS-HT-Demucs (4 bands)

BS-HT-Demucs (8 bands)

BS-RoFormer




3_soul-groove3_86_beat_4-4

Kick

Snare

Toms

Hi-Hat

Cymbals

Ground-truth

LarsNet

MDX23C (4096)

MDX23C (8192)

HT-Demucs

BS-HT-Demucs (4 bands)

BS-HT-Demucs (8 bands)

BS-RoFormer




musdb18hq_test_drums_Mu_TooBright



Kick

Snare

Toms

Hi-Hat

Cymbals

Ground-truth

N/A

N/A

N/A

N/A

N/A

LarsNet

MDX23C (4096)

MDX23C (8192)

HT-Demucs

BS-HT-Demucs (4 bands)

BS-HT-Demucs (8 bands)

BS-RoFormer