On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence - Analyse et Décision en Traitement du Signal et Images Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence

Axel Roebel
Jordi Pons Puig
  • Fonction : Auteur
Mathieu Lagrange

Résumé

This paper presents an investigation into the detection and classification of drum sounds in polyphonic music and drum loops using non-negative matrix deconvolution (NMD) and the Itakura Saito divergence. The Itakura Saito divergence has recently been proposed as especially appropriate for decomposing audio spectra due to the fact that it is scale invariant, but it has not yet been widely adopted. The article studies new contributions for audio event detection methods using the Itakura Saito divergence that improve efficiency and numerical stability, and simplify the generation of target pattern sets. A new approach for handling background sounds is proposed and moreover, a new detection criteria based on estimating the perceptual presence of the target class sources is introduced. Experimental results obtained for drum detection in polyphonic music and drum soli demonstrate the beneficial effects of the proposed extensions.
Fichier non déposé

Dates et versions

hal-01261256 , version 1 (25-01-2016)

Identifiants

Citer

Axel Roebel, Jordi Pons Puig, Marco Liuni, Mathieu Lagrange. On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence. Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. pp.414 - 418, ⟨10.1109/ICASSP.2015.7178002⟩. ⟨hal-01261256⟩
170 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More