Improving audio chord transcription by exploiting harmonic and metric knowledge

W.B. de Haas, J.P. Rodrigues Magalhães, F. Wiering

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

    Abstract

    We present a new system for chord transcription from polyphonic musical audio that uses domain-specific knowledge about tonal harmony and metrical position to improve chord transcription performance. Low-level pulse and spectral features are extracted from an audio source using the Vamp plugin architecture. Subsequently, for each beat-synchronised chromagram we compute a list of chord candidates matching that chromagram, together with the confidence in each candidate. When one particular chord candidate matches the chromagram significantly better than all others, this chord is selected to represent the segment. However, when multiple chords match the chromagram similarly well, we use a formal music theoretical model of tonal harmony to select the chord candidate that best matches the sequence based on the surrounding chords. In an experiment we show that exploiting metrical and harmonic knowledge yields statistically significant chord transcription improvements on a corpus of 217 Beatles, Queen, and Zweieck songs.
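The candidate-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the binary major/minor triad templates, the Euclidean distance, the `ratio` threshold, and the `resolve` callback (a stand-in for the harmony model, which is not reproduced here) are all assumptions made for illustration.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]
# Hypothetical chord vocabulary: major and minor triads only.
CHORD_SHAPES = {"maj": (0, 4, 7), "min": (0, 3, 7)}


def chord_templates():
    """Build a binary 12-bin pitch-class template for each triad."""
    templates = {}
    for root in range(12):
        for quality, intervals in CHORD_SHAPES.items():
            template = [0.0] * 12
            for interval in intervals:
                template[(root + interval) % 12] = 1.0
            templates[NOTE_NAMES[root] + ":" + quality] = template
    return templates


def normalise(vector):
    """Scale a vector to unit length (an all-zero vector is left unchanged)."""
    norm = math.sqrt(sum(x * x for x in vector)) or 1.0
    return [x / norm for x in vector]


def rank_candidates(chroma, templates):
    """Return (chord, distance) pairs, best match first.

    Smaller distance stands in for higher confidence in the candidate.
    """
    unit_chroma = normalise(chroma)
    scored = [
        (name, math.dist(unit_chroma, normalise(template)))
        for name, template in templates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])


def select_chord(chroma, templates, ratio=0.9, resolve=None):
    """Pick one chord for a beat-synchronised chroma vector.

    If only the best candidate lies within the (assumed) ratio threshold,
    it is returned directly. Otherwise the near-tied candidates are handed
    to `resolve`, a placeholder for a context-aware harmony model.
    """
    ranked = rank_candidates(chroma, templates)
    best_distance = ranked[0][1]
    close = [name for name, dist in ranked if dist * ratio <= best_distance]
    if len(close) == 1 or resolve is None:
        return ranked[0][0]
    return resolve(close)
```

For example, a chroma vector with energy only on C, E, and G has a single clear match, whereas one with energy only on C and G matches C major and C minor equally well, so the choice is deferred to `resolve`.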
    Original language: English
    Title of host publication: 13th International Society for Music Information Retrieval Conference (ISMIR 2012)
    Place of Publication: Porto
    Pages: 295-300
    Number of pages: 6
    Publication status: Published - 2012
