Dancing with data: Wrangling musical datasets for unbiased insights

Research output: Contribution to conferenceAbstractAcademic

Abstract

For the analysis of symbolic corpora of early music we depend on existing datasets, such as the Josquin Research Project and the
CantusCorpus, or encodings in community-created resources like IMSLP and CPDL. The choice of compositions presented in these
datasets is highly dependent on the scope of the project and/or the preferences of the contributors. As a consequence, they exhibit a
selection bias that makes it hard to use them to answer a variety of musicological questions. To mitigate this problem, we propose a
method, using RISM, DIAMM among other resources to compile a dataset that is more representative of the actual repertoire.
Original languageEnglish
Pages1-1
Number of pages1
Publication statusPublished - 6 Jul 2024
EventMedRen conference: International Medieval and Renaissance Music Conference - Escuela Técnica Superior de Arquitectura, Granada, Spain
Duration: 6 Jul 20249 Jul 2024
https://www.medren2024.com/

Conference

ConferenceMedRen conference
Abbreviated titleMedRen
Country/TerritorySpain
CityGranada
Period6/07/249/07/24
Internet address

Keywords

  • Renaissance Music
  • computational musicology
  • corpus creation

Fingerprint

Dive into the research topics of 'Dancing with data: Wrangling musical datasets for unbiased insights'. Together they form a unique fingerprint.

Cite this