Annotated Beethoven Corpus (ABC) 2.1
Update v2.1: This release is identical to 2.0 with the difference that the .zenodo.json
file (that had been generated by Zenodo!) has been freed from the key related_identifiers
which made the Zenodo validator fail.
4.5 years after its first publication (see below), this is the first revised version of the ABC.
In the meantime, the DCML corpus initiative has advanced
and this update has as a main goal to harmonize the ABC with all other annotated corpora that have been and
will be published. This includes the following changes:
Upgrade to MuseScore 3
- All scores have been converted to MuseScore 3.6.2 format and can be found in the folder
MS3
. - The harmony labels have been moved to MuseScore's "Roman Numeral Analysis" layer of the left-hand staff.
New folder and file structure
- The
code
folder was removed since the old Julia code has been replaced by the Python library ms3. - The MuseScore files are contained in
MS3
and for each movement there are a couple of other files available, identified by their file names:- The folder
notes
contains one TSV file per movement with all note heads (not every note head represents an onset). - The folder
measures
contains one TSV file per movement with all measure-like units - The folder
harmonies
contains one TSV file per movement with all harmony annotation labels - The folder
reviewed
contains two files per movement:- A copy of the score where all out-of-label notes have been colored in red; additionally, modified labels ( w.r.t. v1.0) are shown in these files in a diff-like manner (removed in red, added in green).
- A copy of the harmonies TSV with six added columns that reflect the coloring of out-of-label notes ("coloring reports")
- The file
warnings.log
lists those labels where over 60 % of notes within the label's segment are not expressed
by the label. Potentially, most of them are semantically incorrect.
- The folder
The folders are automatically kept up to date by the dcml_corpus_workflow
which calls the command ms3 review -M -N -X -D
on every change.
Information on what the columns in the TSV files contain can be found in the documentation for ms3.
Changes to the data
A full diff of all changes applied with version 2.0 can be seen here.
- The scores have been aligned by tunescribers.com with the Henle and Breitkopf editions
provided in thepdf
folder and indicated in its README. - Systematic changes to the harmony labels:
- With the harmony labels moved to the Roman Numeral Analysis layer, no initial
.
are needed anymore. V9
is not part of the DCML harmony annotation standard and has been replaced byV7(9)
orV7(+9)
.- Corrected
vii
chords in major keys that had often been wrongly labeled as#vii
.
- With the harmony labels moved to the Roman Numeral Analysis layer, no initial
- Obvious errors have been corrected in many places. Thanks to @craigsapp, @lancioni, @malcolmsailor, @MarkGotham, @napulen and @tymoczko
for reporting quite a few of them!