Skip to content
djbpitt edited this page Jul 8, 2014 · 6 revisions

Welcome to the collatex wiki!

TEI input:

Discussion 2014-07-08 (Lausanne) David J. Birnbaum / Ronald Haentjens Dekker:

  • witness in each separate document
  • take the <body> element
  • get rid of the hierarchy… by converting tags into ranges or milestones
  • tokenize on whitespace and punctuation
  • create normalized version
  • collate
  • generate variant graph
  • TEI output issue: you can't raise the hierarchy again because the collation markup introduces an overlapping hierarchy
  • Solution: not responsibility of CollateX to raise hierarchy again; output with the milestones in place (attach milestone to the nearest token - with "nearest" still to be defined)
Clone this wiki locally