This post continues from part 1. If you came here directly, please read part 1 first.
Change in data focus
The grant proposal said the prototype repository would contain four textual traditions: "the New Testament; Dante's Commedia and Monarchia; Chaucer's Canterbury Tales". Early in the project, I discovered that the New Testament data was not quite ready for processing in this way, but the other data was in good shape, with full TEI transcriptions and matching images. So the first beta of the system used Dante's Monarchia. All well and good, the data went into ...