HOW DID I CREATE calenderJankyElement.html?
1. Get lemma and ids from database
cd /home/spenteco/projects/spenserSC/fix120613
export PYTHONIOENCODING=utf-8
./identifyAllLemma.py > allLemma.csv
2. Generate a basic text
cd /home/spenteco/projects/spenserSC/fix120613/prepareBaseText
saxonb-xslt calender.xml styleText_SC.xsl > BASE_calendar.xml
saxonb-xslt BASE_calendar.xml tokenize.xsl > TOKENIZED_calendar.xml
MANUALLY CHANGE
span xmlns="http://www.tei-c.org/ns/1.0"
TO
span
4. Match spans and lemma, cleanup spreadsheet, etc
./matchSpansAndLemma.py allLemma.csv prepareBaseText/TOKENIZED_calendar.xml > OUT_matchSpansAndLemma.csv
grep 'lemma|' OUT_matchSpansAndLemma.csv > lemma.csv
grep 'token|' OUT_matchSpansAndLemma.csv > token.csv
5. Validate spreadsheet and apply to XML.
./validateAndApply.py token.csv prepareBaseText/TOKENIZED_calendar.xml > calenderJankyElement.html
6. Add header, style, etc to test.