HOW DID I CREATE calenderJankyElement.html? 1. Get lemma and ids from database cd /home/spenteco/projects/spenserSC/fix120613 export PYTHONIOENCODING=utf-8 ./identifyAllLemma.py > allLemma.csv 2. Generate a basic text cd /home/spenteco/projects/spenserSC/fix120613/prepareBaseText saxonb-xslt calender.xml styleText_SC.xsl > BASE_calendar.xml saxonb-xslt BASE_calendar.xml tokenize.xsl > TOKENIZED_calendar.xml MANUALLY CHANGE span xmlns="http://www.tei-c.org/ns/1.0" TO span 4. Match spans and lemma, cleanup spreadsheet, etc ./matchSpansAndLemma.py allLemma.csv prepareBaseText/TOKENIZED_calendar.xml > OUT_matchSpansAndLemma.csv grep 'lemma|' OUT_matchSpansAndLemma.csv > lemma.csv grep 'token|' OUT_matchSpansAndLemma.csv > token.csv 5. Validate spreadsheet and apply to XML. ./validateAndApply.py token.csv prepareBaseText/TOKENIZED_calendar.xml > calenderJankyElement.html 6. Add header, style, etc to test.