Language models P1

Summary
A toolkit for enhancing language data for HTR processing. Prototype version