Language models P3

Summary
A toolkit for enhancing language data for handwritten text recognition (HTR) processing. Final version.