Initial training code

Summary
First version of parallel training code for training LLMs on European HPC systems