Datasets suitable for training, validation, and testing of modules, along with infrastructure to support data collection/processing

Summary
Datasets suitable for training, validation, and testing of modules, along with infrastructure to support data collection/processing