Distributed Learning Data Validation and Reproducibility Research