Datasets for Research
- Latency measurements for Amazon Web Services (AWS): http://claudit.feld.cvut.cz/data.php
- Traces from one of Google's data centers: https://github.com/google/cluster-data
- Fbflow raw samples from 3 production clusters at Facebook: https://www.facebook.com/network-analytics
- Figure Eight, a training-data platform that helps turn raw data into ground-truth training data for ML models (annotations, labels, judgements, etc.): https://www.figure-eight.com/
The Center for Applied Internet Data Analysis (CAIDA) also carries out network research and builds research infrastructure to support large-scale data collection. Its datasets can be found at http://www.caida.org/data/overview/. You can also search for datasets with Google Dataset Search: https://datasetsearch.research.google.com/