Description

The Sample Data Generator (SDG) produces realistic, cohesive, and 100% fictional datasets for use in demonstrations and testing.

The SDG favors statistically realistic patterns. For example, a student with poor attendance generally has poor grades, students are the appropriate age for their grade level, and students who are English learners have home languages that track to their ethnicity. The system is configurable and can produce arbitrarily large datasets.
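As an illustration of what "statistically realistic patterns" can mean, the following minimal Python sketch generates fictional student records in which age follows grade level and grade average loosely tracks attendance. It is not the SDG's code or configuration: the function names, field names, value ranges, and the age rule (grade level plus 5 or 6) are all assumptions made for this example.

```python
import random


def generate_student(rng: random.Random, grade_level: int) -> dict:
    """Return one fictional student record with loosely correlated fields.

    Hypothetical illustration only; not taken from the SDG.
    """
    # Assumption: a student in grade g is g + 5 or g + 6 years old.
    age = grade_level + rng.choice([5, 6])

    # Attendance rate drawn uniformly between 70% and 100%.
    attendance = rng.uniform(0.70, 1.00)

    # Grade average tracks attendance (50 at 70% attendance, 100 at 100%),
    # plus Gaussian noise so the relationship is a tendency, not a rule.
    base = 50 + 50 * (attendance - 0.70) / 0.30
    grade_average = max(0.0, min(100.0, base + rng.gauss(0, 5)))

    return {
        "grade_level": grade_level,
        "age": age,
        "attendance_rate": attendance,
        "grade_average": grade_average,
    }


def generate_students(count: int, seed: int = 0) -> list:
    """Generate `count` records; a fixed seed makes the output repeatable."""
    rng = random.Random(seed)
    return [generate_student(rng, rng.randint(1, 12)) for _ in range(count)]
```

Because each record is produced independently, the dataset size is limited only by the `count` argument, and the noise term keeps individual students from following the attendance pattern exactly.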

While the SDG creates data with realistic patterns, the data is randomly generated and must not be used in place of real-world data for purposes such as training machine learning models or developing other algorithmic approaches.

Download

  • Code: ((GitHub Ed-Fi OSS repo link))
  • ZIP Package: ((GitHub Release ZIP package link))
  • Documentation: Sample Data Generator

Details

  • By: Major contributions from EdWire
  • License Terms: Apache 2.0 License
  • Released: February 2022

At a Glance

Generation: Tech Suite 3