Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.


CandidateURLSloganCategoryTechnology StackScheduling FunctionalityInitial ThoughtsLicensing
Open Refinehttp://openrefine.org/A free, open source, powerful tool for working with messy dataBig DataRequires JAVA JREnone foundMore focused on unstructiured data, sound-ex and fuzzy matching features could play in for more advanced validation later on. Could play well with pre-submitted cleanupA permissive license similar to the BSD 2-Clause License, but with a 3rd clause that prohibits others from using the name of the project or its contributors to promote derived products without written consent.
Griffinhttps://griffin.apache.org/Big Data Quality Solution For Batch and StreamingBig DataJDK/ Hadoop/SparkNobig data focus does not seem like the right fit. More about measuring data quality then identifying specific instances of bad dataOpen Source (apache 2)
Seal Reporthttps://sealreport.org/The ultimate open database reporting toolReporting tool with scheduling
Has GUI task schedulerCould work well, especially if the LEA was interested in the visualiztion featuresOpen Source (apache 2)