Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

CandidateURLSloganCategoryTechnology StackScheduling FunctionalityInitial ThoughtsLicensing
Open Refinehttp://openrefine.org/A free, open source, powerful tool for working with messy dataBig DataRequires JAVA JREnone foundMore focused on unstructiured data, sound-ex and fuzzy matching features could play in for more advanced validation later on. Could play well with pre-submitted cleanupA permissive license similar to the BSD 2-Clause License, but with a 3rd clause that prohibits others from using the name of the project or its contributors to promote derived products without written consent.
Griffinhttps://griffin.apache.org/Big Data Quality Solution For Batch and StreamingBig DataJDK/ Hadoop/SparkNobig data focus does not seem like the right fit. More about measuring data quality then identifying specific instances of bad dataOpen Source (apache 2)
Seal Reporthttps://sealreport.org/The ultimate open database reporting toolReporting tool with scheduling
Has GUI task schedulerCould work well, especially if the LEA was interested in the visualiztion featuresOpen Source (apache 2)
  • No labels