Candidate | URL | Slogan | Category | Technology Stack | Scheduling Functionality | Initial Thoughts | Licensing |
---|
Open Refine | http://openrefine.org/ | A free, open source, powerful tool for working with messy data | Big Data | Requires JAVA JRE | none found | More focused on unstructured data, sound-ex and fuzzy matching features could play in for more advanced validation later on. Could play well with pre-submitted cleanup | A permissive license similar to the BSD 2-Clause License, but with a 3rd clause that prohibits others from using the name of the project or its contributors to promote derived products without written consent. |
Griffin | https://griffin.apache.org/ | Big Data Quality Solution For Batch and Streaming | Reporting tool with scheduling | JDK/ Hadoop/Spark | No | big data focus does not seem like the right fit. More about measuring data quality then identifying specific instances of bad data | Open Source (apache 2) |
Seal Report | https://sealreport.org/ | The ultimate open database reporting tool | ETL tool |
| Has GUI task scheduler | Could work well, especially if the LEA was interested in the visualization features | Open Source (apache 2) |
Talend | https://www.talend.com |
| Rules Engine | not sure | No | More about validating data during ETL then after it is already in the ODS perhaps? | Commercial w/ an Open source version |
Open Rules | https://openrules.com/ | Business Rules and Decision Management System | Rules Engine | Excel + Java + Eclipse | not sure | Like Drools with the Excel tie in | GPL |
Drools | https://www.drools.org/ |
| DB unit test infrastructure | Java / browser / eclipse | not sure | Would handle complex if/then flows | GPL |
dbfit | http://dbfit.github.io/dbfit/ | Test-driven database development | DB testing infrastructure | .NET or JAVA xUnit test framework | No | Focused on database development in a CI environmet | GPL |
DB Test Driven | http://www.dbtestdriven.com/ | Database test driven development for everyone | DB testing infrastructure | installs natively on MS SQL Server database | not directly | focused on database development in a test-driven development environment | freely available for download. Could not find mention of license |
tSQLt | https://tsqlt.org/ | THE OPEN SOURCE DATABASE UNIT TESTING FRAMEWORK FOR SQL SERVER | BI unit testing infrastucture | SQL Server / tSQL | not directly | More focused on testing database code then actual data but could be leveraged | Open Source (apache 2) |
NBi | http://www.nbi.io/ | open-source framework to test your Business Intelligence solutions | Scheduling App | XML config files that feed into NUnit (.NET flavor of xUNIT) | not directly | More focused on testing database code then actual data but could be leveraged | Open Source (apache 2) |
Quartz Scheduler | http://www.quartz-scheduler.org | open source job scheduling library that can be integrated within virtually any Java application | Organizational process automation infrastructure | JAVA | Yes |
| Open Source (apache 2) |
Run Deck | https://www.rundeck.com/ | The Platform for Self-Service Operations | Scheduling App | JAVA | Yes | Site Reliability Engineering (SRE) focus. Could potentially automate or facilitate organizational processes outside of data validation | Commercial w/ an Open source version |
SOS Berin Job Scheduler | https://www.sos-berlin.com/jobscheduler | Open Source solution for enterprise level workload automation. | Scheduling App | runs as Linux or Windows Service, GUI / database configuration | Yes |
| Open sourcer or commercial (supported) |
ScheduleX | http://www.schedulix.org/en | The Open Source Enterprise Job Scheduling System |
| Native Linux with a windows port | Yes |
| AGPL licence |