Datafold is a data quality platform that helps data teams prevent and identify data quality issues before they reach production. Founded in 2020, the platform automates data testing processes within the developer workflow, enabling analytics engineers to detect potential problems during the development phase. The platform's core functionality includes automated regression testing, which allows engineers to visualize the impact of code changes on data before deployment. Through its Data Diff feature, Datafold analyzes changes in terms of row numbers, schema, primary keys, and column differences. The platform also provides column-level lineage capabilities, allowing users to track how data flows between columns and understand how upstream table modifications affect downstream tables.
Key customers and partnerships
Datafold's customer base includes companies such as Thumbtack, Patreon, Truebill, Faire, and Dutchie. In March 2022, the company established a partnership with dbt Labs to integrate automated test coverage into companies' CI/CD workflows. This integration enabled Datafold to embed automated test summaries directly in GitHub and GitLab, allowing engineers to view the impact of their changes in pull requests.
By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.