Use case guides
Last modified on 09-Sep-24
Use the following guides as example implementations based on how you intend to use Soda for data quality testing. For standard set up instructions, see Get started.
Guide | Description | Soda product requirements |
---|---|---|
Test data in an Airflow pipeline | Use this guide as an example for how to set up Soda to test the quality of your data in an Airflow pipeline that uses dbt transformations. | Soda Library Soda Cloud |
Test data quality in an ADF pipeline | Learn how to invoke Soda data quality tests in an ETL pipeline in Azure Data Factory. | Soda Library Soda Cloud |
Test data quality in a Dagster pipeline | Learn how to invoke Soda data quality tests in a Dagster pipeline. | Soda Library Soda Cloud |
Test data quality in Databricks pipeline | Learn how to use Databricks notesbooks with Soda to test data quality before feeding a machine learning model. | Soda Library Soda Cloud |
Test data before migration | Use this guide to set up Soda to test before and after data migration between data sources. | Soda Library Soda Cloud |
Self-serve Soda | Use this guide to set up Soda Cloud to enable users across your organization to serve themselves when it comes to testing data quality. | Soda Cloud Soda Agent |
Test data during development | Use this guide to set up Soda to test the quality of your data during your development lifecycle in a GitHub Workflow. | Soda Library Soda Cloud |
Automate monitoring | Use this guide to set up Soda to automatically monitor data quality. | Soda Cloud Soda Agent |
Use the following How tos for practical advice, examples, and instructions for using Soda.
How to | Description | Soda product requirements |
---|---|---|
Invoke Soda in Databricks | Learn how to invoke Soda data quality tests in a Databricks notebook. | Soda Library Soda Cloud |
Use a Secrets Manager | Learn how to set up a Soda Agent to use an External Secrets Manager to retrieve frequently-rotated data source passwords. | Soda Cloud Self-hosted Agent |
Generate API keys | Learn how to use Soda Cloud API keys to securely communicate with other entities such as Soda Library and self-hosted Soda Agents, and to provide secure access to Soda Cloud via API. | Soda Cloud |
Manage sensitive data | Learn how to adjust several configurable settings that help you manage access to sensitive data in Soda Cloud. | Soda Cloud |
Reroute failed row samples | Learn how to programmatically set up Soda Library to display failed row samples in the command-line. | Soda Library Soda Cloud |
Double-onboard a data source | Learn how to onboard a data source in Soda Cloud that you have already onboarded via Soda Library. | Soda Library Soda Cloud |
Need help? Join the Soda community on Slack.
Was this documentation helpful?
What could we do to improve this page?
- Suggest a docs change in GitHub.
- Share feedback in the Soda community on Slack.
Documentation always applies to the latest version of Soda products
Last modified on 09-Sep-24