Connect to your data sources within minutes, gain end to end visibility.
Bring data and business teams together with a single source of truth.
Power mission critical workflows with a SOC-2 compliant platform.
DataHub's unified search experience surfaces results across databases, data lakes, BI platforms, ML feature stores, orchestration tools, and more.
Quickly understand the end-to-end journey of data by tracing lineage across platforms, datasets, ETL/ELT pipelines, charts, dashboards, and beyond.
Proactively identify which entities may be impacted by a breaking change using Impact Analysis.
Combine technical, operational and business metadata to provide a 360º view of your data entities.
Generate Dataset Stats to understand the shape & distribution of the data
Capture historical Data Validation Outcomes from tools like Great Expectations and dbt Tests.
Metadata Tests allow you to define and continuously evaluate a set of conditions on the most important data assets in your company.
Quickly and easily assign entity ownership to users and user groups.
Empower data owners to govern their data entities with:
Keep up with your rapidly evolving business by easily defining, editing, or removing Glossary Terms with a few clicks of a button.
Rest assured that your Tags and Terms are associated with the correct entities by creating approval flows. This empowers your stakeholders to align data assets with business terms with appropriate checks in place.
Seamlessly communicate known issues and their outcomes by creating Incidents to keep your stakeholders informed.
Configure Slack notifications to alert stakeholders of changes to deprecation statuses, schema fields added to or removed from a dataset, and more.
DataHub admins can create Policies to define who can perform what action against which resource(s). When you create a new Policy, you will be able to define the following:
Schedule metadata ingestion using the DataHub user interface. Get started within minutes to get your main data sources ingested. Apply “shift-left” practices to pre-enrich important metadata using ingestion transformers, support for dbt meta-mapping and other features.
DataHub has pre-built integrations with your favorite systems: Snowflake, BigQuery, dbt, Airflow, Looker, Kafka and many others. These connectors are battle-tested by the largest data catalog community and the community is continuously adding more integrations, so this list keeps getting longer and longer.