Data Catalog

Data Discovery

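A data catalog is a central inventory of an organization's data assets. It lets people discover what data exists, what it means, and who is responsible for it.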

Components

Metadata: the schema and description of each asset. Search: lets users find relevant data. Lineage: how data flows from sources to downstream assets. Contact: the owners of each asset.
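The four components above can be sketched as a small in-memory model. This is only an illustration under stated assumptions: the `CatalogEntry` and `Catalog` classes, their fields, and the example asset names are hypothetical and do not correspond to the API of any real catalog tool.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """One data asset as a catalog might describe it (hypothetical model)."""
    name: str
    description: str                 # metadata: what the asset contains
    schema: dict[str, str]           # metadata: column name -> column type
    owner: str                       # contact: who to ask about the asset
    upstream: list[str] = field(default_factory=list)  # lineage: source assets


class Catalog:
    """Minimal in-memory catalog: register assets, search them, look up owners."""

    def __init__(self) -> None:
        self._entries: dict[str, CatalogEntry] = {}

    def register(self, entry: CatalogEntry) -> None:
        self._entries[entry.name] = entry

    def search(self, term: str) -> list[str]:
        """Return names of assets whose name, description, or columns mention term."""
        needle = term.lower()
        hits = []
        for entry in self._entries.values():
            haystack = " ".join([entry.name, entry.description, *entry.schema]).lower()
            if needle in haystack:
                hits.append(entry.name)
        return sorted(hits)

    def owner_of(self, name: str) -> str:
        return self._entries[name].owner


# Example assets (invented for illustration).
catalog = Catalog()
catalog.register(CatalogEntry(
    name="raw.orders",
    description="Raw order events from the web shop",
    schema={"order_id": "string", "amount": "decimal"},
    owner="ingest-team@example.com",
))
catalog.register(CatalogEntry(
    name="analytics.daily_revenue",
    description="Revenue aggregated per day",
    schema={"day": "date", "revenue": "decimal"},
    owner="analytics@example.com",
    upstream=["raw.orders"],
))

print(catalog.search("revenue"))
print(catalog.owner_of("raw.orders"))
```

Real catalogs typically persist this information in a database and index it for full-text search; the substring match here only stands in for that step.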

Tools

Alation, Collibra: enterprise. Amundsen, DataHub: open source. Cloud: Data Catalog (GCP, now part of Dataplex), AWS Glue Data Catalog (AWS, with Lake Formation for access control on top of it).

Usage

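Self-service data access: users find and understand data without asking a central team. Impact analysis: lineage shows which downstream assets a change would affect. Compliance auditing: metadata and ownership records show what data exists and who is responsible for it.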
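Impact analysis is essentially a graph traversal over lineage. A minimal sketch, assuming lineage is stored as a mapping from each asset to the assets it reads from; the `LINEAGE` data and the `downstream_of` function are hypothetical and not taken from any specific catalog product.

```python
from collections import deque

# Hypothetical lineage: each asset maps to the assets it reads from.
LINEAGE = {
    "raw.orders": [],
    "raw.customers": [],
    "staging.orders_clean": ["raw.orders"],
    "analytics.daily_revenue": ["staging.orders_clean"],
    "analytics.customer_ltv": ["staging.orders_clean", "raw.customers"],
}


def downstream_of(lineage: dict[str, list[str]], changed: str) -> list[str]:
    """Return every asset that directly or indirectly reads from `changed`."""
    # Invert the mapping: source asset -> assets that consume it.
    consumers: dict[str, list[str]] = {}
    for asset, sources in lineage.items():
        for source in sources:
            consumers.setdefault(source, []).append(asset)

    # Breadth-first walk from the changed asset through its consumers.
    affected: set[str] = set()
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for consumer in consumers.get(current, []):
            if consumer not in affected:
                affected.add(consumer)
                queue.append(consumer)
    return sorted(affected)


print(downstream_of(LINEAGE, "raw.orders"))
```

Before changing the schema of `raw.orders`, the returned list identifies the assets whose owners should be notified.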

Key Takeaways

  1. Central metadata repository
  2. Search and discover data
  3. Enterprise and open source options
