Data Discovery
A central, searchable catalog of an organization's data assets.
Components
- Metadata: schema and description of each asset
- Search: find relevant data
- Lineage: how data flows between assets
- Contact: who owns each asset
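To make the four components concrete, here is a minimal sketch of what one catalog record and a naive search over it might look like. The `CatalogEntry` class, its field names, the `search` helper, and the sample assets are all hypothetical and are not taken from any of the tools listed in these notes.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """One data asset as a catalog might record it (hypothetical shape)."""
    name: str
    description: str                 # metadata: human-readable description
    schema: dict                     # metadata: column name -> type
    owner: str                       # contact: who to ask about this asset
    upstream: list = field(default_factory=list)  # lineage: source assets


def search(entries, term):
    """Case-insensitive substring match on name, description, and column names."""
    term = term.lower()
    hits = []
    for entry in entries:
        haystack = " ".join([entry.name, entry.description, *entry.schema]).lower()
        if term in haystack:
            hits.append(entry.name)
    return hits


# Illustrative sample data only.
CATALOG = [
    CatalogEntry(
        name="raw.orders",
        description="Raw order events",
        schema={"order_id": "string", "amount": "double"},
        owner="ingest-team@example.com",
    ),
    CatalogEntry(
        name="analytics.daily_revenue",
        description="Revenue per day",
        schema={"day": "date", "revenue": "double"},
        owner="analytics@example.com",
        upstream=["raw.orders"],
    ),
]

print(search(CATALOG, "revenue"))
```

Real catalogs typically back search with a full-text index rather than a linear scan. The scan here only shows which fields are worth indexing.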
Tools
- Enterprise: Alation, Collibra
- Open source: Amundsen, DataHub
- Cloud: Data Catalog (GCP); Glue Data Catalog, with Lake Formation for access control (AWS)
Usage
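- Self-service data access: users find and understand datasets without asking the data team
- Impact analysis: follow lineage to see what a change affects downstream
- Compliance auditing: review which assets exist, who owns them, and where their data comes from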
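Impact analysis can be sketched as a graph traversal over lineage. The example below assumes lineage is available as a simple mapping from each asset to its direct consumers. The `DOWNSTREAM` mapping, the asset names, and the `impacted_assets` function are hypothetical; a real catalog would expose lineage through its own API.

```python
from collections import deque

# Hypothetical lineage edges: asset -> assets that read from it directly.
DOWNSTREAM = {
    "raw.orders": ["staging.orders_clean"],
    "staging.orders_clean": ["analytics.daily_revenue", "analytics.customer_ltv"],
    "analytics.daily_revenue": ["dashboards.finance"],
}


def impacted_assets(asset, downstream=DOWNSTREAM):
    """Return every asset reachable downstream of `asset`, in breadth-first order."""
    seen = {asset}          # start asset is marked so cycles cannot re-add it
    order = []
    queue = deque([asset])
    while queue:
        current = queue.popleft()
        for child in downstream.get(current, []):
            if child not in seen:
                seen.add(child)
                order.append(child)
                queue.append(child)
    return order


print(impacted_assets("raw.orders"))
```

Breadth-first order lists the nearest consumers first, which is usually the order in which owners need to be notified of a breaking change.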
Key Takeaways
- Central metadata repository
- Search and discover data
- Enterprise, open source, and cloud options