Azure Purview–notes

Reading notes

Where exactly in your organization is the data which you are searching for? It is a usual fiasco happends in any big/enterprise to search for information/data when an employee resigns, because ‘catalog’ information resides with certain people in the organization which creates a dependency.

Find my reading notes on Azure Purview:

Azure Purview provides:

Unified data governance service
Manage and govern
     – on-premises,
     – multi-cloud, and
     – SaaS data

Create a
     – holistic,
     – up-to-date map of your data landscape
with
     – automated data discovery,
     – sensitive data classification,
     – and end-to-end data lineage.

Unified Map
  – Automate and Manage metadata from hybrid sources
  – Classify data using built-in and custom classifiers
  – Label sensitive data
  – Integrate all your data systems using Apache Atlas API

Catalogue Insights:
– Asset Insights
– Glossary Insights
– Scan Insights
– Classification Insights
– Sensitive Label Insights

Docs
https://docs.microsoft.com/en-gb/azure/purview/overview

Supported data sources
https://docs.microsoft.com/en-us/azure/purview/purview-connector-overview

Pricing
https://azure.microsoft.com/en-in/pricing/details/azure-purview/

Questions for planning:

Scenarios:
Persona – Who are the users?
Source system – What are the data sources such as Azure Data Lake Storage Gen2 or Azure SQL Database?
Impact Area – What is the category of this scenario?
Detail scenarios – How the users use Purview to solve problems?
Expected outcome – What is the success criteria?

Deployment:
What are the main organization data sources and data systems?
For data sources that are not supported yet by Purview, what are my options?
How many Purview instances do we need?
Who are the users?
Who can scan new data sources?
Who can modify content inside of Purview?
What process can I use to improve the data quality in Purview?
How to bootstrap the platform with existing critical assets, glossary terms, and contacts?
How to integrate with existing systems?
How to gather feedback and build a sustainable process?

Ref: https://docs.microsoft.com/en-gb/azure/purview/deployment-best-practices