Google has announced the general availability of Dataplex, an intelligent data fabric that enables you to centrally manage, monitor, and govern data across data lakes, data warehouses, and data marts. It also makes this data securely accessible to a variety of analytics and data science tools.
Dataplex allows enterprises to delegate ownership, usage, and sharing of data, and provides a single interface to consistently monitor and govern data across data domains.
Dataplex enables you to unify data distributed across data lakes, data warehouses, and data marts without any data movement, organise it based on your business needs, and centrally manage, monitor, and govern it. It also standardises and unifies metadata, security policies, governance, classification, and data lifecycle management across this distributed data.
Dataplex harvests the metadata for both structured and unstructured data, using built-in data quality checks to enhance integrity. It automatically registers all metadata in a unified metastore. The data and metadata can be accessed through a variety of Google Cloud services, such as BigQuery, Dataproc Metastore, and Data Catalog, and through open source tools, such as Apache Spark and Presto.
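The harvest, check, and register flow described above can be pictured with a small conceptual sketch. This is not the Dataplex API: the class names, check functions, and storage locations below are all hypothetical, chosen only to show how metadata might be recorded in one place while the underlying data stays where it is.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class MetadataEntry:
    """Hypothetical metadata record; only a pointer to the data is stored."""
    name: str
    location: str              # where the data lives (it is never copied)
    structured: bool
    schema: Dict[str, str] = field(default_factory=dict)
    quality_issues: List[str] = field(default_factory=list)


# A quality check returns a message when it finds a problem, else None.
QualityCheck = Callable[[MetadataEntry], Optional[str]]


def check_has_schema(entry: MetadataEntry) -> Optional[str]:
    if entry.structured and not entry.schema:
        return "structured source has no schema"
    return None


def check_has_location(entry: MetadataEntry) -> Optional[str]:
    if not entry.location:
        return "missing location"
    return None


class UnifiedMetastore:
    """Toy in-memory stand-in for a unified metastore (an assumption)."""

    def __init__(self, checks: List[QualityCheck]) -> None:
        self._checks = list(checks)
        self._entries: Dict[str, MetadataEntry] = {}

    def register(self, entry: MetadataEntry) -> None:
        # Run every quality check before recording the entry.
        issues = []
        for check in self._checks:
            message = check(entry)
            if message is not None:
                issues.append(message)
        entry.quality_issues = issues
        self._entries[entry.name] = entry

    def lookup(self, name: str) -> MetadataEntry:
        return self._entries[name]

    def healthy(self) -> List[str]:
        # Names of entries that passed every check, in sorted order.
        return sorted(n for n, e in self._entries.items() if not e.quality_issues)


# Illustrative sources; the URIs are made up for this sketch.
store = UnifiedMetastore([check_has_schema, check_has_location])
store.register(MetadataEntry("orders", "bq://sales.orders", True, {"id": "INT64"}))
store.register(MetadataEntry("clicks", "gs://lake/clicks/", True))
store.register(MetadataEntry("scans", "gs://lake/scans/", False))
```

In this sketch, analytics tools would call `lookup` to find where a dataset lives and what shape it has, which mirrors the idea of many engines sharing one metadata view rather than each keeping its own copy.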