Listen to this story
|
Database management systems (DBMS) are widely used in a host of industries spanning finance, healthcare, pharmaceutical, media, service and hospitality, amongst others. The development of various data formats, software-as-a-service (SaaS)-based DBMS, and the growing reliance on “big data” for marketing are great drivers of growth for database management systems.
The global database management system market is expected to exhibit a CAGR of 12.8% during 2022-2027. The year 2022 marked the release of some new database management software systems. A few of these are detailed below.
AlloyDB for PostgreSQL
Google introduced AlloyDB for PostgreSQL, a fully-managed, PostgreSQL-compatible database for demanding enterprise-grade transactional and analytical workloads. Google claimed that in its performance tests, AlloyDB was “more than 4x faster on transactional workloads, and up to 100x faster on analytical queries than standard PostgreSQL, all with simple, predictable pricing”.
Google informed in a blog post that AlloyDB for PostgreSQL was built on the principle of disaggregation of compute and storage and was designed to leverage disaggregation at every layer of the stack.
At AlloyDB’s centre is an intelligent, database-optimized storage service built specifically for PostgreSQL. AlloyDB disaggregates compute and storage at every layer of the stack, using the same infrastructure building blocks that power large-scale Google services such as YouTube, Search, Maps, and Gmail. This unique technology allows it to scale seamlessly while offering predictable performance
AlloyDB for PostgreSQL is a distributed system with three parts:
- A low-latency, regional log storage service catering to very fast write-ahead log writing.
- A log processing service that processes these write-ahead log records and produces “materialised” database blocks.
- Failure-tolerant, sharded, regional block storage for durability even in case of zonal storage failures.
MySQL HeatWave
MySQL Heatwave is a fully managed database service that enables users to run OLTP, OLAP, and machine learning workloads directly from their MySQL Database. MySQL HeatWave is 1400x faster than Amazon Aurora. HeatWave is a massively parallel, high performance, in-memory query accelerator that accelerates MySQL performance by orders of magnitude for analytics workloads, mixed workloads, and machine learning. It can be accessed through Oracle Cloud Infrastructure, Amazon Web Services, and Oracle Database Service for Azure.
The MySQL DB System includes a HeatWave plugin responsible for cluster management, query scheduling, and returning query results to the MySQL DB System.
Velox
In August 2022, Meta introduced Velox, as an open source execution engine for data management systems. Velox is a novel open source C++ database acceleration library that offers reusable, extensible, high-performance, and dialect-agnostic data processing components for building execution engines, and enhancing data management systems. The library depends heavily on vectorization and adaptivity, and is designed from the ground up to support efficient computation over complex data types due to their ubiquity in modern workloads.
It is currently being integrated with more than a dozen data systems at Meta, including analytical query engines such as Presto and Spark, stream processing platforms, message buses and data warehouse ingestion infrastructure, machine learning systems for feature engineering and data preprocessing (PyTorch), and more.
It offers a host of advantages to users in terms of efficiency by democratising optimisation that was previously only available in individual engines. It also provides increased consistency for data users. It enhances efficiency by promoting reusability processing platforms, transactional engines, data ingestion systems and infrastructure, ML systems for feature engineering, amongst others.
InfluxDB 10X
InfluxDB IOx is a cloud-native, real-time, columnar database optimized for time series data built in Rust on top of Apache Arrow and DataFusion. It was written in the Rust programming language and uses Apache Parquet files for on-disk storage and Apache Arrow for operations between components. Apache Arrow is an in-memory specification for columnar data that makes analytical queries very fast.
IOx also uses the DataFusion library, a native SQL query engine, as its parser, planner, optimizer, and execution engine. This means that for the first time InfluxDB supports the PostgreSQL dialect and wire protocol allowing you to connect to third-party libraries and BI tools like PSQL, Grafana, Tableau and Apache Superset.
SQL Server 2022 by Microsoft
SQL Server 2022 introduces a new integration that makes it possible to copy the contents of an on-premises database to Azure SQL Managed Instance. It also includes an integration with Microsoft’s Azure Synapse service. The service provides features that enable companies to prepare business data for analysis and run queries on the data to find useful insights. SQL Server 2022 also introduces integrations with two other products: Microsoft Purview and Azure Arc.
Amazon Neptune
AWS released Amazon Neptune Serverless, a new serverless option made for Amazon Neptune, that automatically scales business-critical and unpredictable graph database workloads. A part of the company’s web service, Amazon Neptune, is a high-performance graph database service to store and query complex connected datasets. Amazon Neptune serverless would include Amazon Neptune’s advanced capabilities for high availability and performance. Moreover, there would be no additional costs to use the new feature, with customers only paying for the database resources.
In 2022, Amazon also announced the release of five new database and analytics capabilities at the AWS re:Invent 2022 conference which include Amazon DocumentDB Elastic Clusters, Amazon OpenSearch Serverless, Amazon Athena for Apache Spark, AWS Glue Data Quality, and Amazon Redshift Multi-AZ.