Amazon Redshift

Amazon Redshift

Fully managed, petabyte-scale data warehouse service in the cloud.

Amazon Redshift Platform

Amazon Redshift is a fully managed cloud data warehouse that delivers fast query performance on large datasets by using massively parallel processing and columnar storage. It allows organizations to analyze structured and semi-structured data at scale using standard SQL.

By using Redshift, you can centralize analytics in a secure and scalable environment while reducing the complexity of managing infrastructure.

Data Vault is a perfect fit for Redshift

The goal of Data Vault modeling is to keep pace with changing business requirements and support agile, incremental delivery of analytics solutions. The hub, link, and satellite design makes the model highly extensible and easy to adapt when changes occur.

Redshift supports the SQL functions needed for Data Vault modeling, including MD5 and SHA hashing for business keys. VaultSpeed can generate all Data Vault structures and ELT code to run natively in Redshift.

Redshift layered architecture

A typical Redshift environment uses a layered approach:

  • Landing layer where ingested data from source systems is staged
  • Integration layer containing harmonized, historical datasets in the Raw Vault
  • Presentation layer delivering business-friendly outputs such as star schemas, flat tables, or custom views

This layered approach matches VaultSpeed’s architecture for building and maintaining governed Data Vault environments.

VaultSpeed Data Vault automation

VaultSpeed automates the design and creation of Data Vault models for Redshift.

The platform combines both data-driven and model-driven approaches:

  • Ingest metadata from source systems to speed up model creation
  • Incorporate business concepts to build a Data Vault that reflects your organization’s needs
  • Automatically generate Redshift-compatible DDL and SQL for loading and transforming data

Reference architecture for Redshift

VaultSpeed delivers no-code automation for building and maintaining Raw Vault and Business Vault layers in Redshift. These layers provide a consistent, governed foundation for analytics and reporting.

For the Presentation layer, VaultSpeed’s Template Studio allows you to define and deploy outputs that meet specific business requirements, including dimensional models and aggregated datasets.

Create workflow schedules

VaultSpeed’s Flow Management Control (FMC) module allows you to orchestrate and schedule your data pipelines. Workflows can be deployed to orchestration tools such as Amazon Managed Workflows for Apache Airflow (MWAA) or any other supported scheduler.

A solid foundation for analytics

VaultSpeed ensures Redshift receives continuously updated, governed, and historical datasets. When business requirements change, you only need to adjust the analytics or presentation layer. The Data Vault layer stores all data history so that you can maintain full traceability and auditability.

Streaming Data Vault in Redshift

VaultSpeed supports multiple load patterns for Redshift, including batch and Change Data Capture (CDC).

When needed, VaultSpeed generates:

  • DDL to create the Data Vault structures in Redshift
  • SQL to implement transformations and CDC logic for incremental updates

This ensures your Redshift environment always works with the latest available data.

Visit aws.amazon.com