Skip to content

Data Warehouse Services

Managed Cloud Data Warehouses


Overview

This section covers the major cloud data warehouse services: Amazon Redshift, Google BigQuery, Snowflake, and Databricks. Understanding when to use each platform is critical for building cost-effective, scalable data platforms.


Service Guides

DocumentDescriptionKey Topics
Redshift GuideAWS data warehouseClusters, distribution, sort keys, WLM
BigQuery GuideGCP serverless warehousePartitioning, clustering, ML, streaming
Snowflake GuideMulti-cloud warehouseTime travel, cloning, data sharing
Databricks GuideLakehouse platformDelta Lake, MLflow, Unity Catalog
ComparisonFeature comparisonArchitecture, pricing, performance

Quick Selection Guide


Key Differences

FeatureRedshiftBigQuerySnowflakeDatabricks
CloudAWS onlyGCP onlyAWS/GCP/AzureAWS/GCP/Azure
ArchitectureClusteredServerlessMulti-clusterLakehouse
Compute Cost$5/TB + cluster$5/TB$2-6/TB$0.50-2.00/TB
Time Travel7 days90 daysUnlimited
Data Sharing✅ Native
ML SupportVia SageMakerBigQuery MLSnowparkMLflow (native)

Learning Path

  1. Start with: Comparison - Understand the landscape
  2. Choose your platform:

Back to Module 3