0008 Dataplane

We need to design a globally distributed architecture for Unkey where the dataplane can operate independently of the primary database for improved availability.

Goals

Achieve 100% dataplane availability independent of primary database
Provide fast access to dynamic data across global regions
Propagate data across the system quickly
Minimize load on expensive storage
Enable efficient cache invalidation
Can run on any cloud or on premise

A dedicated cache layer could be added to reduce the load on DynamoDB and improve read performance as well as cost. Whether this actually saves money is debatable, we'll have to try. These cache nodes would be dumb, they only cache reads for 10s and don't have any manual eviction possibilities.

┌─────────────────┐
│ Gateway 1       │───┐
└─────────────────┘   │
                      │    ┌────────────┐
┌─────────────────┐   │    │   Load     │    ┌────────────┐
│ Gateway 2       │───┼───►│  Balancer  │───►│ Cache      │──┐
└─────────────────┘   │    │            │    │ Node 1     │  │    ┌──────────────┐
                      │    │            │    └────────────┘  ├───►│  DynamoDB    │
                      │    │            │                    │    │  Global      │
                      │    │            │    ┌────────────┐  │    │  Tables      │
┌─────────────────┐   │    │            │───►│ Cache      │──┘    └──────────────┘
│ Gateway n       │───┘    └────────────┘    │ Node 2     │
└─────────────────┘                          └────────────┘

Pros

Built-in multi-region replication with strong consistency
No need to manage complex replication logic
Lower latency reads from local region
Automatic conflict resolution
Serverless and fully managed by AWS
Cheaper per read operation than S3
99.999% availability (s3 only has 99.99%)

Cons

Vendor lock-in to AWS -> we need to have an abstraction
Higher storage cost compared to S3 due to replication
Cost of replication
Replication lag is controlled by AWS, not us

0008 Dataplane

Goals

Options

1. Direct S3 + In-Memory Cache with SWR

Pros

Cons

2. S3 + In-Memory Cache with Gossip Protocol

Pros

Cons

3. S3 + Dedicated Cache Layer

Pros

Cons

4. DynamoDB Global Tables + Caching

Pros

Cons

On this page