Kubernetes Deployment

Name: InferaDB
Author: InferaDB

Deploy InferaDB on Kubernetes with Helm charts and production-grade infrastructure.

Overview

InferaDB provides Helm charts for Kubernetes deployment. See Infrastructure Stack for the reference toolchain.

Helm Charts

Install the InferaDB Helm chart:

helm repo add inferadb https://charts.inferadb.com
helm repo update
helm install inferadb inferadb/inferadb

Override values for production:

helm install inferadb inferadb/inferadb \
  -f values-production.yaml

Engine

The Engine is deployed as a StatefulSet to maintain stable network identities for mesh communication.

Setting	Default	Description
Replicas	3	Initial replica count
HPA min	3	Minimum replicas under autoscaling
HPA max	20	Maximum replicas under autoscaling
CPU request	500m	Requested CPU per pod
Memory request	512Mi	Requested memory per pod

Horizontal Pod Autoscaler

The Engine scales automatically based on CPU utilization and custom metrics (authorization check latency):

engine:
  autoscaling:
    enabled: true
    minReplicas: 3
    maxReplicas: 20
    targetCPUUtilizationPercentage: 70

ServiceMonitor

A Prometheus ServiceMonitor is included for scraping Engine metrics:

engine:
  serviceMonitor:
    enabled: true
    interval: 15s

Ledger

The Ledger is deployed as a StatefulSet with persistent storage. Raft consensus requires an odd number of nodes for quorum.

Setting	Default	Description
Replicas	3	Must be odd (3, 5, 7)
Storage size	10Gi	PersistentVolumeClaim size
Storage class	default	Kubernetes StorageClass

ledger:
  replicas: 3
  persistence:
    size: 10Gi
    storageClass: "fast-ssd"

Ledger pods must not run on spot/preemptible instances.

Infrastructure Stack

Component	Tool	Purpose
OS	Talos Linux	Immutable, API-driven Kubernetes OS
CNI	Cilium (WireGuard)	Encrypted pod networking
GitOps	Flux CD	Continuous deployment
IaC	Terraform / OpenTofu	Infrastructure provisioning

Multi-Cloud Support

InferaDB is tested on the following cloud providers:

Provider	Instance Type	Notes
AWS	Graviton (ARM64)	Best price/performance for Engine
GCP	T2A / C3	Supported on GKE
DigitalOcean	Premium	Supported on DOKS

Spot Instances

Use spot instances for stateless workloads only (Engine, Control). Ledger requires on-demand instances.

engine:
  nodeSelector:
    node.kubernetes.io/instance-type: spot
ledger:
  nodeSelector:
    node.kubernetes.io/instance-type: on-demand

Multi-Region Deployment

Each region runs independent Ledger Raft groups for low-latency access and data residency.

Reference Topology

Region	Nodes	Purpose
nyc1	3	Primary region (US East)
sfo1	3	Secondary region (US West)

Vaults are pinned to a region at creation time. Data never leaves that region.

Deploying

# Region 1
helm install inferadb-nyc1 inferadb/inferadb \
  --set ledger.region=nyc1 \
  --set ledger.peers="ledger-0.nyc1,ledger-1.nyc1,ledger-2.nyc1"

# Region 2
helm install inferadb-sfo1 inferadb/inferadb \
  --set ledger.region=sfo1 \
  --set ledger.peers="ledger-0.sfo1,ledger-1.sfo1,ledger-2.sfo1"

Engine pods connect to the local Ledger group, resolving checks within-region.