Architecture Overview

KalamDB combines SQL execution, realtime delivery, and multi-tier storage in one runtime.

It also supports vector search with EMBEDDING(n) columns, cosine indexes, and similarity ranking in SQL for semantic retrieval workflows.

Major Runtime Layers

kalamdb-api: HTTP + WebSocket surface (/v1/api/*, /v1/ws)
kalamdb-core: orchestration, SQL execution flow, authorization checks, job dispatch
kalamdb-sql: SQL parser/classifier/extensions (SUBSCRIBE, TOPIC, STORAGE, EXECUTE AS USER, etc.)
kalamdb-store + RocksDB: hot write/read path
kalamdb-filestore + Parquet: cold segments and manifests
kalamdb-raft: cluster consensus and replication

Default paths are rooted at storage.data_path (usually ./data):


data/
├── rocksdb/      # hot tier
├── storage/      # cold parquet tier
├── snapshots/    # raft snapshots
└── streams/      # stream table logs

Table-specific cold paths are generated from templates:

For detailed flush behavior and cold-tier movement, see Storage Tiers.