Architecture Overview

KalamDB combines SQL execution, realtime delivery, and multi-tier storage in one runtime.

It also supports vector search with EMBEDDING(n) columns, cosine indexes, and similarity ranking in SQL for semantic retrieval workflows.
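As a rough illustration of what cosine-based similarity ranking computes, here is a minimal Rust sketch. It is not KalamDB's implementation or SQL syntax; the function names `cosine_similarity` and `top_k` and the `(id, embedding)` row shape are assumptions made for this example only.

```rust
/// Cosine similarity between two vectors of the same dimension.
/// Returns None if the dimensions differ, the vectors are empty,
/// or either vector has zero length (the ratio is undefined there).
pub fn cosine_similarity(a: &[f32], b: &[f32]) -> Option<f32> {
    if a.len() != b.len() || a.is_empty() {
        return None;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return None;
    }
    Some(dot / (norm_a * norm_b))
}

/// Score every (id, embedding) row against the query and keep the
/// k most similar rows, highest score first. Rows whose dimension
/// does not match the query are skipped.
pub fn top_k(query: &[f32], rows: &[(u64, Vec<f32>)], k: usize) -> Vec<(u64, f32)> {
    let mut scored: Vec<(u64, f32)> = rows
        .iter()
        .filter_map(|(id, emb)| cosine_similarity(query, emb).map(|s| (*id, s)))
        .collect();
    // Sort by descending similarity; total_cmp gives a total order on f32.
    scored.sort_by(|l, r| r.1.total_cmp(&l.1));
    scored.truncate(k);
    scored
}
```

This sketch scans every row. A cosine index exists to avoid that full scan, so treat the code as a description of the ranking result rather than of how the engine gets there.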

Major Runtime Layers

kalamdb-api: HTTP + WebSocket surface (/v1/api/*, /v1/ws)
kalamdb-core: orchestration, SQL execution flow, authorization checks, job dispatch
kalamdb-sql: SQL parser/classifier/extensions (SUBSCRIBE, TOPIC, STORAGE, EXECUTE AS '<user_id>', etc.)
kalamdb-store + RocksDB: hot write/read path
kalamdb-filestore + Parquet: cold segments and manifests
kalamdb-raft: cluster consensus and replication

Read /docs/server/architecture/table-types first when you are deciding data ownership or security.
Read /docs/server/architecture/storage-tiers and /docs/server/architecture/manifests when you are tuning flush, compaction, or storage templates.
Read /docs/server/architecture/stream-storage when you are tuning STREAM table retention, transient UI state, or TTL cleanup behavior.
Read /docs/server/architecture/live-query and /docs/server/architecture/clustering when you are scaling realtime connections or multi-node deployments.

Default paths are rooted at storage.data_path (usually ./data):

data/
├── rocksdb/      # hot tier
├── storage/      # cold parquet tier
├── snapshots/    # raft snapshots
└── streams/      # stream table logs
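The default layout can be pictured as four subdirectories joined onto a single root. The Rust sketch below shows that derivation. The `DataLayout` type and `from_root` function are hypothetical names for illustration, not part of KalamDB's code; only the directory names come from the tree above.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical holder for the default on-disk locations,
/// all rooted at the configured storage.data_path.
#[derive(Debug, PartialEq)]
pub struct DataLayout {
    pub rocksdb: PathBuf,   // hot tier
    pub storage: PathBuf,   // cold parquet tier
    pub snapshots: PathBuf, // raft snapshots
    pub streams: PathBuf,   // stream table logs
}

impl DataLayout {
    /// Build the default layout by joining each subdirectory onto the root.
    pub fn from_root(data_path: impl AsRef<Path>) -> Self {
        let root = data_path.as_ref();
        Self {
            rocksdb: root.join("rocksdb"),
            storage: root.join("storage"),
            snapshots: root.join("snapshots"),
            streams: root.join("streams"),
        }
    }
}
```

Calling `DataLayout::from_root("./data")` yields `./data/rocksdb`, `./data/storage`, `./data/snapshots`, and `./data/streams`. Changing the root moves all four together, which is the practical meaning of "rooted at storage.data_path".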

Table-specific cold paths are generated from templates:

For detailed flush behavior and cold-tier movement, see /docs/server/architecture/storage-tiers. For the file-backed STREAM table path, see /docs/server/architecture/stream-storage.

Component	Technology	Notes
Language	Rust 1.92+	concurrency + memory safety
Query engine	Apache DataFusion 54.0	SQL planning/execution (DuckDB dialect for lambdas and JSON operators)
Columnar format	Apache Arrow 58.x	in-memory batches
Cold storage	Apache Parquet 58.x	compressed columnar files
Hot storage	RocksDB 0.24	write-heavy low-latency path
API runtime	Actix Web 4.12	HTTP + WebSocket
Auth	bcrypt + JWT	password + token flows