OriginChain Engineering Blog

OriginChain Engineering BlogEngineering notes, design decisions, and customer stories from the team behind OriginChain - the AI-native database for SQL, vector, full-text, and graph.https://originchain.aien-usMulti-region cold standby for $5/month instead of $78/monthhttps://originchain.ai/blogs/multi-region-cold-standby-5-vs-78-per-monthoriginchain-blog:multi-region-cold-standby-5-vs-78-per-monthWe refactored our multi-region DR from a textbook hot-standby model to a cold-standby model and dropped per-tenant DR cost from ~$78/mo to ~$5–15/mo. RPO ≤60 s, RTO ~10–15 min. The math, the runbook excerpt, and the trade-off named explicitly.Sun, 07 Jun 2026 09:00:00 GMTdrinfrastructurecostengineeringoperationsOriginChain engineering (OriginChain)How we test our database with 850,000 random API probes a dayhttps://originchain.ai/blogs/850000-random-api-probes-a-dayoriginchain-blog:850000-random-api-probes-a-dayA 24/7 fuzz canary fires ~10,000 random probes at our engine every minute, with a metric, a two-minute alarm, and assertions on no-5xx / JSON-parseable / 401-on-no-auth. 14 generator families, ~85% surface coverage, p99 ~115 ms. Here's the loop, the invariants, and what we don't cover.Sat, 06 Jun 2026 13:00:00 GMTfuzzingreliabilitycanaryengineeringqaOriginChain engineering (OriginChain)Window functions, correlated subqueries, and 590 passing testshttps://originchain.ai/blogs/window-functions-correlated-subqueries-590-testsoriginchain-blog:window-functions-correlated-subqueries-590-testsOriginChain's SQL surface just grew window functions and correlated subqueries. ROW_NUMBER / RANK / DENSE_RANK / LAG / LEAD / SUM-AVG-COUNT-MIN-MAX OVER plus EXISTS / IN / scalar correlated subqueries, all on the planner, all green across ~590 tests. Here's what works in v1, the cumulative-frame trade-off, and what's queued for v2.Sat, 06 Jun 2026 10:00:00 GMTsqlwindow-functionscorrelated-subqueriesengineeringplannerOriginChain engineering (OriginChain)RAG at 10M documents: the failure mode the four-vendor stack hideshttps://originchain.ai/blogs/rag-at-10m-documents-the-four-vendor-failure-modeoriginchain-blog:rag-at-10m-documents-the-four-vendor-failure-modeProduction RAG at 10M documents is a 10-stage pipeline against four databases. The article that everyone shares this week gets the math right - and misses the silent cross-system version skew that quietly degrades the Stage 7 faithfulness check. Here is the failure mode, why it is unreachable on a single-WAL engine, and the code.Tue, 26 May 2026 08:00:00 GMTragvector-searchhybrid-searchengineeringOriginChain (OriginChain)From a 20-second dashboard to 300 ms: rethinking how we fetch tenant credentialshttps://originchain.ai/blogs/from-20s-dashboard-to-300msoriginchain-blog:from-20s-dashboard-to-300msEvery dashboard page that hit a customer's tenant engine first asked SSM RunCommand to read a file off the EC2. Cold path: 4–15 seconds with a tail at 20 s. Replacing it with SSM Parameter Store cut that to ~150 ms, with a backwards-compatible fallback and an opportunistic backfill.Tue, 12 May 2026 20:00:00 GMTinfrastructureperformanceawscontrol-planeOriginChain (OriginChain)Auto-generated Idempotency-Key: the database SDK that doesn't ask you to thinkhttps://originchain.ai/blogs/auto-idempotency-key-sdkoriginchain-blog:auto-idempotency-key-sdkEvery mutating call from our SDK now ships an Idempotency-Key UUID automatically. The user never types one. The check that had to land first wasn't 'does it work' - it was 'is the engine's cache bounded enough that a fresh key per call won't OOM us.'Tue, 12 May 2026 16:00:00 GMTsdkdxidempotencyapi-designOriginChain (OriginChain)OriginChain vs Redis: when each fitshttps://originchain.ai/blogs/vs-redisoriginchain-blog:vs-redisRedis is the gold standard for in-memory KV with sub-ms reads. OriginChain matches Redis at small scale and exceeds it on durability semantics, atomic multi-shape writes, and built-in vector search. Redis is right when your data fits in RAM; OriginChain when you want a primary-store with the same speed feel and AI-native shapes.Wed, 06 May 2026 10:26:12 GMTrediscomparisonkey-valueperformancearchitectureOriginChain Team (OriginChain)OriginChain vs Supabase: bundled stack vs focused databasehttps://originchain.ai/blogs/vs-supabaseoriginchain-blog:vs-supabaseSupabase bundles Postgres + Auth + Storage + Realtime + Edge Functions for prototype velocity. OriginChain is a focused database substrate purpose-built for AI workloads. If you want batteries-included, pick Supabase; if you want the database to be excellent at AI shapes and you'll bring your own auth/storage, pick OriginChain.Wed, 06 May 2026 10:26:12 GMTsupabasecomparisonpostgresarchitectureai-nativeOriginChain Team (OriginChain)Backpressure done right: 429 + Retry-After in OriginChainhttps://originchain.ai/blogs/backpressure-429originchain-blog:backpressure-429When a database can't keep up with writes, it has two choices: refuse cleanly (graceful) or accept and silently fail later (catastrophic). OriginChain's per-API-key backpressure is HTTP 429 with a precise Retry-After. Polite clients recover within seconds; impolite ones never crash the substrate. Here's how it works.Wed, 06 May 2026 10:26:12 GMTbackpressurerate-limitingdesignreliabilityapiOriginChain Team (OriginChain)The economics of agent memory at 100M tool calls/monthhttps://originchain.ai/blogs/agent-memory-economicsoriginchain-blog:agent-memory-economicsAn autonomous agent at 100M tool calls/month produces ~50 GB of trace data and ~75 GB of embeddings. Storage is the small line; what dominates is retention policy and re-embedding. The LLM call itself is 99.8% of the bill. Here's the actual math, with the four levers that move it.Wed, 06 May 2026 10:26:12 GMTcostagent-memoryscalingttltutorialOriginChain Team (OriginChain)OriginChain vs Postgres + pgvector: when each fitshttps://originchain.ai/blogs/vs-postgres-pgvectororiginchain-blog:vs-postgres-pgvectorPostgres + pgvector is great for teams already running Postgres and adding vectors as a feature. OriginChain is purpose-built for AI-agent workloads with thousands of writes per session, atomic multi-shape transactions, and sub-millisecond reads. Honest comparison.Tue, 05 May 2026 06:20:43 GMTpostgrespgvectorcomparisonarchitecturevector-databaseOriginChain Team (OriginChain)OriginChain vs DynamoDB: when each fitshttps://originchain.ai/blogs/vs-dynamodboriginchain-blog:vs-dynamodbDynamoDB is brilliant operational KV - flat fees, infinite scale, zero servers. OriginChain is right when vectors are first-class shapes, you need atomic multi-shape writes, and you're comfortable with a less-mature operational footprint in exchange for AI-shaped throughput.Tue, 05 May 2026 06:20:43 GMTdynamodbcomparisonkey-valuearchitectureawsOriginChain Team (OriginChain)OriginChain vs Pinecone: vectors with payloads vs vectors with sidecarshttps://originchain.ai/blogs/vs-pineconeoriginchain-blog:vs-pineconePinecone stores vectors and assumes your entity data lives elsewhere - you dual-write. OriginChain stores vectors as shapes co-located with parent entities, atomic in one the log. If you're paying the dual-write architecture's tax today, here's the alternative.Tue, 05 May 2026 06:20:43 GMTpineconevector-databasecomparisonarchitectureannOriginChain Team (OriginChain)Why we don't need a separate vector databasehttps://originchain.ai/blogs/no-separate-vector-databaseoriginchain-blog:no-separate-vector-databaseA separate vector database is the wrong shape for AI-native applications. Vectors describe entities; entities have lifecycles. Splitting them across two systems creates dual-write bugs and operational overhead. The correct architecture is one substrate, multiple shapes.Tue, 05 May 2026 06:20:43 GMTarchitecturevector-databasedesignai-nativeOriginChain Team (OriginChain)Per-key TTL: building agent memory that forgetshttps://originchain.ai/blogs/per-key-ttl-agent-memoryoriginchain-blog:per-key-ttl-agent-memoryAI agent memory is full of records that should expire on their own - tool-call traces, session caches, embedding refreshes. Per-key TTL on a fast K/V substrate gets you that pattern without sweeper jobs. Here's how OriginChain handles it and what it lets you build.Tue, 05 May 2026 06:20:43 GMTttlagent-memoryephemeraldesigntutorialOriginChain Team (OriginChain)RAG at production scale: where the latency goeshttps://originchain.ai/blogs/rag-latency-budgetoriginchain-blog:rag-latency-budgetProduction RAG has a 1-3 second latency budget for retrieval-and-generation. Most is the LLM. Database round-trips add up surprisingly fast - and are the easiest place to lose 200ms you didn't have. Walking through where the time goes with realistic numbers.Tue, 05 May 2026 06:20:43 GMTragperformancelatencyanntutorialOriginChain Team (OriginChain)The cost of an AI feature at 100K, 1M, and 10M usershttps://originchain.ai/blogs/ai-feature-cost-modeloriginchain-blog:ai-feature-cost-modelAI features look free in the prototype phase, then the bill hits. LLM tokens dominate (95%), embeddings are small (3%), database is rounding error (2%). Here's a realistic per-user cost model with line items at each scale, and what to optimize first.Tue, 05 May 2026 06:20:43 GMTpricingcosttutorialscalingllmOriginChain Team (OriginChain)Idempotent tool calls: building reliable agent loopshttps://originchain.ai/blogs/idempotent-tool-callsoriginchain-blog:idempotent-tool-callsProduction AI agents make tool calls that have side effects: emails, charges, webhooks. Network retries and LLM hallucinations mean you'll get the same call twice. Here's the idempotency-key playbook on OriginChain - atomic CAS + per-key TTL + the wait-for-result path.Tue, 05 May 2026 06:20:43 GMTagentstool-callsidempotencyreliabilitytutorialOriginChain Team (OriginChain)HA snapshot bootstrap: failover with zero committed-write losshttps://originchain.ai/blogs/ha-snapshot-bootstraporiginchain-blog:ha-snapshot-bootstrapMost database failovers silently drop writes that landed during the cutover window. Here's how OriginChain's snapshot-based failover design closes that window - verified by chaos drill on every release.Mon, 04 May 2026 20:59:15 GMThareplicationarchitecturewalconsensusOriginChain Team (OriginChain)OriginChain quickstart: zero to your first AI feature in 5 minuteshttps://originchain.ai/blogs/quickstartoriginchain-blog:quickstartProvision a managed instance, write a JSON record + vector embedding atomically, run a similarity search, hydrate the matched record. Five minutes, no schema migration, no glue code.Mon, 04 May 2026 20:59:15 GMTtutorialquickstartsdkgetting-startedvector-searchOriginChain Team (OriginChain)Why our roadmap to 1.0 is depth-first (and what's not on it)https://originchain.ai/blogs/depth-first-roadmap-to-1-0originchain-blog:depth-first-roadmap-to-1-0OriginChain's roadmap is HA → fuzzing → optimiser → EXPLAIN → multi-writer → online-schema. Six months. No graph, no FTS, no time-series specialty types until that's done. Here's why depth-first is the right bet for an AI-native database.Mon, 04 May 2026 20:59:15 GMTroadmapproductdesignarchitectureOriginChain Team (OriginChain)Eight competitor comparison pages, three parallel agents, one afternoonhttps://originchain.ai/blogs/eight-vs-pages-three-agents-one-afternoonoriginchain-blog:eight-vs-pages-three-agents-one-afternoonWe shipped /vs/postgres, /vs/pinecone, /vs/weaviate, /vs/qdrant, /vs/milvus, /vs/supabase, /vs/neon, and /vs/mongodb in under an afternoon by running three Claude Code agents in parallel. The pattern, the prompt, and the part agents don't replace.Mon, 04 May 2026 16:00:00 GMTengineeringai-toolsagentsmarketing-siteOriginChain (OriginChain)Why we shipped a vanilla OpenAPI spec when nobody asked for onehttps://originchain.ai/blogs/openapi-spec-and-the-ai-coding-looporiginchain-blog:openapi-spec-and-the-ai-coding-loopIn 2026, engineers don't write SDKs - they ask their AI agent to. The agent fetches your OpenAPI spec, runs openapi-generator, and ships a working client in 30 seconds. If you don't have a spec, you don't exist in that loop.Mon, 04 May 2026 13:00:00 GMTopenapiai-toolssdkdeveloper-experienceOriginChain (OriginChain)One MCP server, every AI IDE: OriginChain inside Claude Desktop and Cursorhttps://originchain.ai/blogs/mcp-server-launchoriginchain-blog:mcp-server-launchDatabase vendors who don't ship an MCP server in 2026 are invisible to the agents writing the code. Here's what @originchain/mcp-server exposes - five tools, stdio transport, env-var config - and what MCP doesn't yet solve.Mon, 04 May 2026 10:00:00 GMTmcpai-toolsclaudecursorOriginChain (OriginChain)