Quick Start — Gateway
Stand up the gateway, connect a provider, and make your first guarded LLM call in under five minutes.
Quick Start — Gateway
Stand up the gateway, connect a provider, and make your first guarded LLM call in under five minutes.
Performance & Cost
Five optimizations ship enabled (or one flag away): embedded runtime, speculative dispatch, async post-guards, per-category timeouts, semantic cache short-circuit.
AI Guardrails
Configure safety policies across all five evaluation stages — input, output, MCP action, RAG retrieval, and post-LLM.
SDK Integrations
Drop-in adapters for OpenAI, Anthropic, Bedrock, Gemini, LiteLLM, LangChain, and PydanticAI.
Real-time, in-process
Guard evaluation runs in the gateway itself — no RPC hop, no sidecar. Embedded runtime is on by default in single-binary deployments.
Speculative dispatch
Fire the provider call in parallel with input guards. Allow-path latency becomes
max(guards, model) instead of guards + model.
Smart caching cuts spend
Semantic cache short-circuits the guard + provider pipeline on templated traffic. Up to 60% cost reduction on chatbot-style workloads.
Auditable by default
Every decision lands in the evidence store — sync, shadow, and async findings alike. Export-ready for SOC 2 / ISO 27001.
GitHub
Star, file issues, or send PRs.
Discord
Real-time help, roadmap discussion, design reviews.