Skip to content

Built-in Observability

DeepIntShield includes built-in observability, a powerful feature that automatically captures and stores detailed information about every AI request and response that flows through your system. This provides structured, searchable data with real-time monitoring capabilities, making it easy to debug issues, analyze performance patterns, and understand your AI application’s behavior at scale.

All LLM interactions are captured with comprehensive metadata including inputs, outputs, tokens, costs, and latency. The logging plugin operates asynchronously with zero impact on request latency.

Live Log Stream Interface


DeepIntShield traces comprehensive information for every request, without any changes to your application code.

Complete Request Tracing Overview

  • Input Messages: Complete conversation history and user prompts
  • Model Parameters: Temperature, max tokens, tools, and all other parameters
  • Provider Context: Which provider and model handled the request
  • Output Messages: AI responses, tool calls, and function results
  • Performance Metrics: Latency and token usage
  • Status Information: Success or error details
  • Logging Headers: Capture configured request headers (e.g., X-Tenant-ID) into log metadata
  • Ad-hoc Headers: Any x-bf-lh-* prefixed header is automatically captured into metadata
  • See Logging Headers below for full details
  • Audio Processing: Speech synthesis and transcription inputs/outputs
  • Vision Analysis: Image URLs and vision model responses
  • Tool Execution: Function calling arguments and results

Multimodal Request Tracing


The logging plugin intercepts all requests flowing through DeepIntShield using the plugin architecture, ensuring your LLM requests maintain optimal performance:

  1. PreLLMHook: Captures request metadata (provider, model, input messages, parameters).
  2. Async Processing: Logs are written in background goroutines with sync.Pool optimization.
  3. PostLLMHook: Updates log entry with response data (output, tokens, cost, latency, errors).
  4. Real-time Updates: WebSocket broadcasts keep the UI synchronized.

All logging operations are non-blocking, ensuring your LLM requests maintain optimal performance.


Configure request tracing to control what gets logged and where it’s stored.

Tracing Configuration Interface

  1. Navigate to http://localhost:8080
  2. Go to “Settings”
  3. Toggle “Enable Logs”

Retrieve and analyze logs with powerful filtering capabilities via the UI, API, and WebSockets.

Advanced Log Filtering Interface

When running the Gateway, access the built-in dashboard at http://localhost:8080. The UI provides:

  • Real-time log streaming
  • Advanced filtering and search
  • Detailed request/response inspection
  • Token and cost analytics

Query logs programmatically using the GET request.

Terminal window
curl 'http://localhost:8080/api/logs?' \
'providers=openai,anthropic&' \
'models=gpt-4o-mini&' \
'status=success,error&' \
'start_time=2024-01-15T00:00:00Z&' \
'end_time=2024-01-15T23:59:59Z&' \
'min_latency=1000&' \
'max_latency=5000&' \
'min_tokens=10&' \
'max_tokens=1000&' \
'min_cost=0.001&' \
'max_cost=10&' \
'content_search=python&' \
'limit=100&' \
'offset=0'

Available Filters:

FilterDescriptionExample
providersFilter by AI providersopenai,anthropic
modelsFilter by specific modelsgpt-4o-mini,claude-3-sonnet
statusRequest statussuccess,error,processing
objectsRequest typeschat.completion,embedding
start_time / end_timeTime range (RFC3339)2024-01-15T10:00:00Z
min_latency / max_latencyResponse time (ms)1000 to 5000
min_tokens / max_tokensToken usage range10 to 1000
min_cost / max_costCost range (USD)0.001 to 10
content_searchSearch in messages"error handling"
limit / offsetPagination100, 200

Response Format

{
"logs": [...],
"pagination": {
"limit": 100,
"offset": 0,
"sort_by": "timestamp",
"order": "desc"
},
"stats": {
"total_requests": 1234,
"success_rate": 0.85,
"average_latency": 100,
"total_tokens": 10000,
"total_cost": 100
}
}

Perfect for analytics, debugging specific issues, or building custom monitoring dashboards.

Subscribe to real-time log updates for live monitoring:

const ws = new WebSocket('ws://localhost:8080/ws')
ws.onmessage = (event) => {
const logUpdate = JSON.parse(event.data)
console.log('New log entry:', logUpdate)
}

Choose the right storage backend for your scale and requirements.

The logging plugin is automatically enabled in Gateway mode with SQLite storage by default. You can configure it to use PostgreSQL by setting the logs_store configuration in your config.json file.

  • Best for: Development, small-medium deployments
  • Performance: Excellent for read-heavy workloads
  • Setup: Zero configuration, single file storage
  • Limits: Single-writer, local filesystem only
{
"logs_store": {
"enabled": true,
"type": "sqlite",
"config": {
"path": "./logs.db"
}
}
}
  • MySQL: For traditional MySQL environments.
  • ClickHouse: For large-scale analytics and time-series workloads.

The logging plugin captures all DeepIntShield request types:

  • Text Completion (streaming and non-streaming)
  • Chat Completion (streaming and non-streaming)
  • Responses (streaming and non-streaming)
  • Embeddings
  • Speech Generation (streaming and non-streaming)
  • Transcription (streaming and non-streaming)
  • Video Generation

Capture specific HTTP request headers into the metadata field of every LLM and MCP log entry. This enables request tracing, tenant identification, and custom debugging without modifying your application code.

There are two ways headers get captured into log metadata:

1. Configured Logging Headers — Define a list of header names in the configuration. The logging plugin looks up each configured header (case-insensitive) and stores its value in the metadata.

2. x-bf-lh-* Prefix (Automatic) — Any request header with the x-bf-lh- prefix is automatically captured into metadata with no configuration needed. The prefix is stripped and the remainder becomes the metadata key.

Request HeaderMetadata KeyMetadata Value
x-bf-lh-tenant-id: acmetenant-idacme
x-bf-lh-env: productionenvproduction
x-bf-lh-region: us-east-1regionus-east-1

Both methods can be used together — configured headers and x-bf-lh-* headers are merged into the same metadata map.

  1. Navigate to Config > Logging
  2. Ensure Enable Logs is toggled on
  3. Scroll to Logging Headers

Logging Headers Configuration

  1. Enter a comma-separated list of header names (e.g., X-Tenant-ID, X-Correlation-ID)
  2. Click Save Changes

Changes take effect immediately — no restart required.

Configured headers:

Terminal window
# Config has: logging_headers: ["X-Tenant-ID", "X-Correlation-ID"]
curl http://localhost:8080/v1/chat/completions \
-H "Content-Type: application/json" \
-H "X-Tenant-ID: tenant-123" \
-H "X-Correlation-ID: req-abc-456" \
-d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'

Log metadata: {"x-tenant-id": "tenant-123", "x-correlation-id": "req-abc-456"}

Ad-hoc x-bf-lh-* headers (no config needed):

Terminal window
curl http://localhost:8080/v1/chat/completions \
-H "Content-Type: application/json" \
-H "x-bf-lh-env: production" \
-H "x-bf-lh-version: v2.1.0" \
-d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'

Log metadata: {"env": "production", "version": "v2.1.0"}

Metadata is displayed in the log detail view for both LLM and MCP logs as individual key-value entries alongside other request details.

Log Entry with Metadata

Required headers and logging headers serve different purposes and can be used together:

FeaturePurposeEffect on Request
Required HeadersEnforce header presenceRejects request if missing (400)
Logging HeadersCapture header valuesNo effect on request — only logs metadata

A common pattern is to require a header and log it:

{
"client": {
"required_headers": ["X-Tenant-ID"],
"logging_headers": ["X-Tenant-ID"]
}
}

Use the built-in logging plugin for:

  • Local Development: Quick setup with SQLite, no external dependencies
  • Self-hosted Deployments: Full control over your data with PostgreSQL
  • Simple Use Cases: Basic monitoring and debugging needs
  • Privacy-sensitive Workloads: Keep all logs on your infrastructure

Switch to the Maxim plugin for:

  • Advanced evaluation and testing workflows
  • Prompt engineering and experimentation
  • Multi-team governance and collaboration
  • Production monitoring with alerts and SLAs
  • Dataset management and annotation pipelines

Switch to the OTel plugin for:

  • Integration with existing observability infrastructure
  • Correlation with application traces and metrics
  • Custom collector configurations
  • Compliance and enterprise requirements

The logging plugin is designed for zero-impact observability:

  • Async Operations: All database writes happen in background goroutines
  • Sync.Pool: Reuses memory allocations for LogMessage and UpdateLogData structs
  • Batch Processing: Efficiently handles high request volumes
  • Automatic Cleanup: Removes stale processing logs every 30 seconds

In benchmarks, the logging plugin adds < 0.1ms overhead to request processing time.