Skip to Content
BlogSprint 1 Complete: 46 Issues, 42K Lines

Sprint 1 Complete: 46 Issues, 42K Lines

Sprint 1 is done. 46 issues closed, 235 files changed, approximately 42,700 lines of code added across the entire platform. Here’s what shipped.

Gateway: 34 New Modules

The AI gateway grew from a core proxy into a full governance platform:

  • Security scanner — prompt injection, jailbreak, and data exfiltration detection with multi-signal risk scoring
  • DAG orchestrator — conditional branching workflows with per-stage cost budgets
  • Trace context — W3C Trace Context propagation across fleet agents
  • Fleet cost tracker — cross-agent cost attribution for multi-runner fleets
  • Unified cache — Redis cache layer with provider-aware TTL
  • Connection pool — managed httpx connection pools for upstream providers
  • SSE heartbeat — keepalive for long-running streaming responses
  • Error classifier — normalized error codes and automatic retry classification
  • Error replay — replay failed requests with modified parameters
  • Model recommender — suggest cheaper models based on usage patterns
  • Cost attribution — project-based cost tagging with custom labels
  • Ephemeral tokens — short-lived tokens for browser-based gateway access
  • Policy YAML — define governance policies as YAML files
  • Dashboard subscriptions — real-time WebSocket updates for dashboard widgets
  • CloudEvents — standardized event format for webhooks

Plus 19 more modules covering warmup, route registry, API versioning, webhook event storage, and more.

27 New Gateway Routes

Every new module is exposed via gateway routes, bringing the total to 130 route files. Key additions:

  • /gateway/security-scanner/* — scan status, configuration, test endpoints
  • /gateway/dag/* — create, execute, and monitor DAG workflows
  • /gateway/traces/* — distributed tracing, span ingestion, HITL checkpoints
  • /gateway/fleet-costs/* — fleet-wide cost aggregation and breakdown
  • /gateway/cost-attribution/* — query costs by model, provider, tag, key
  • /gateway/error-replay/* — replay and debug failed requests
  • /gateway/model-recommendations/* — cost optimization suggestions

Observer SDK v2

A new distributed tracing SDK for Python and TypeScript:

  • Python: CurateMeTracer with span context managers, @trace decorator, per-span cost tracking, and @hitl_checkpoint for human-in-the-loop gating
  • TypeScript: Observer class with WebSocket streaming, 15+ tracking methods, privacy-aware redaction, and automatic reconnection
  • W3C Trace Context: Full traceparent header propagation across services

CLI Commands

New gateway-focused CLI commands:

curate gateway test --full # Health check + smoke test curate gateway status # Latency and pool status curate costs today # Today's spend curate costs by-model # Cost breakdown by model curate runners health # Runner fleet health

MCP Server: 12 New Tools

The in-container MCP server gained tools for desktop automation, BYOVM management, warm pool control, and compliance:

  • curate_desktop_click, curate_desktop_type, curate_desktop_screenshot_region
  • curate_byovm_status, curate_byovm_capabilities
  • curate_warm_pool_status, curate_warm_pool_request
  • curate_compliance_check, curate_compliance_report, curate_audit_log

Total MCP tools: 43 across 7 tiers.

Developer Documentation

  • Quickstart guides — gateway, Python, TypeScript, cURL
  • Governance reference — full 13-step chain documentation
  • Cost tracking guide — attribution, budgets, alerts
  • 5 operational runbooks — budget exceeded, high latency, PII blocked, rate limits, stuck runners
  • Framework examples — LangChain, CrewAI, OpenAI Agents, Vercel AI, Claude SDK

Dashboard: 10 New Pages

  • Cost attribution dashboard with tag-based drill-down
  • DAG orchestration builder, list, and detail views
  • Onboarding verification test page
  • Latency benchmarks visualization
  • Runner inspector with live resource monitoring
  • Pricing calculator

Tests: 380+ New

45 new test files covering all Sprint 1 modules. Total test count across the platform: 20,800+ backend, 2,118 frontend, 87+ E2E.

What’s Next

Sprint 1 built the foundation. The platform now has:

  • 85 gateway modules with 130 routes
  • 51 LLM providers
  • 121 runner control plane modules
  • 229 dashboard pages
  • Full SDK coverage (Python, TypeScript, CLI, MCP, Observer, Embed)

Next up: design partner outreach, production hardening, and the features that design partners request. The platform is ready.