v1.0.0 · March 2026 · GA

The first general availability release of BoltPipeline. SQL in, governed pipelines out — with full lifecycle management, security by architecture, and zero access to your raw data.

Pipeline Compilation

8-stage compilation: splitting, classification, validation, dependency resolution, SCD generation, lineage, profiling, artifact assembly
SQL-to-pipeline — submit SQL with comment-based metadata tags, get deployment-ready artifacts
Dependency resolution with topological sort (two-level DAG: group-level + microstep-level)
Template-driven code generation via Jinja (dialect-aware, auditable, consistent)
Snowflake adapter (GA)
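The two-level dependency resolution listed above can be pictured with a short sketch. This is an illustration only, not BoltPipeline's compiler: the group names, step names, and the `execution_order` helper are hypothetical, and Python's standard `graphlib` stands in for whatever the product uses internally.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each group maps to the groups it depends on.
group_deps = {
    "staging": set(),
    "dimensions": {"staging"},
    "facts": {"staging", "dimensions"},
}

# Hypothetical microsteps inside each group: step -> steps it depends on.
microstep_deps = {
    "staging": {"load_orders": set(), "load_customers": set()},
    "dimensions": {"dim_customer": set()},
    "facts": {"fact_orders": set(), "fact_orders_agg": {"fact_orders"}},
}


def execution_order(group_deps, microstep_deps):
    """Flatten a two-level DAG into one ordered list of (group, step) pairs."""
    order = []
    # Level 1: order the groups so every group follows its prerequisites.
    for group in TopologicalSorter(group_deps).static_order():
        # Level 2: order the microsteps within that group the same way.
        for step in TopologicalSorter(microstep_deps[group]).static_order():
            order.append((group, step))
    return order


plan = execution_order(group_deps, microstep_deps)
```

`TopologicalSorter` raises `graphlib.CycleError` when the input contains a cycle, so a circular dependency surfaces at planning time rather than at execution time.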

SCD Automation

SCD Type 0 (append-only) — new rows appended, existing rows never modified
SCD Type 1 (overwrite) — MERGE with UPDATE for changed rows
SCD Type 2 (full history) — hash-based change detection, effective dates, active/inactive flags, versioning
Auto-generated staging tables, merge SQL, and audit columns
DB-free generation — SCD logic computed without database connection
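To make the SCD Type 2 behavior concrete, here is a minimal in-memory sketch of hash-based change detection with effective dates, an active flag, and a version counter. The release itself generates merge SQL; this Python version only illustrates the logic. The column names (`row_hash`, `effective_from`, `effective_to`, `is_active`, `version`) and both helper functions are assumptions, not the product's actual audit columns.

```python
import hashlib
from datetime import date


def row_hash(row, tracked_columns):
    """Hash the tracked attribute values of a row into one fingerprint."""
    payload = "|".join(repr(row.get(col)) for col in tracked_columns)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def apply_scd2(history, incoming, key, tracked_columns, as_of):
    """Return a new history list after merging incoming rows, SCD Type 2 style."""
    result = [dict(r) for r in history]  # copy so the input is not mutated
    active = {r[key]: r for r in result if r["is_active"]}
    for row in incoming:
        new_hash = row_hash(row, tracked_columns)
        current = active.get(row[key])
        if current is not None and current["row_hash"] == new_hash:
            continue  # unchanged: keep the current version as-is
        version = 1
        if current is not None:
            # Close out the previous version instead of overwriting it.
            current["is_active"] = False
            current["effective_to"] = as_of
            version = current["version"] + 1
        new_row = {
            **row,
            "row_hash": new_hash,
            "effective_from": as_of,
            "effective_to": None,
            "is_active": True,
            "version": version,
        }
        result.append(new_row)
        active[row[key]] = new_row
    return result


history = apply_scd2([], [{"id": 1, "city": "Oslo"}], "id", ["city"], date(2026, 3, 1))
history = apply_scd2(history, [{"id": 1, "city": "Bergen"}], "id", ["city"], date(2026, 3, 15))
```

After the second call, `history` holds two rows for `id` 1: the Oslo row closed on 2026-03-15 and the Bergen row active as version 2. Resubmitting the Bergen row leaves the history unchanged because the hashes match.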

Column Lineage

AST-derived lineage — computed directly from SQL using sqlglot, no runtime tracing
Table-level and column-level lineage across all pipeline steps
DML coverage: INSERT...SELECT, UPDATE, DELETE, CTAS, CREATE VIEW, MERGE
JSON export for external catalog or governance tools

Smart Profiling

SQL push-down profiling — all queries execute inside your database, only aggregate metrics returned
12 metric categories: uniqueness, null %, cardinality, data type distribution, and more
Join inference with cardinality-based type detection (1:1, N:1, 1:N)
PII detection — 5 regex patterns (SSN, credit card, email, phone, IP) via SQL REGEXP push-down
Pipeline-aware scope — only referenced tables profiled, no warehouse-wide scans
Background scheduling — profiling runs during low-activity database windows
Production-first design — production profiles serve as source of truth for all environments
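Cardinality-based join type detection can work from aggregate counts alone, which fits the push-down model described above. The sketch below is an assumption about how such a classifier might look, not the shipped implementation. The function name is hypothetical, the inputs are taken to be non-null row counts and distinct counts of the join key on each side, and the `N:N` fallback is added here for completeness (the release notes list only 1:1, N:1, and 1:N).

```python
def infer_join_type(left_rows, left_distinct, right_rows, right_distinct):
    """Classify a join candidate from aggregate key counts only.

    Each side supplies COUNT(key) and COUNT(DISTINCT key); a side is
    unique on the key when the two numbers are equal.
    """
    left_unique = left_rows == left_distinct
    right_unique = right_rows == right_distinct
    if left_unique and right_unique:
        return "1:1"
    if right_unique:
        return "N:1"  # many left rows point at one right row
    if left_unique:
        return "1:N"  # one left row fans out to many right rows
    return "N:N"  # assumption: not a category named in the release notes


# Hypothetical example: 1,000 orders referencing 200 distinct customers,
# joined to a customers table with 200 rows and 200 distinct ids.
orders_to_customers = infer_join_type(1000, 200, 200, 200)
```

Because only four integers per candidate pair are needed, no row-level data has to leave the database to make this decision.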

Validation & Rule Engine

30+ rule validation engine — column existence, data types, contracts, audit columns, nullability, SCD configuration, schema drift
Auto-generated actionable DDL (ALTER TABLE / CREATE TABLE) to resolve schema drift
Custom SQL tests per pipeline step
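As a rough picture of what "actionable DDL" means, the following sketch compares an expected column contract with the columns actually found and emits statements to close the gap. It is a simplified assumption, not the rule engine itself: the `actionable_ddl` helper and the table and column names are hypothetical, and type mismatches are only flagged as review comments because changing a column type is not always safe to automate.

```python
def actionable_ddl(table, expected, actual):
    """Emit DDL that brings `actual` columns in line with `expected`.

    `expected` and `actual` map column name -> data type.
    Pass actual=None when the table does not exist yet.
    """
    if actual is None:
        cols = ",\n  ".join(f"{name} {dtype}" for name, dtype in expected.items())
        return [f"CREATE TABLE {table} (\n  {cols}\n);"]
    statements = []
    for name, dtype in expected.items():
        if name not in actual:
            # Missing column: safe to generate an additive ALTER.
            statements.append(f"ALTER TABLE {table} ADD COLUMN {name} {dtype};")
        elif actual[name].upper() != dtype.upper():
            # Type drift: surface it for a human instead of altering blindly.
            statements.append(
                f"-- REVIEW: {table}.{name} is {actual[name]}, expected {dtype}"
            )
    return statements


ddl = actionable_ddl(
    "dim_customer",
    expected={"id": "NUMBER", "email": "VARCHAR"},
    actual={"id": "NUMBER"},
)
```

Here `ddl` contains a single `ALTER TABLE dim_customer ADD COLUMN email VARCHAR;` statement.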

Drift Detection

Schema drift — column additions, removals, type changes detected automatically
Volume anomaly — row count changes flagged (warning at 50%, critical at 90%)
Freshness monitoring — staleness tracking with configurable thresholds
PII drift — new PII detected in previously clean tables blocks deployment
SCD readiness — key uniqueness and temporal gap validation
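The volume anomaly thresholds can be expressed as a small classifier. The 50% and 90% values come from the list above; everything else is an assumption made for illustration: the function name, measuring change relative to the previous row count, treating increases and decreases alike, using inclusive thresholds, and the handling of an empty baseline.

```python
def classify_volume_change(previous_rows, current_rows, warning=0.50, critical=0.90):
    """Classify a row-count change as 'ok', 'warning', or 'critical'."""
    if previous_rows == 0:
        # Assumption: any growth from an empty baseline is treated as critical.
        return "ok" if current_rows == 0 else "critical"
    change = abs(current_rows - previous_rows) / previous_rows
    if change >= critical:
        return "critical"
    if change >= warning:
        return "warning"
    return "ok"


# Hypothetical example: a table that shrank from 1,000 rows to 400 (a 60% drop).
status = classify_volume_change(1000, 400)
```

With these assumptions, a 60% drop yields `"warning"` and a 95% drop yields `"critical"`.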

Open Format Artifacts

DML Plan SQL (.sql) — deployment-ready SQL with INSERT, MERGE, SCD logic
Actionable DDL (.sql) — ALTER/CREATE statements to resolve schema drift
Plan Summary (.yaml) — step list, execution order, SCD types, metrics
Airflow YAML (.yaml) — orchestration definition with resolved dependencies
Lineage Graph (.json) — table-level and column-level lineage
Profiling Results (.json) — column stats, join candidates, drift findings
No vendor lock-in — if you leave, you keep every artifact

Security

Per-agent mTLS — every agent gets its own X.509 certificate with full PKI chain validation
Metadata-only communication — raw data and credentials never leave your environment
Credential isolation — database passwords managed entirely in your secret manager
Scratch schema execution — profiling and validation never modify production tables
One-way agent communication — agents pull instructions, platform cannot reach into your infrastructure
Auto-rotating certificates — 90-day cycle with session-based refresh

PCO Lifecycle & Operations

Plan → Certify → Operate governed lifecycle with tollgates between phases
DAG-scheduled parallel execution with dependency-aware ordering
Offline resilience — encrypted local result queuing with automatic reconnection
Environment promotion — Dev → Integration → Production with re-certification at each stage
Role-based access — Viewer, Developer, Operator, Admin with separation of duties
6 background task handlers for async pipeline processing
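Dependency-aware parallel execution can be sketched as grouping steps into waves, where every step in a wave has all of its prerequisites finished and could therefore run concurrently. This is a minimal illustration using the standard library, not the platform's scheduler; the step names and the `execution_waves` helper are hypothetical.

```python
from graphlib import TopologicalSorter


def execution_waves(deps):
    """Group steps into waves; steps within a wave have no unmet dependencies."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    waves = []
    while sorter.is_active():
        # Everything returned here is runnable right now, in parallel.
        ready = sorted(sorter.get_ready())
        waves.append(ready)
        # Mark the wave complete so dependent steps become ready.
        sorter.done(*ready)
    return waves


waves = execution_waves({
    "load_orders": set(),
    "load_customers": set(),
    "dim_customer": {"load_customers"},
    "fact_orders": {"load_orders", "dim_customer"},
})
```

For this graph the result is three waves: the two loads first, then `dim_customer`, then `fact_orders`. A real executor would hand each wave to a worker pool and call `done` per step as it finishes rather than per wave.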

Platform

Console — web UI for SQL upload, pipeline management, lineage, profiling results
Command Center — hosted compilation, validation, artifact storage, agent work queue
Agent — lightweight Python runtime in your infrastructure, polls Command Center for work
Multi-tenant architecture with tenant isolation
Per-pipeline pricing — all features included, unlimited users

Test Coverage

186 Agent tests
388 Command Center tests

Stored procedures
