Cloudflare Services Integration

This document describes all Cloudflare services integrated into the adblock-compiler project, their current status, and configuration guidance.


Service Status Overview

| Service | Status | Binding | Purpose |
|---|---|---|---|
| KV Namespaces | ✅ Active | COMPILATION_CACHE, RATE_LIMIT, METRICS | Caching, rate limiting, metrics aggregation |
| R2 Storage | ✅ Active | FILTER_STORAGE | Filter list storage and artifact persistence |
| D1 Database | ✅ Active | DB | Compilation history, deployment records |
| Queues | ✅ Active | ADBLOCK_COMPILER_QUEUE, ADBLOCK_COMPILER_QUEUE_HIGH_PRIORITY | Async compilation, batch processing |
| Analytics Engine | ✅ Active | ANALYTICS_ENGINE | Request metrics, cache analytics, workflow tracking |
| Workflows | ✅ Active | COMPILATION_WORKFLOW, BATCH_COMPILATION_WORKFLOW, CACHE_WARMING_WORKFLOW, HEALTH_MONITORING_WORKFLOW | Durable async execution |
| Hyperdrive | ✅ Active | HYPERDRIVE | Accelerated PostgreSQL (PlanetScale) connectivity |
| Tail Worker | ✅ Active | adblock-compiler-tail | Log collection, error forwarding |
| SSE Streaming | ✅ Active | — | Real-time compilation progress via /compile/stream |
| WebSocket | ✅ Active | — | Real-time bidirectional compile via /ws/compile |
| Observability | ✅ Active | — | Built-in logs and traces via [observability] |
| Cron Triggers | ✅ Active | — | Cache warming (every 6h), health monitoring (every 1h) |
| Pipelines | ✅ Configured | METRICS_PIPELINE | Metrics/audit event ingestion → R2 |
| Log Sink (HTTP) | ✅ Configured | LOG_SINK_URL (env var) | Tail worker forwards to external log service |
| API Shield | 📋 Dashboard | — | OpenAPI schema validation at edge (see below) |
| Containers | 🔧 Configured | ADBLOCK_COMPILER | Durable Object container (production only) |

Cloudflare Pipelines

Pipelines provide scalable, batched HTTP event ingestion — ideal for routing metrics and audit events to R2 or downstream analytics.

Setup

# Create the pipeline (routes to R2)
wrangler pipelines create adblock-compiler-metrics-pipeline \
  --r2-bucket adblock-compiler-r2-storage \
  --batch-max-mb 10 \
  --batch-timeout-secs 30

Usage

The PipelineService (src/services/PipelineService.ts) provides a type-safe wrapper:

import { PipelineService } from '../src/services/PipelineService.ts';

const pipeline = new PipelineService(env.METRICS_PIPELINE, logger);

await pipeline.send({
    type: 'compilation_success',
    requestId: 'req-123',
    durationMs: 250,
    ruleCount: 12000,
    sourceCount: 5,
});

Configuration

The binding is defined in wrangler.toml:

[[pipelines]]
binding = "METRICS_PIPELINE"
pipeline = "adblock-compiler-metrics-pipeline"
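The wrapper itself is thin. A minimal sketch of what such a service might look like is below; the ingestedAt timestamp field, the boolean return value, and the Logger shape are assumptions for illustration, not the project's actual PipelineService implementation:

```typescript
// Minimal shape of the Workers Pipeline binding: accepts a batch of records.
interface Pipeline {
    send(records: object[]): Promise<void>;
}

interface Logger {
    error(msg: string, meta?: unknown): void;
}

type MetricsEvent = {
    type: string;
    requestId: string;
    [key: string]: unknown;
};

class PipelineService {
    constructor(private pipeline: Pipeline, private logger: Logger) {}

    // Stamp each event with an ingestion timestamp and forward it as a
    // one-record batch. Failures are logged but never thrown, so metrics
    // ingestion can't break the request path.
    async send(event: MetricsEvent): Promise<boolean> {
        try {
            await this.pipeline.send([{ ...event, ingestedAt: new Date().toISOString() }]);
            return true;
        } catch (err) {
            this.logger.error('pipeline send failed', err);
            return false;
        }
    }
}
```

Swallowing send errors is a deliberate choice here: dropping a metrics event is cheaper than failing a user-facing compilation.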

Log Sinks (Tail Worker)

The tail worker (worker/tail.ts) can forward structured logs to any HTTP log ingestion endpoint (Better Stack, Grafana Loki, Logtail, etc.).

Configuration

Set these secrets/environment variables:

wrangler secret put LOG_SINK_URL       # e.g. https://in.logs.betterstack.com
wrangler secret put LOG_SINK_TOKEN     # Bearer token for the log sink

Optional env var (defaults to warn):

wrangler secret put LOG_SINK_MIN_LEVEL  # debug | info | warn | error
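For reference, a hedged sketch of the forwarding logic: the handler shape follows the Workers tail handler API, but the level-filtering helper and the JSON payload format are illustrative stand-ins, not the project's worker/tail.ts:

```typescript
const LEVELS = ['debug', 'info', 'warn', 'error'] as const;
type Level = (typeof LEVELS)[number];

// Pure helper: should a log at `level` be forwarded given the threshold?
function shouldForward(level: Level, minLevel: Level): boolean {
    return LEVELS.indexOf(level) >= LEVELS.indexOf(minLevel);
}

interface Env {
    LOG_SINK_URL: string;
    LOG_SINK_TOKEN: string;
    LOG_SINK_MIN_LEVEL?: Level;
}

export default {
    // Tail handler: receives trace items from the producer worker,
    // filters by level, and POSTs the survivors to the HTTP sink.
    async tail(events: { logs: { level: string; message: unknown[] }[] }[], env: Env) {
        const min = env.LOG_SINK_MIN_LEVEL ?? 'warn';
        const lines = events
            .flatMap((e) => e.logs)
            .filter((l) => shouldForward(l.level as Level, min));
        if (lines.length === 0) return;
        await fetch(env.LOG_SINK_URL, {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${env.LOG_SINK_TOKEN}`,
                'Content-Type': 'application/json',
            },
            body: JSON.stringify(lines),
        });
    },
};
```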

Supported Log Sinks

| Service | LOG_SINK_URL | Auth |
|---|---|---|
| Better Stack | https://in.logs.betterstack.com | Bearer token |
| Logtail | https://in.logtail.com | Bearer token |
| Grafana Loki | https://&lt;host&gt;/loki/api/v1/push | Bearer token |
| Custom HTTP | Any HTTPS endpoint | Bearer token (optional) |

API Shield

Cloudflare API Shield enforces OpenAPI schema validation at the edge for all requests to /compile, /compile/stream, and /compile/batch. This is configured in the Cloudflare dashboard — no code changes are required.

Setup

  1. Go to Cloudflare Dashboard → Security → API Shield
  2. Click Add Schema and upload docs/api/cloudflare-schema.yaml
  3. Set Mitigation action to Block for schema violations
  4. Enable for endpoints:
    • POST /compile
    • POST /compile/stream
    • POST /compile/batch

Schema Location

The OpenAPI schema is at docs/api/cloudflare-schema.yaml (auto-generated by deno task schema:cloudflare).


Analytics Engine

The Analytics Engine tracks all key events through src/services/AnalyticsService.ts. Data is queryable via the Workers Analytics Engine SQL API.

Tracked Events

| Event | Description |
|---|---|
| compilation_request | Every incoming compile request |
| compilation_success | Successful compilation with timing and rule count |
| compilation_error | Failed compilation with error type |
| cache_hit / cache_miss | KV cache effectiveness |
| rate_limit_exceeded | Rate limit hits by IP |
| workflow_started / completed / failed | Workflow lifecycle |
| batch_compilation | Batch compile job metrics |
| api_request | All API endpoint calls |
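Each event ultimately becomes an Analytics Engine data point. A sketch of how compilation_success might be written, assuming the blob/double layout implied by the query in the Querying section (blob1 = event type, double1 = duration, double2 = rule count); the helper name and blob2/index choices are hypothetical:

```typescript
// Minimal shape of the Analytics Engine binding.
interface AnalyticsEngineDataset {
    writeDataPoint(point: { blobs?: string[]; doubles?: number[]; indexes?: string[] }): void;
}

function trackCompilationSuccess(
    analytics: AnalyticsEngineDataset,
    requestId: string,
    durationMs: number,
    ruleCount: number,
): void {
    analytics.writeDataPoint({
        blobs: ['compilation_success', requestId], // blob1 = event type, blob2 = request id
        doubles: [durationMs, ruleCount],          // double1 = duration, double2 = rule count
        indexes: [requestId],                      // sampling index
    });
}
```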

Querying

-- Average compilation time over last 24h
SELECT
  avg(double1) as avg_duration_ms,
  sum(double2) as total_rules
FROM `adblock-compiler-analytics-engine`
WHERE timestamp > NOW() - INTERVAL '1' DAY
  AND blob1 = 'compilation_success'

D1 Database

D1 stores compilation history and deployment records, enabling the admin dashboard to show historical data.

Schema

Migrations are in migrations/. Apply with:

wrangler d1 execute adblock-compiler-d1-database --file=migrations/0001_init.sql --remote
wrangler d1 execute adblock-compiler-d1-database --file=migrations/0002_deployment_history.sql --remote
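From a worker, history rows are read through the DB binding with prepared statements. The sketch below stubs the D1 interfaces locally; the compilations table and its column names are assumptions based on the migrations above, not the actual schema:

```typescript
// Minimal local shapes of the D1 binding used by the query helper.
interface D1PreparedStatement {
    bind(...values: unknown[]): D1PreparedStatement;
    all(): Promise<{ results: Record<string, unknown>[] }>;
}

interface D1Database {
    prepare(query: string): D1PreparedStatement;
}

// Fetch the most recent compilation records for the admin dashboard.
// Table/column names here are illustrative.
async function recentCompilations(db: D1Database, limit = 20) {
    const { results } = await db
        .prepare(
            'SELECT id, status, duration_ms, created_at FROM compilations ORDER BY created_at DESC LIMIT ?',
        )
        .bind(limit)
        .all();
    return results;
}
```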

Workflows

Four durable workflows handle crash-resistant async operations:

| Workflow | Trigger | Purpose |
|---|---|---|
| CompilationWorkflow | /compile/async | Single async compilation with retry |
| BatchCompilationWorkflow | /compile/batch | Per-item recovery for batch jobs |
| CacheWarmingWorkflow | Cron (every 6h) | Pre-populate KV cache |
| HealthMonitoringWorkflow | Cron (every 1h) | Check source URL health |
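The crash resistance comes from the step API: each completed step's result is persisted, so a restarted workflow replays saved results instead of redoing the work. A stubbed sketch of the pattern; the interface is a local stand-in for the cloudflare:workers types, and the step names and bodies are illustrative:

```typescript
// Local stand-in for the Workflows step API. In a real worker, each
// step.do() result is durably stored; on restart, already-completed
// steps return their saved value instead of re-running.
interface WorkflowStep {
    do<T>(name: string, fn: () => Promise<T>): Promise<T>;
}

// Illustrative compilation run split into durable steps. A crash between
// 'fetch-sources' and 'compile' would resume at 'compile' with the
// fetched sources replayed from storage, not re-downloaded.
async function runCompilation(step: WorkflowStep, sourceUrls: string[]): Promise<number> {
    const sources = await step.do('fetch-sources', async () =>
        // Placeholder for fetching each filter list over HTTP.
        sourceUrls.map((url) => ({ url, rules: ['||ads.example^'] })),
    );

    const ruleCount = await step.do('compile', async () =>
        sources.reduce((n, s) => n + s.rules.length, 0),
    );

    return ruleCount;
}
```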

References