zeroclaw/docs/operations
jordanthejet 5dfa722738 ci: consolidate CI/CD pipeline — 6 Rust jobs → 2, unified cache, frequency optimization
Consolidate redundant Rust compilation jobs to cut PR cycle time from 2+ hours
to ~30 minutes by reducing parallel cold compilations and upgrading runners.

CI Run (ci-run.yml):
- Merge lint + workspace-check + package-check → quality-gate (25min, 8vcpu)
- Merge test + build → test-and-build (30min, 8vcpu)
- Unify cache keys: prefix-key=zeroclaw-ci-v1, shared-key=runner.os-rust
- Update ci-required gate, lint-feedback deps to reference new job names

Security Audit (sec-audit.yml):
- Merge audit + deny + security-regressions → rust-security (25min, 8vcpu)
- Merge sbom + unsafe-debt → compliance (lightweight runner)
- Add fast-path: non-Rust PRs skip Rust compilation entirely

Frequency optimization (off PR path):
- sec-codeql.yml: push-to-main + weekly only (was PR + push)
- ci-reproducible-build.yml: push-to-main + weekly only (was PR + push)
- ci-change-audit.yml: push-to-main only (was PR + push)

Runner upgrades:
- All Rust compilation jobs: 2vcpu → blacksmith-8vcpu-ubuntu-2404
- ci-supply-chain-provenance, test-fuzz: upgraded to 8vcpu
- test-e2e: upgraded to 8vcpu, fixed env indentation bug

Feature matrix (feature-matrix.yml):
- Non-default lanes (whatsapp-web, browser-native, nightly-all-features)
  skip on compile profile, run on nightly only
- resolve-profile + summary jobs use ubuntu-latest (no Rust compilation)

Docs/scripts:
- lint_feedback.js: update job name references for quality-gate
- required-check-mapping.md: document new consolidated job names
- ci-map.md: update trigger map, triage guide, maintenance rules
- self-hosted-runner-remediation.md: update job name reference

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-05 15:51:07 -05:00
canary-gate-runbook.md feat(ci): formalize canary cohorts and observability policy 2026-02-25 22:29:26 +08:00
connectivity-probes-runbook.md ci: reduce non-blocking workflow load 2026-02-28 05:00:19 +08:00
docs-deploy-policy.md docs(ci): document docs deploy promotion and rollback policy 2026-02-25 21:55:13 +08:00
docs-deploy-runbook.md docs(ci): document docs deploy promotion and rollback policy 2026-02-25 21:55:13 +08:00
feature-matrix-runbook.md chore: remove Linear and Hetzner integrations (replay #2809) 2026-03-05 02:17:32 -05:00
ghcr-tag-policy.md docs(release): document GHCR vulnerability gate policy 2026-02-25 21:35:57 +08:00
ghcr-vulnerability-policy.md docs(release): document GHCR vulnerability gate policy 2026-02-25 21:35:57 +08:00
incident-2026-03-02-main-red-runner-regression.md docs(incident): redact runner host identifiers 2026-03-01 18:46:41 -08:00
nightly-all-features-runbook.md docs(ci): align nightly governance docs with active matrix profile 2026-02-25 23:01:49 +08:00
prerelease-stage-gates.md ci: remove dev-to-main promotion gate and align main flow 2026-02-28 14:06:08 +08:00
README.md ci: add provider connectivity probes matrix and runbook 2026-02-24 14:38:08 +08:00
required-check-mapping.md ci: consolidate CI/CD pipeline — 6 Rust jobs → 2, unified cache, frequency optimization 2026-03-05 15:51:07 -05:00
self-hosted-runner-remediation.md ci: consolidate CI/CD pipeline — 6 Rust jobs → 2, unified cache, frequency optimization 2026-03-05 15:51:07 -05:00

Operations & Deployment Docs

For operators running ZeroClaw in persistent or production-like environments.

Core Operations

Common Flow

  1. Validate runtime (status, doctor, channel doctor)
  2. Apply one config change at a time
  3. Restart service/daemon
  4. Verify channel and gateway health
  5. Roll back quickly if behavior regresses
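
The flow above can be sketched as a small driver that refuses to apply a change when the runtime is already unhealthy and rolls back when the post-restart checks fail. This is a minimal sketch under stated assumptions, not ZeroClaw's own tooling: the subcommand names come from step 1, but the exact command lines in `PREFLIGHT`, the reuse of those same checks for step 4, and the `apply`, `restart`, and `rollback` callbacks are all hypothetical placeholders to be replaced with whatever your deployment actually uses.

```python
import subprocess
from typing import Callable, Sequence

# Assumed command lines for step 1; only the subcommand names
# (status, doctor, channel doctor) appear in the list above.
PREFLIGHT = (
    ("zeroclaw", "status"),
    ("zeroclaw", "doctor"),
    ("zeroclaw", "channel", "doctor"),
)

Runner = Callable[[Sequence[str]], bool]
Action = Callable[[], None]


def shell_runner(cmd: Sequence[str]) -> bool:
    """Run a command and report success as a boolean (exit code 0)."""
    return subprocess.run(list(cmd), check=False).returncode == 0


def checks_pass(run: Runner) -> bool:
    """Steps 1 and 4: every check must succeed; stops at the first failure."""
    return all(run(cmd) for cmd in PREFLIGHT)


def apply_one_change(run: Runner, apply: Action, restart: Action,
                     rollback: Action) -> str:
    """Apply a single config change, following the five-step flow."""
    if not checks_pass(run):      # step 1: validate before touching anything
        return "blocked"
    apply()                       # step 2: exactly one change per call
    restart()                     # step 3: restart service/daemon
    if checks_pass(run):          # step 4: verify health after the restart
        return "applied"
    rollback()                    # step 5: revert the change...
    restart()                     # ...and restart on the previous config
    return "rolled_back"
```

Passing `shell_runner` as `run` executes the real commands; passing a stub makes the flow easy to dry-run. Keeping `apply` to a single change per call is what makes the rollback in step 5 unambiguous, since there is only one thing to revert.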