If the agent is not appearing in the fleet, re-register it via the Agent Registration page. This resets the agent's bond and issues a fresh pulse schedule.
After registration, wait up to 60 seconds for the agent to emit its first pulse. This page auto-refreshes. The fleet status will update when the first heartbeat is received. If no pulse arrives, repeat from Step 1.
This card auto-dismisses when agents.alive > 0 · Full Runbook
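The wait-and-retry behaviour described above can be sketched as a small polling loop. This is a minimal illustration, not the page's actual refresh code: `fetch_status` is a hypothetical callable that returns the parsed JSON of GET /beak/system/status, and the 5-second interval is an assumption. Only the 60-second window and the `agents.alive > 0` condition come from the card text.

```python
import time
from typing import Callable, Mapping


def wait_for_first_pulse(
    fetch_status: Callable[[], Mapping],
    timeout_s: float = 60.0,
    interval_s: float = 5.0,  # assumed poll interval, not taken from the page
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True once agents.alive > 0, False if the window elapses first."""
    waited = 0.0
    while True:
        status = fetch_status()
        alive = status.get("agents", {}).get("alive", 0)
        if alive > 0:
            return True
        if waited >= timeout_s:
            # No pulse within the window: the operator repeats registration.
            return False
        sleep(interval_s)
        waited += interval_s
```

Injecting `sleep` keeps the loop testable without real delays; a `False` result corresponds to "repeat from Step 1".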
Database Posture
Loading…
Live DynamoDB table counts from the system status route.
Backend: GET /beak/system/status · Fields: database.* (eggs, ducklings, birth_certificates, …) + GET /beak/metrics for growth strip
Live table visibility · Last checked: —
Loading table posture…
Growth · Live metrics
Ducklings: —
Certs issued: —
Connections: —
Peck requests: —
Health: —
DynamoDB Billing
Loading…
Live DynamoDB billing modes from system/status.
Backend: GET /beak/system/status · Fields: database_billing.*
PAY_PER_REQUEST vs PROVISIONED · Last checked: —
Loading billing modes…
Health: —
Pending Peck Requests
Loading…
Live from GET /beak/metrics.
Backend: GET /beak/metrics · Fields: spaceducks_bonded
Bonding demand + audit path · Last checked: —
Spaceducks bonded: —
History: Peck request history available via /beak/audit
Health: —
🔗 Peck Protocol
Loading…
Live Step Functions execution counts from peck-approval-workflow.
Backend: GET /beak/system/status · Fields: peck_protocol.running, peck_protocol.succeeded, peck_protocol.failed
Connection approval state machine · Last checked: —
🔄 Running: —
✅ Succeeded: —
❌ Failed: —
Lambda Cost Estimator
Loading…
Projected monthly cost from invocation count and duration.
Backend: POST /beak/audit (for invocation count), GET /beak/system/status (for memory)
Unofficial estimate based on public pricing · Last checked: —
Monthly Invocations: —
Average Duration: — ms
Configured Memory: — MB
Projected Cost: $— (est.)
Excludes free tier. Prices for us-east-1.
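The projection described in this card can be approximated as follows. This is a rough sketch under stated assumptions, not the dashboard's own formula: the two price constants are the commonly published us-east-1 on-demand rates for x86 functions and may be out of date, and the function name and rounding are illustrative. As the card states, the free tier is ignored.

```python
# Assumed public us-east-1 prices (x86); verify against current AWS pricing.
REQUEST_PRICE_PER_MILLION = 0.20   # USD per 1M requests
GB_SECOND_PRICE = 0.0000166667     # USD per GB-second of compute


def estimate_monthly_cost(invocations: int, avg_duration_ms: float, memory_mb: int) -> float:
    """Projected monthly Lambda cost in USD, excluding the free tier."""
    # Compute usage is billed in GB-seconds: seconds run times memory in GB.
    gb_seconds = invocations * (avg_duration_ms / 1000.0) * (memory_mb / 1024.0)
    request_cost = invocations / 1_000_000 * REQUEST_PRICE_PER_MILLION
    compute_cost = gb_seconds * GB_SECOND_PRICE
    return round(request_cost + compute_cost, 2)
```

For instance, one million invocations averaging 1000 ms at 1024 MB would consume one million GB-seconds, so compute cost dominates the request charge.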
Peck Failure Analysis
Loading…
Analysis of failed peck requests from the Step Functions workflow, split into denied, expired, and callback-error buckets.
Backend: GET /beak/system/status · Fields: peck_protocol.failed, peck_protocol.succeeded, peck_protocol.failure_breakdown.*
Live system throughput from eggs to bonded spaceducks. → Cert Monitor
Backend: GET /beak/system/status · Fields: database.eggs/ducklings/birth_certificates, agents.total_bonded + GET /beak/metrics for trend
Database + agent telemetry · Last checked: —
Eggs: — → Ducklings: — → Certified: — → Spaceducks: —
Total certs issued: — (Waiting for metrics…)
Last cert issued: — (Timestamp pending…)
Trend: — (Weekly movement pending…)
Cert Issuance Latency
Loading…
Last-issued age plus rolling 24h certificate volume from live metrics and audit signals.
Backend: GET /beak/metrics · Fields: last_cert_issued_at (+ fallback POST /beak/audit for duck.cert_issued / duck.hatched events)
Latest issuance timestamp + last 24h issuance count · Last checked: —
Last issued age: — (Waiting for issuance telemetry…)
Issued in last 24h: — (Waiting for audit volume…)
Latest cert timestamp: — (Using /beak/metrics when present, then audit fallback.)
Mission Control API
Loading…
Request health for the operator-facing APIs used by this page.
Backend: GET /beak/system/status (latency + fields), GET /beak/metrics, POST /beak/audit — all three fetched each refresh cycle
Live API reachability · Last checked: —
Status route: /beak/system/status
Metrics route: /beak/metrics
Audit route: /beak/audit
Observed latency: —
Health: —
Governance Actions
Manual approval required
Production changes move through the governance lane: open a request ID, record the target version cycle, and capture approval before any alias change. Frozen surfaces still require T-JOSH sign-off. See GOVERNANCE-LOG.md for the full audit trail.
Deployment Approval Queue
Loading…
Pending, approved, and expired governance actions from sd_gov_actions. Use this queue before deploy promotion, rollback, or freeze exit work.
Local operator diff of known Lambda environment variable names only — no values are surfaced. Baseline is stored in this browser for additive drift awareness.
Source set: current Mission Control build knowledge of runtime env keys · Last checked: —
v37 | 2026-03-21 05:31 UTC | DC-038 lambda_alias_version in status response
v36 | 2026-03-21 05:24 UTC | SD-034 AgentMail cert email wired into /beak/cert/issue
v35 | 2026-03-21 05:18 UTC | birth certificate issuance now captures ip_address, user_agent, and country metadata at issuance time; prod alias promoted to v35
Lambda Deploy Timeline
Loading…
Last 10 Lambda deploys from sd_deployment_log localStorage. Latest version highlighted amber. View diff viewer →
Auth Dependencies
Loading…
Mixed live + operator-confirmed auth provider posture.
Static sections: Communication Status and Webhook Health are explicit operator-truth tiles until dedicated API fields exist. Quick links are utility navigation, not health tiles.
Removed: Decorative/dead-weight tiles from the previous layout were replaced by real data surfaces or explicit static operator state.
Recent Events
Loading…
Last five audit entries from POST /beak/audit.
Backend: POST /beak/audit (body: {}) · Fields: entries[].event_type, entries[].timestamp
Live audit feed · Last checked: —
Loading recent events…
Route Health
Checking
Live probe of key API routes
Backend: direct HTTP probes — GET /beak, GET /beak/metrics, GET /beak/system/status — independent of main refresh cycle
GET /beak: —
GET /beak/metrics: —
GET /beak/system/status: —
Last checked: —
Recent Activity
Loading…
Last five entries from the recent_audit field of GET /beak/system/status, when present.
Fallback panel until Lambda log lines are exposed by the status route · Last checked: —
Loading recent activity…
Cert Inventory
Loading…
Duckling and certificate state distribution computed from live database counts.
Backend: GET /beak/system/status · Fields: database.eggs, database.ducklings, database.birth_certificates, agents.total_bonded
3. After each deploy: update coordination/DEPLOY-LOG.md and record version + alias promotion.
4. After incidents: log to coordination/GOVERNANCE-LOG.md with request ID + approver.
Next Priorities
Wire signup/cert audit events in Lambda so the Audit Activity strip shows real counts.
Expose agents.stale_list in /beak/system/status for the Stale Agent Spotlight card.
Expose last_cert_issued_at in /beak/metrics to enable real cert latency display.
Source: coordination/OPERATOR-NOTES.md · Last updated: 2026-03-21 UTC · Batch DC-055–DC-057
Operator Shift Log
Ready
Local shift notes for handoff continuity. Saves the latest three UTC-stamped notes to your browser only.
Collapsible card · localStorage only · keeps last 3 notes · Last checked: —
Newest first · UTC timestamps
Last saved: never
Stale Agent Spotlight
Loading…
Dead or never-pulsed agent summary from GET /beak/system/status. Individual agent records are shown when the API exposes them; otherwise summary counts are rendered.
Live Runtime Outranks Board State
Board task status is text; live runtime state is truth. When in doubt, trust the live API and function configuration over any board entry.
⌨️ Operator Command Palette
One-click copyable diagnostics for rapid operator investigation — curl and aws CLI commands pre-filled for prod.
Commands pre-loaded · Click Copy to grab
📊 Metrics Delta Strip
Waiting…
Change since last refresh — green for growth, red for drops, grey for no change.
Last checked: —
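The colour rule for the delta strip (green for growth, red for drops, grey for no change) can be illustrated with a small comparison helper. This is a sketch under assumptions: the metric names and the dictionary shape are hypothetical, and treating a metric with no previous value as grey is a choice made here, not something the page specifies.

```python
from typing import Dict, Optional


def delta_color(old: Optional[float], new: Optional[float]) -> str:
    """Map a metric change to the strip colour."""
    if old is None or new is None or new == old:
        return "grey"  # no change, or nothing to compare against yet
    return "green" if new > old else "red"


def metrics_delta(previous: Dict[str, float], current: Dict[str, float]) -> Dict[str, dict]:
    """Compare two refresh snapshots and report old, new, and colour per metric."""
    return {
        name: {
            "old": previous.get(name),
            "new": value,
            "color": delta_color(previous.get(name), value),
        }
        for name, value in current.items()
    }
```

The same old and new values could feed a "what changed" listing by filtering out the grey entries.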
⚠️ Degraded-Service Impact Matrix
Loading…
Translates live sandbox state, dead-agent count, and peck failures into plain-English operator risk.
Last checked: —
🔗 Connection Pressure
Loading…
Connections per bonded agent, peck pending vs failed ratio, and overload thresholds.
Last checked: —
🔄 What Changed Since Last Refresh
Metrics that moved on the latest live poll with old → new values.
Waiting for first refresh comparison…
—
📋 Route Failure Journal
Last route probe failures persisted in localStorage — UTC stamp, endpoint, status code.
Exports diagnostics JSON, anomaly summary, operator notes, route health, and parity state as a downloadable Markdown + JSON pair.
🚨 Operator Drill Checklist
Incident triage, alias verification, route probe review, and rollback preparation steps.
Step 1: Confirm live status. Run curl .../beak/system/status and verify that lambda.version matches the prod alias reported by aws lambda get-alias.
Step 2: Check route health. Use the Route Latency strip: any route slower than 1000ms or returning an error indicates degradation. Check the Route Failure Journal for recent failures.
Step 3: Review anomaly summary. Check the Anomaly Banner and Impact Matrix. Classify as Degraded / Partial Outage / Full Outage.
Step 4: Rollback readiness. Check the Deploy Readiness Checklist. If a rollback is needed, list versions with aws lambda list-versions-by-function, then use the Promote Alias command from the palette.
Step 5: Handoff. Export Incident Handoff Pack v2 (Markdown + JSON) for shift handoff or an incident ticket. Save a Shift Log note with the current state.
📈 Route Latency History
Last few probe latencies per route — movement over time for degradation detection.
No latency history yet — waiting for route probes.
🕵️ Stale Data Detector
Checking…
Cards that have not refreshed within the last 120 seconds are flagged here so operators don't mistake cached UI for live truth.
Waiting for first refresh…
—
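The 120-second staleness rule can be sketched as a timestamp comparison. This is an illustrative sketch only: the card names and the mapping of card name to last-refresh time are hypothetical, and treating a card that has never refreshed as stale is an assumption.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Optional

STALE_AFTER = timedelta(seconds=120)  # threshold stated on the card


def find_stale_cards(
    last_refreshed: Dict[str, Optional[datetime]], now: datetime
) -> List[str]:
    """Return the names of cards whose last refresh is older than 120 seconds."""
    return sorted(
        name
        for name, refreshed_at in last_refreshed.items()
        # Assumption: a card that never refreshed (None) counts as stale.
        if refreshed_at is None or now - refreshed_at > STALE_AFTER
    )
```

Passing `now` explicitly, rather than reading the clock inside the function, keeps the check deterministic.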
🚧 Operational Blockers
Loading…
Precise operational constraints for SES, SNS, cert pipeline, and GitHub — not vague waiting states.
Last checked: —
🔄 Board / Runtime Reconciliation
Compare board task state against live runtime truth — exports a compact Markdown handoff for operator review.
Outlier detections from sd_latency_log — responses >3s flagged as cold start candidates
Scanning…
Source: sd_latency_log · threshold: 3000ms · wired to main refresh cycle · Deployment Comparison →
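A minimal sketch of the outlier scan, assuming each sd_latency_log entry is a dictionary with `route` and `latency_ms` keys. That entry shape is an assumption for illustration; only the 3000ms threshold comes from the card.

```python
from typing import Dict, List

COLD_START_THRESHOLD_MS = 3000  # threshold stated on the card


def cold_start_candidates(
    latency_log: List[Dict], threshold_ms: int = COLD_START_THRESHOLD_MS
) -> List[Dict]:
    """Return log entries whose latency exceeds the threshold."""
    # Strictly greater than, matching the ">3s" wording on the card.
    return [entry for entry in latency_log if entry["latency_ms"] > threshold_ms]
```

A slow response is only a cold start candidate; a slow downstream call would produce the same signal.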
🎓 Cert Issuance Rate
Live certs/hour and certs/day from audit_log delta snapshots · 7-day trend
Computing…
⚠️ No new certs issued in >24h — cert pipeline may be stalled
Certs / Hour
—
Certs / Day
—
7-Day Trend
Collecting baseline data…
Source: sd_cert_rate_log delta snapshots · wired to mc:status · min 2 snapshots required
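The rate calculation from delta snapshots can be sketched as below. This assumes each snapshot is a `(timestamp, cumulative_cert_count)` pair, which is a hypothetical shape for sd_cert_rate_log entries, and it averages over the whole window between the oldest and newest snapshot. The minimum of two snapshots matches the card.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Optional, Tuple

Snapshot = Tuple[datetime, int]  # (taken_at, total certs issued so far)


def cert_rates(snapshots: List[Snapshot]) -> Optional[Dict[str, float]]:
    """Return certs per hour and per day, or None without enough data."""
    if len(snapshots) < 2:
        return None  # at least two snapshots are required to form a delta
    ordered = sorted(snapshots, key=lambda s: s[0])
    (first_at, first_count), (last_at, last_count) = ordered[0], ordered[-1]
    hours = (last_at - first_at).total_seconds() / 3600
    if hours <= 0:
        return None  # identical timestamps give no usable window
    per_hour = (last_count - first_count) / hours
    return {"per_hour": per_hour, "per_day": per_hour * 24}
```

A result of zero over a window longer than 24 hours corresponds to the stalled-pipeline warning shown on the card.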
⚡ Lambda Invocation Trend (7-day)
Daily invocation counts from localStorage.sd_invocation_log. Amber at 2× baseline · Red at 5× baseline.
Today
—
invocations
7-day total
—
Baseline
—
avg/day
Loading…
Normal · Amber: ≥2× baseline · Red: ≥5× baseline
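The amber and red thresholds can be expressed as a ratio check against the baseline. This is a sketch: the function name is illustrative, and returning "normal" when no baseline exists yet is an assumption.

```python
def invocation_severity(today: float, baseline: float) -> str:
    """Classify today's invocation count against the average-per-day baseline."""
    if baseline <= 0:
        return "normal"  # assumption: no baseline yet means nothing to flag
    ratio = today / baseline
    if ratio >= 5:
        return "red"
    if ratio >= 2:
        return "amber"
    return "normal"
```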
📬 SES/SNS Sandbox Exit Readiness · 0% complete
Step-by-step checklist for requesting SES production access and SNS sandbox exit. Progress is saved in localStorage.
📧 SES Production Access
📱 SNS Sandbox Exit
Progress persisted in localStorage.sd_sandbox_exit_checklist.
🐤 Canary Deploy Tracker
Compares Lambda $LATEST against the prod alias version. Green = in sync · Amber = drift detected.
$LATEST version
—
Invoked version this request
Prod Alias → version
—
Currently promoted alias
Drift Status
Checking…
About $LATEST drift: Lambda's $LATEST always points to the most recent deployment — even if it hasn't been promoted to the prod alias. When you see "$LATEST vs prod = v41", it means a new upload exists but your production traffic is still safely routed to v41 via the alias. This is cosmetic drift, not a production incident. Only promote after testing.
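The green and amber states can be sketched as a simple comparison between the version that served the request and the version behind the prod alias. The argument names are hypothetical, and the "unknown" state for missing values is an assumption added here.

```python
from typing import Optional


def drift_status(invoked_version: Optional[str], prod_alias_version: Optional[str]) -> str:
    """Return 'green' when in sync, 'amber' on drift, 'unknown' without data."""
    if not invoked_version or not prod_alias_version:
        return "unknown"  # assumption: missing data is reported, not guessed
    if str(invoked_version) == str(prod_alias_version):
        return "green"
    # Drift means a newer upload exists; prod traffic still follows the alias.
    return "amber"
```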
Local operator freeze state only. This surface does not redefine governance authority, rollback rights, or approval semantics — it adds a visible hold note for this browser session profile.
Deploy/promotion actions are visually blocked while the freeze is active. Rollback and diagnostic review remain visible so operator recovery authority is preserved.
Platform Governance
Space Duck Galaxy 1.1 Beta — governance framework summary.
Every agent must be linked to a verified human duckling
Every connection requires explicit approval (peck protocol)
Every action is logged in the audit trail
Permissions can be revoked instantly at any time
Birth certificates are permanent and tamper-evident
Gradually shift traffic to a new Lambda version. T-JOSH governance required to execute.
Prod alias: —
Current prod alias: beak-api:prod → version ?
Canary version: Not set (enter below)
No split selected
Rollback if error rate exceeds this value
Prerequisites
✓ Lambda function published with versioning enabled
✓ Alias "prod" exists on target function
⚠ CloudWatch alarm configured for error rate monitoring
⚠ Rollback runbook reviewed and on-call notified
What this does
Executes update-alias --routing-config to shift a percentage of Lambda invocations to the new version. 100% promote removes the routing config entirely, making the new version the sole prod target.
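The traffic split described above could be prepared as arguments for the boto3 Lambda client's `update_alias` call. This sketch only builds the arguments and does not call AWS. The function name, the version numbers, and the choice to pass an empty `AdditionalVersionWeights` map to clear routing on a 100% promote are assumptions for illustration, not the panel's confirmed behaviour.

```python
from typing import Any, Dict


def build_alias_update(
    function_name: str,
    stable_version: str,
    canary_version: str,
    canary_percent: float,
    alias: str = "prod",
) -> Dict[str, Any]:
    """Build keyword arguments for boto3's lambda_client.update_alias(...)."""
    if not 0 < canary_percent <= 100:
        raise ValueError("canary_percent must be in the range (0, 100]")
    if canary_percent == 100:
        # Full promote: point the alias at the new version and clear weights.
        return {
            "FunctionName": function_name,
            "Name": alias,
            "FunctionVersion": canary_version,
            "RoutingConfig": {"AdditionalVersionWeights": {}},
        }
    # Partial shift: the alias stays on the stable version and a fraction
    # of invocations is routed to the canary version.
    return {
        "FunctionName": function_name,
        "Name": alias,
        "FunctionVersion": stable_version,
        "RoutingConfig": {
            "AdditionalVersionWeights": {canary_version: canary_percent / 100}
        },
    }
```

With governance approval in place, the result would be passed as `boto3.client("lambda").update_alias(**kwargs)`; keeping the builder separate lets the split be reviewed before anything is executed.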