Skip to content

Document deployment.health + the no-poll failure loop#37

Merged
acoshift merged 2 commits into
mainfrom
failure-feedback
Jun 20, 2026
Merged

Document deployment.health + the no-poll failure loop#37
acoshift merged 2 commits into
mainfrom
failure-feedback

Conversation

@acoshift

Copy link
Copy Markdown
Member

Part of the failure-feedback feature (SPEC-failure-feedback.md §H).

Changes

  • automation/notification-channels.md — new "Asynchronous failures" section: when deployment.health fires (crash-loop/OOM/ImagePull auto-error, or deployer apply failure) vs the synchronous deployment.deploy failure, the system actor (empty actor), the non-secret reason-in-message convention, and that full logs stay behind deployment.logs.
  • deployments/monitoring.md — new "React to failures without polling" section: the end-to-end agent loop (me.generateToken → pull channel → notification.pulldeployment.status/logs), with recovery arriving on the same stream.
  • automation/deploy-from-github.md — note that PR previews now report post-deploy failures (sticky comment → crashed state + GitHub deployment status → failure), not just the CI-time outcome.

Sequencing

The deployment.health event (notification-channels + monitoring docs) ships with apiserver Part 1 (deploys-app/apiserver#174). The deploy-from-github post-deploy-failure note describes the Part 2 GitHub surface (deploys-app/apiserver#175) — merge this after Part 2 lands.

Docs-only (Hugo markdown); CLI flags and internal links match existing pages.

🤖 Generated with Claude Code

https://claude.ai/code/session_011d4bVuGLnCbcJD9ZvastPH

acoshift and others added 2 commits June 20, 2026 23:19
notification-channels.md gains an "Asynchronous failures" section: when
deployment.health fires (crash-loop/OOM/ImagePull auto-error, or deployer apply
failure) vs the synchronous deployment.deploy failure, the system actor (empty
actor), the non-secret reason-in-message convention, and that logs stay behind
deployment.logs. monitoring.md gains "React to failures without polling" — the
full me.generateToken → pull channel → notification.pull → deployment.status/logs
loop, with recovery arriving on the same stream.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_011d4bVuGLnCbcJD9ZvastPH
A PR preview that deploys green but then crash-loops or whose apply fails now
updates its sticky comment to a failure state and flips the GitHub deployment
status to failure (via the deployment.health signal), so a broken preview stops
looking green. Production gets the status update, no comment.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
Claude-Session: https://claude.ai/code/session_011d4bVuGLnCbcJD9ZvastPH
@deploys-app deploys-app Bot temporarily deployed to pr-37 June 20, 2026 23:26 Destroyed
@deploys-app

deploys-app Bot commented Jun 20, 2026

Copy link
Copy Markdown

Preview deleted (PR closed).

@acoshift acoshift merged commit cd808d9 into main Jun 20, 2026
1 check passed
@acoshift acoshift deleted the failure-feedback branch June 20, 2026 23:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant