
retro

Deep blameless postmortem workflow—timeline, impact, root cause vs contributing factors, what went well/poorly, action items with owners, and follow-through. Use after incidents, outages, or near-misses to improve reliability culture.

Author: admin | Source: ClawHub
Version: V 1.0.0 | Security check: Passed | 100 downloads | 0 favorites

# Postmortems

A good postmortem **learns** without blaming individuals. It produces **owned actions** that reduce recurrence or improve detection, not generic “communicate better” platitudes.

## When to Offer This Workflow

**Trigger conditions:**

- SEV incidents, customer-visible outages, data-loss scares
- Near-misses worth documenting
- Need for facilitation structure in a blame-prone culture

**Initial offer:**

Use **six stages**: (1) scope & audience, (2) timeline & impact, (3) root cause analysis, (4) what worked / didn’t, (5) action items, (6) communication & follow-up. Confirm whether the summary is internal-only or customer-facing.

---

## Stage 1: Scope & Audience

**Goal:** Define readers (exec, engineering, CS) and redact PII or sensitive security details.

### Practices

- Blameless framing in the invite and template

**Exit condition:** Template chosen; owner assigned for the final document.

---

## Stage 2: Timeline & Impact

**Goal:** Minute-resolution timeline in UTC: onset → detection → mitigation → resolution.

### Impact

- Users affected, duration, data integrity if relevant, SLA breach

**Exit condition:** Facts align with any external customer communication.

---

## Stage 3: Root Cause Analysis

**Goal:** Use five whys or fishbone as tools, not rituals. Separate the **root cause** (the cause whose fix stops the class of failure) from **contributing factors** (process gaps, missing tests).

### Practices

- Do not name an individual as the “root cause”

**Exit condition:** Evidence-backed causal chain; contributing factors listed.

---

## Stage 4: What Worked / Didn’t

**Goal:** Reinforce positives (runbooks followed, clear comms) and record negatives (missing dashboards, slow escalation).

---

## Stage 5: Action Items

**Goal:** Specific tickets with owners and dates; categorize each as prevent / detect / recover / process.

### Practices

- Avoid vague “add monitoring”; name metrics or signals

**Exit condition:** Items linked in the issue tracker.

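The Stage 5 criteria (owner, date, category, a specific title, a tracker link) can be checked mechanically before the postmortem is published. The following is a minimal sketch, not part of the skill itself: the `ActionItem` fields, the `VAGUE_PHRASES` list, the owner name, and the tracker URL are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional

# The four categories named in Stage 5.
CATEGORIES = {"prevent", "detect", "recover", "process"}

# Hypothetical examples of titles too vague to act on.
VAGUE_PHRASES = ("add monitoring", "communicate better")


@dataclass
class ActionItem:
    title: str
    owner: Optional[str]
    due: Optional[date]
    category: str
    ticket_url: Optional[str] = None  # link into the issue tracker


def problems(item: ActionItem) -> List[str]:
    """Return the reasons an action item is not yet ready to file."""
    found = []
    if not item.owner:
        found.append("missing owner")
    if item.due is None:
        found.append("missing due date")
    if item.category not in CATEGORIES:
        found.append(f"unknown category: {item.category}")
    if item.title.strip().lower() in VAGUE_PHRASES:
        found.append("title too vague; name a metric or signal")
    if not item.ticket_url:
        found.append("not linked in the issue tracker")
    return found


vague = ActionItem("Add monitoring", owner=None, due=None, category="detect")
specific = ActionItem(
    "Alert when checkout p99 latency exceeds 2 s for 5 min",
    owner="payments-oncall",
    due=date(2026, 5, 1),
    category="detect",
    ticket_url="https://tracker.example/OPS-1",  # placeholder URL
)

print(problems(vague))
print(problems(specific))
```

An empty list means the item meets the Stage 5 exit condition under these assumptions; a real team would likely replace the phrase list with a reviewer's judgment.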

---

## Stage 6: Communication & Follow-Up

**Goal:** Share the summary internally; publish an external postmortem only when policy requires; track action-item completion at 30 and 60 days.

---

## Final Review Checklist

- [ ] Blameless tone; timeline and facts accurate
- [ ] Impact quantified where possible
- [ ] Root cause vs contributing factors distinguished
- [ ] Action items owned, dated, tracked
- [ ] Follow-up review scheduled

## Tips for Effective Guidance

- Match depth to severity; use a lightweight retro for minor incidents.
- Link traces, metrics, and logs in an appendix for engineers.
- Psychological safety enables honesty; leadership models it.

## Handling Deviations

- Security incidents: coordinate with legal/infosec before publishing details.
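The 30/60-day follow-up from Stage 6 is easy to lose track of, so it may help to compute the review dates when the postmortem is published. This is a small sketch under assumptions: the function names, the tuple layout for action items, and the example dates are hypothetical and not defined by the skill.

```python
from datetime import date, timedelta
from typing import List, Sequence, Tuple


def follow_up_dates(published: date, offsets_days: Sequence[int] = (30, 60)) -> List[date]:
    """Return the review checkpoints counted from the publication date."""
    return [published + timedelta(days=d) for d in offsets_days]


def overdue(items: Sequence[Tuple[str, date, bool]], today: date) -> List[str]:
    """Return titles of items that are not done and whose due date has passed.

    Each item is a (title, due_date, done) tuple.
    """
    return [title for title, due, done in items if not done and due < today]


# Hypothetical example data.
checkpoints = follow_up_dates(date(2026, 1, 1))
items = [
    ("Add alert on queue depth", date(2026, 1, 20), True),
    ("Write failover runbook", date(2026, 1, 25), False),
    ("Add integration test for retry path", date(2026, 2, 15), False),
]

for checkpoint in checkpoints:
    print(checkpoint.isoformat(), overdue(items, checkpoint))
```

At each checkpoint the list shows which items to raise in the follow-up review; an empty list at the 60-day mark suggests the postmortem can be closed.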

Tags

skill ai

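Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw WorkBuddy QClaw Kimi Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the retro-1775975162 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the retro-1775975162 skill

Install via command line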

skillhub install retro-1775975162

Download Zip package

⬇ Download retro v1.0.0

File size: 2.01 KB | Published: 2026-4-13 11:48

v1.0.0 (latest) 2026-4-13 11:48
Initial release of the retro skill: a structured, blameless postmortem workflow.

- Guides users through six stages: scoping, timeline, root cause analysis, review of what worked/failed, action items, and follow-up.
- Focuses on clear ownership of action items and on distinguishing root cause from contributing factors.
- Emphasizes psychological safety and tailored communication for both internal and external audiences.
- Provides best practices, final review checklist, and specific guidance for security incidents.
