
postmortems

Deep blameless postmortem workflow—timeline, impact, root cause vs contributing factors, what went well/poorly, action items with owners, and follow-through. Use after incidents, outages, or near-misses to improve reliability culture.

Author: admin | Source: ClawHub
Version: 1.0.0 | Security check: passed | Downloads: 79 | Favorites: 0

# Postmortems (Deep Workflow)

A good postmortem **learns** without **blaming individuals**. It produces **owned actions** that **reduce recurrence** or **improve detection**, not a generic “we will communicate better.”

## When to Offer This Workflow

**Trigger conditions:**

- **SEV** incidents, customer-visible outages, data loss scares
- **Near-miss** worth documenting (luck prevented impact)
- **Blame** culture risk that needs a **facilitation** structure

**Initial offer:**

Use **six stages**: (1) scope & audience, (2) timeline & impact, (3) root cause analysis, (4) what worked / didn’t, (5) action items, (6) communication & follow-up. Confirm **internal-only** vs **customer-facing** summary.

---

## Stage 1: Scope & Audience

**Goal:** Identify the **readers** (exec, eng, CS) and the **sensitivity** of the content (PII, security details redacted).

### Practices

- **Blameless** framing in invite and template

**Exit condition:** Template chosen; owner for final doc.

---

## Stage 2: Timeline & Impact

**Goal:** **Minute-resolution** timeline in **UTC** that distinguishes **start**, **detection**, **mitigation**, and **resolution**.

### Impact

- **Users** affected, **duration**, **data integrity** if relevant, **SLA breach**

**Exit condition:** Customer communication aligned with the facts here.

---

## Stage 3: Root Cause Analysis

**Goal:** Use **five whys** or **fishbone** as a **tool**, not a **ritual**; keep **root cause** and **contributing factors** separate.

### Practices

- **Root**: fix that stops the recurrence class (with evidence)
- **Contributors**: process, missing tests, alert gaps

**Exit condition:** No single person named as “root cause.”

---

## Stage 4: What Worked / Didn’t

**Goal:** Reinforce the good (runbooks, heroes who followed process) and fix the bad (missing dashboards).

---

## Stage 5: Action Items

**Goal:** Specific, tracked tickets with **owners** and **dates**; types: **prevent**, **detect**, **recover**, **process**.

### Practices

- Avoid vague “add monitoring” without metric names

**Exit condition:** Action items filed in the issue tracker and linked.

---

## Stage 6: Communication & Follow-Up

**Goal:** Share the summary with the org; review completion in **30/60 days**.

### Practices

- External postmortem if a customer promise requires it

---

## Final Review Checklist

- [ ] Blameless tone; facts and timeline clear
- [ ] Impact quantified where possible
- [ ] Root cause and contributing factors distinguished
- [ ] Action items owned, dated, and tracked
- [ ] Follow-up review scheduled

## Tips for Effective Guidance

- **Severity** should match the depth of the postmortem (lightweight for small incidents).
- Link to **metrics** and **traces** in an appendix for engineers.
- **Psychological safety** enables honesty; leadership must model it.

## Handling Deviations

- **Security incident**: coordinate with legal before sharing public detail.
- **Repeated same failure**: escalate to an architecture or SLO review.
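
The Stage 2 distinction between start, detection, mitigation, and resolution can be made concrete with a small calculation. The following is a minimal Python sketch, not part of the skill itself: the `IncidentTimeline` class, the `utc` helper, and the sample timestamps are all hypothetical, and measuring every duration from the incident start (rather than from detection) is an assumption that a team may choose differently.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class IncidentTimeline:
    """Hypothetical container for the four Stage 2 milestones (UTC only)."""
    started: datetime
    detected: datetime
    mitigated: datetime
    resolved: datetime

    def __post_init__(self) -> None:
        points = [self.started, self.detected, self.mitigated, self.resolved]
        # Reject naive or non-UTC timestamps so the timeline stays unambiguous.
        for point in points:
            if point.utcoffset() != timedelta(0):
                raise ValueError("all timestamps must be timezone-aware UTC")
        # Milestones must occur in chronological order.
        if points != sorted(points):
            raise ValueError("milestones must be in chronological order")

    # Assumption: every duration is measured from the incident start.
    def time_to_detect(self) -> timedelta:
        return self.detected - self.started

    def time_to_mitigate(self) -> timedelta:
        return self.mitigated - self.started

    def time_to_resolve(self) -> timedelta:
        return self.resolved - self.started


def utc(text: str) -> datetime:
    """Parse 'YYYY-MM-DD HH:MM' as a UTC timestamp at minute resolution."""
    return datetime.strptime(text, "%Y-%m-%d %H:%M").replace(tzinfo=timezone.utc)


# Illustrative timestamps only; they do not describe a real incident.
timeline = IncidentTimeline(
    started=utc("2026-01-10 14:02"),
    detected=utc("2026-01-10 14:17"),
    mitigated=utc("2026-01-10 14:41"),
    resolved=utc("2026-01-10 15:30"),
)

print("time to detect:  ", timeline.time_to_detect())    # 0:15:00
print("time to mitigate:", timeline.time_to_mitigate())  # 0:39:00
print("time to resolve: ", timeline.time_to_resolve())   # 1:28:00
```

Keeping the four milestones as separate fields makes a long detection gap visible on its own, which tends to point toward "detect" action items rather than only "prevent" ones.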

Tags: skill, ai

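Install via chat

This skill can be installed via chat on the following platforms:

OpenClaw, WorkBuddy, QClaw, Kimi, Claude

Option 1: Install SkillHub and the skill

Help me install SkillHub and the postmortems-1776030867 skill

Option 2: Set SkillHub as the preferred skill installation source

Set SkillHub as my preferred skill installation source, then help me install the postmortems-1776030867 skill

Install via command line

skillhub install postmortems-1776030867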

Download Zip package

Download postmortems v1.0.0

File size: 2.22 KB | Published: 2026-4-13 11:35

v1.0.0 (latest), 2026-4-13 11:35
- Initial release of the "postmortems" skill with a structured, blameless postmortem workflow.
- Six clear stages: scope & audience, timeline & impact, root cause vs contributing factors, what worked/what didn’t, action items (owners/dates), communication and follow-up.
- Emphasis on owned, specific actions and psychological safety to drive learning and reliability culture.
- Includes tips for tailoring depth to severity, linking to metrics, and guidance for sensitive/security incidents.
- Comprehensive checklist ensures clarity, accountability, and scheduled follow-up.
