ReletRelet Docs

Troubleshooting

Structured incident diagnosis and resolution playbooks for Relet operators.

Use this playbook to reduce mean time to resolution and improve escalation quality across teams.

Troubleshooting workflow illustration

Incident triage sequence

Use the same sequence every time so incidents are diagnosed consistently.

Identify the impacted module, organization, and UTC time window.
Determine scope: single user, single organization, or multi-organization.
Capture exact user-visible error text and the last successful action.
Run module-specific checks and validate recovery before closure.

Module-level diagnostic checklist

These checks help you isolate common failure causes quickly.

Verify filters, statuses, and date ranges before assuming a record is missing. Confirm role-based visibility boundaries for the affected user.

Validate document totals, plan state, and payment status transitions. Then verify the same flow on one controlled record.

Confirm owner assignment, required completion data, and status transition order from intake through closure.

Verify the user role, organization context, and recent permission changes. Revoke stale invitations and re-test.

Escalation package requirements

You must include these items when escalating so engineering can reproduce and fix issues without extra back-and-forth.

  1. Organization name and affected workflow.
  2. UTC timestamps of first failure and latest failure.
  3. Reproduction steps with five actions or fewer.
  4. Actual result and expected result.
  5. Relevant user role and any recent permission/configuration changes.

Escalation quality standard

If a report does not include reproducible steps and a clear expected result, complete the report before escalation.

Frequent incidents and first actions

Use these quick actions before deeper investigation.

Recovery validation

You should not close incidents until you validate both immediate recovery and short-term stability.

  1. Repeat the original failing workflow.
  2. Confirm no regression in adjacent workflows.
  3. Confirm affected users can proceed normally.
  4. Log root cause and prevention actions in your team runbook.

Next steps

  1. Use FAQ for recurring operator questions.
  2. Use Admin monitoring for platform checks.
  3. Update the affected guide page with any new prevention controls.

Last updated on

On this page