Candidate Actions
Distributed system failure mode
Admin
Actions
Distributed system failure mode
active
v2
Preview
Duplicate
Save draft
Publish
Action name
Category
Screening
Technical
Compliance
Verification
Reference
Aptitude
Kind
Question
Multiple choice
Code submission
Document upload
External score
Video response
Take-home
Reference check
Signature
Applies to roles
Engineering
Product
Design
Sales
Research
Finance
Operations
Legal
Data / ML
Quant
Question configuration
Prompt
Describe a real production incident where a distributed system you worked on failed in an unexpected way. How did you diagnose it, and what changed as a result?
Expected length
Short (1-2 sentences)
Medium (a paragraph)
Long (multi-paragraph)
Ideal answer hint (recruiter-only)
Grading rubric (recruiter-only)
Strong: specific incident, diagnosis process, measurable change afterwards. Weak: generic 'we had an outage' with no detail.
Archive action