Categories
Process Site Reliability

Preventing Business Failure

(Decorist : 5/19-5/19)

Challenge

Realized a key analytics ETL server was a crucial component of the engineering infrastructure and without redundancy.

Action

  • Crafted a plan to remediate risk.
  • Performed AWS devops necessary to bring up 2nd instance.
  • Trained-up data engineer.
  • Worked with Data Engineer and offshore Tiger Team of 2 to deliver a process for spinning up a Docker-based backup server.

Results

  • Created replacement Docker image (and recovery process) to be spun-up, ensuring business continuity in catastrophic situation.