My On-Call Investigation Tool

I’ve been on-call, and the thing that always got to me wasn’t the incidents themselves; it was the context switching. You get paged, you stop what you’re doing, you open six different tools, each with its own auth and query language, and you start piecing together what’s broken. By the time you’ve gathered enough signal to have a theory, twenty minutes have passed, and half of that went to just logging in to things. ...