I recently had an on site at Google for a Systems SRE position. I did not get the job and the feedback that my recruiter gave me was that the hiring committee found my performance in the 'troubleshooting' interview to be lacking. I guess nothing can beat the experience of actually dealing with outages but I was wondering if anyone has suggestions on additional resources for this kind of interview.
The one's that I'm aware of:
* The troubleshooting chapter in the Google SRE book * https://jvns.ca/zines/ * http://www.brendangregg.com/linuxperf.html * Reading about past outages: https://github.com/danluu/post-mortems
Thank you very much!