Google recently introduced Cloud Operations Sandbox, which is a sandbox environment where you can see the principles of SRE applied.
This got me excited, and it is a perfect companion to my reading of the Site Reliability Engineering book.
Part and parcel of that is comprehensive observability tooling—logging, monitoring, tracing, profiling and debugging—which can help you troubleshoot production issues faster, increase release velocity and improve service reliability.
I have started the sandbox script and I intend to follow along with the documentation.
I am particularly interested in the observability bits.
More to follow (still waiting for the installer to provision my sandbox :laugh: )