Abstract:
The authors discuss how to develop and operate large distributed software systems that embrace failures, learn from them, and adapt to maintain a very high system uptime.Metadata
Abstract:
The authors discuss how to develop and operate large distributed software systems that embrace failures, learn from them, and adapt to maintain a very high system uptime.