Chaos engineering is an emerging practice of resilience testing in IT development. By deliberately injecting failures into a system and monitoring how it responds, development teams can gain confidence in the system’s ability to withstand unexpected conditions in production. Chaos engineering helps teams prevent outages, cut costs, uncover hidden weaknesses, and build confidence in operating their systems.
Read this e-book to learn about the principles of chaos engineering, the tools and resources used, and an example involving a Kubernetes application.