Member-only story
Ensure a constant Cloud Native Chaos
When designing a system, architects should always aim to build it in a way it is resilient against failure. When designing a modern and cloud native solution based upon microservices running in containers you need to ensure that the system will be able to continue operating as a container fails.
One way of ensuring that the system will continue to serve the users is by rigorous testing against component failure. Another way is to include constant controlled “failure” as part of the design.
Chaos engineering with Chaos Monkey
A way of introducing constant controlled failure is leveraging the concept of controlled “failure”. Chaos engineering is the discipline of experimenting on a software system in production in order to build confidence in the system’s capability to withstand turbulent and unexpected conditions.
As with every field of engineering, Chaos Engineering, has many types of implementations, one of the more popular ways of introducing Chaos Engineering is by leveraging Chaos Monkey which has been developed by the people by the NetFlix Engineering teams.
The simple explanation of Chaos Monkey is; Chaos Monkey is responsible for randomly terminating instances in production to ensure that engineers implement their services to be resilient to instance…