Chaos Engineering with Tammy Bütow

Available on: iTunes | Android | RSS

Chaos Engineering introduces failures across a system. This helps us evaluate how are system will perform when a failure occurs. Tammy Bütow, Principal Site Reliability Engineer at Gremlin, explains why Chaos Engineering emerged. We talked about the different types of chaos that can be introduced to a system: DNS related attacks, black hole attacks and database attacks. Tammy highlighted the importance of a Service Level Agreement and went over its components. The discussion continued with topics around what metrics to collect for monitoring, incident management, being on-call and tracking down an issue.

Tammy-Butow
Tammy Bütow, Principal Site Reliability Engineer at Gremlin

Gremlin partnered with the Cloud Native Foundation (CNCF) to offer five diversity and inclusion grants. Apply here by July 31st!

@tammybutow
@GremlinInc
@GirlGeekAcademy
@techwomenshow

Show Notes:
Get Started with Chaos Engineering
Principles of Chaos
Dropbox outage post-mortem
The Netflix Simian Army

Sponsors

blind
Blind is an anonymous app for tech workers to discuss, debate and talk about about compensation, corporate policies, workplace harassment, and more.

There are 50,000 companies active on Blind.
Check if yours is there and connect with other employees.

Go to teamblind.com to download the app.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

w

Connecting to %s