COMING SOON - Lab Exercise: Incident Analysis and Developing a Balanced Action Plan to Mitigate (C110004G-SPVC)


This lab enables you to apply knowledge learned in the Identifying Root Causes for Past Incidents module.


This course is intended for learners who are pursuing professional-level site reliability engineer certification on IBM Cloud.


Before starting this curriculum, the target audience should understand:
•System Thinking
•DevOps practices
•Cloud Architecture
•Software engineering principles
•System administration
•Network and OSI model
•Networking and security practices for IBM Cloud
•Incident management
•Root cause analysis


The target audience should also be able to:
•Proficiently write code
•Create run books as a reference
•Make system components serviceable
•Interpret data and statistics to determine actions
•Use LogDNA, SysDig, Grafana, Prometheus, Kibana
•Interpret schematics
•Drive incidents to resolution
•Remediate underlying sources of unreliability
•Create and configure VMs
•Create and configure Containers on IBM Kubernetes Service (IKS)/Red Hat OpenShift Kubernetes Services (ROKS)
•Create and configure Containers using OpenShift
•Create and configure Serverless applications
•Configure for high availability and scalability


  • Build skills in identifying root cause problems in cloud environments
  • Learn to develop a balanced action plan
mostrar detailes

Course Outline

Topic 1: Identifying Root Causes for Past Incidents and Developing a Balanced Action Plan to Mitigate Future Incidents