IBM Cloud Associate Site Reliability Engineer (SRE) Certification Prep Course (CNSREASG)

Overview

Earn your IBM Cloud Associate SRE v1 certification in 45 days or less!

 

The Site Reliability Engineer is one of the most in demand IT jobs in the marketplace today. This entry level course will help you build the foundational skills required to operate services that sustain service level objectives and engineer scalable, secure, and highly reliable services on IBM Cloud.

 

The instructor-led IBM Cloud Associate Site Reliability Engineer (SRE) Certification Prep Course, designed for those with 6 months to 1 year practical experience, is complemented by a 45-day subscription to the Associate SRE digital learning curriculum. This blended approach, instructor-led coupled with digital self-study, can further support your efforts to pass the IBM Certified Associate SRE - Cloud v1 certification exam.

 

Weekly prework assignments via the digital subscription prepares you for the instructor-led event and places emphasis on the content areas that support most of the exam questions. The instructor-led event reinforces the exam content areas via interactive learning, including team breakouts, use cases, games, matching exercises, and much more. 

 

Your certification prep course experience will take place over approximately a 6-week time frame:

  • Weeks 1-4: Complete the recommended weekly prework assignments via digital learning subscription (3-4 hours per week)
  • Week 5: Participate in the instructor-led IBM Cloud Associate SRE Certification Prep Course
  • Weeks 5-6: Use post course task guidance to continue to prepare for the certification exam (student guide, assessment and digital learning subscription)

 

It’s highly recommended that you take IBM Certified Associate SRE - Cloud v1 certification exam on or before your 45-day digital subscription expires. 

Good Luck! 

Audience

This prep course is for anyone interested in earning an IBM Cloud Associate SRE v1 certification. 

Prerequisites

null

Objective

You’ll develop your skills and knowledge around incident management, monitoring, troubleshooting, operations, deployments and security and compliance. You will also learn the tools needed to manage enterprise workloads in IBM Cloud environments.

Upon completion of this course you should be able to: 

- Identify root causes and contributing factors and apply software engineering principles toward mitigation
- Explain service level objective, service level indicator, error budget
- Identify appropriate IBM Cloud tools and technology for operations management
- Explain of release strategies and concepts
- Explain modern software engineering concepts
- Implement and manage backup and recovery
- Monitor resource utilization
- Perform operational readiness review
- Identify key metrics for service health
- Apply build to manage concepts
- Create and maintain metrics, traces, and alerts
- Recognize and differentiate performance and availability metrics
- Describe operations information provided by Watson AIOps
- Explain blameless postmortems
- Describe how to manage incidents
- Explain security policies
- Monitor security threats
- Evaluate security compliance
- Describe how to troubleshoot problems caused by the runtime and backend services
- Describe how to troubleshoot problems caused by the compute infrastructure
- Troubleshoot the problems caused by the network
- Describe how to troubleshoot problems caused by storage
- Collect, analyze, and manage logs for troubleshooting on IBM Cloud
- Differentiate between availability, reliability, resiliency, and architecture as related to business needs
- Design for component resiliency
- Implement resiliency for the workload
- Describe storage replication
- Describe failure domains
- Describe zero downtime deployment
- Explain infrastructure as code with Terraform and Schematics
- Explain post-incident review concepts
- Develop and update run books
- Implement rank-ordered postmortem actions and toil

 

Detaylari Göster

Course Outline

Upon completion of this course students should be able to: 

- Identify root causes and contributing factors and apply software engineering principles toward mitigation
- Explain service level objective, service level indicator, error budget
- Identify appropriate IBM Cloud tools and technology for operations management
- Explain of release strategies and concepts
- Explain modern software engineering concepts
- Implement and manage backup and recovery
- Monitor resource utilization
- Perform operational readiness review
- Identify key metrics for service health
- Apply build to manage concepts
- Create and maintain metrics, traces, and alerts
- Recognize and differentiate performance and availability metrics
- Describe operations information provided by Watson AIOps
- Explain blameless postmortems
- Describe how to manage incidents
- Explain security policies
- Monitor security threats
- Evaluate security compliance
- Describe how to troubleshoot problems caused by the runtime and backend services
- Describe how to troubleshoot problems caused by the compute infrastructure
- Troubleshoot the problems caused by the network
- Describe how to troubleshoot problems caused by storage
- Collect, analyze, and manage logs for troubleshooting on IBM Cloud
- Differentiate between availability, reliability, resiliency, and architecture as related to business needs
- Design for component resiliency
- Implement resiliency for the workload
- Describe storage replication
- Describe failure domains
- Describe zero downtime deployment
- Explain infrastructure as code with Terraform and Schematics
- Explain post-incident review concepts
- Develop and update run books
- Implement rank-ordered postmortem actions and toil