Microsoft
Microsoft Logo

AZ-400: Develop a Site Reliability Engineering (SRE) strategy 

  • Offered byMicrosoft

AZ-400: Develop a Site Reliability Engineering (SRE) strategy
 at 
Microsoft 
Overview

Learn how Site Reliability Engineering enables you to sustainably achieve the appropriate level of reliability in your systems, services, and products

Duration

8 hours

Total fee

Free

Mode of learning

Online

Schedule type

Self paced

Difficulty level

Beginner

Official Website

Explore Free Course External Link Icon

Credential

Certificate

AZ-400: Develop a Site Reliability Engineering (SRE) strategy
 at 
Microsoft 
Highlights

  • Know about monitoring options for an Azure virtual machine (VM)
  • Learn the use of application logs in Azure Web Apps to help debug web app code
  • Learn how to Respond to the incidents and activities in your infrastructure through alerting capabilities in Azure Monitor
Details Icon

AZ-400: Develop a Site Reliability Engineering (SRE) strategy
 at 
Microsoft 
Course details

What are the course deliverables?
  • Develop an instrumentation strategy
  • Develop a Site Reliability Engineering (SRE) strategy
  • Develop a security and compliance plan
  • Manage source control
  • Facilitate communication and collaboration
  • Define and implement continuous integration
More about this course
  • Learn how developers write programs that run on the cloud, including how to deploy, be fault-tolerant, load balance, scale, and deal with latency
  • Discover what cloud elasticity means and different ways to scale your cloud resources
  • Review multidimensional metrics for the load balancer in Azure Monitor Metrics
  • Learn how to manage site reliability

AZ-400: Develop a Site Reliability Engineering (SRE) strategy
 at 
Microsoft 
Curriculum

MODULE: 1 Introduction to Site Reliability Engineering (SRE)

Introduction to Site Reliability Engineering

What is SRE and why does it matter?

SRE in context

Key SRE principles and practices: virtuous cycles

Key SRE principles and practices: The human side of SRE

Getting started with SRE

MODULE: 2 Improve incident response with alerting on Azure

Explore the different alert types that Azure Monitor supports

Use metric alerts for alerts about performance issues in your Azure environment

Exercise - Use metric alerts to alert on performance issues in your Azure environment

Use log alerts to alert on events in your application

Use activity log alerts to alert on events within your Azure infrastructure

Exercise - Use activity log alerts to alert on events within your Azure infrastructure

Use smart groups to reduce alert noise in Azure Monitor

MODULE:3 Capture Web Application Logs with App Service Diagnostics Logging

Enable and configure App Service application logging

Exercise - Enable and configure App Service application logging using the Azure portal

View live application logging with the log streaming service

Exercise - View live application logging with the log streaming service using Azure CLI

Retrieve application log files

Exercise - Retrieve Application Log Files using Azure CLI and Kudu

MODULE: 4 Manage site reliability

What is reliability engineering?

What is Application Insights?

Perform ongoing tuning to reduce meaningless alerts

Analyze alerts to establish a baseline

Blameless postmortems

MODULE: 5 Scale your cloud resources with elasticity

Compute load patterns

Scaling compute resources

Automated scaling on the cloud

Load balancing

Serverless computing

MODULE: 6 Troubleshoot inbound network connectivity for Azure Load Balancer

Troubleshoot Azure Load Balancer

Diagnose issues by reviewing configurations and metrics

Exercise - Set up your environment

Exercise - Identify and resolve inbound network connectivity

MODULE: 7 Monitor the health of your Azure virtual machine by using Azure Metrics Explorer and metric alerts

Monitor the health of the virtual machine

Exercise - Set up a VM with boot diagnostics

View VM metrics

Configure the Azure Diagnostics extension

Exercise - Configure the Azure Diagnostics extension

Diagnostic data case studies

Exercise - Use diagnostic data

Other courses offered by Microsoft

Free
2 hours
Intermediate
Free
4 hours
Intermediate
Free
5 hours
Beginner
Free
1 hours
Beginner
View Other 1171 CoursesRight Arrow Icon
qna

AZ-400: Develop a Site Reliability Engineering (SRE) strategy
 at 
Microsoft 

Student Forum

chatAnything you would want to ask experts?
Write here...