This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Discover secure, future-ready cloud solutions—on-premises, hybrid, multicloud, or at the edge
Learn about sustainable, trusted cloud infrastructure with more regions than any other provider
Build your business case for the cloud with key financial and technical guidance from Azure
Plan a clear path forward for your cloud journey with proven tools, guidance, and resources
See examples of innovation from successful companies of all sizes and from all industries
Explore some of the most popular Azure products
Provision Windows and Linux VMs in seconds
Enable a secure, remote desktop experience from anywhere
Migrate, modernize, and innovate on the modern SQL family of cloud databases
Build or modernize scalable, high-performance apps
Deploy and scale containers on managed Kubernetes
Add cognitive capabilities to apps with APIs and AI services
Quickly create powerful cloud apps for web and mobile
Everything you need to build and operate a live game on one platform
Execute event-driven serverless code functions with an end-to-end development experience
Jump in and explore a diverse selection of today's quantum hardware, software, and solutions
Secure, develop, and operate infrastructure, apps, and Azure services anywhere
Create the next generation of applications using artificial intelligence capabilities for any developer and any scenario
Specialized services that enable organizations to accelerate time to value in applying AI to solve common scenarios
Add cognitive capabilities to apps with APIs and AI services
Accelerate information extraction from documents
Build, train, and deploy models from the cloud to the edge
Enterprise scale search for app development
Create bots and connect them across channels
Design AI with Apache Spark™-based analytics
Build for mixed reality using AI sensors
Apply advanced coding and language models to a variety of use cases
Gather, store, process, analyze, and visualize data of any variety, volume, or velocity
Limitless analytics with unmatched time to insight
Design AI with Apache Spark™-based analytics
Govern, protect, and manage your data estate
Hybrid data integration at enterprise scale, made easy
Provision cloud Hadoop, Spark, R Server, HBase, and Storm clusters
Real-time analytics on fast-moving streaming data
Build, train, and deploy models from the cloud to the edge
Enterprise-grade analytics engine as a service
Scalable, secure data lake for high-performance analytics
Fast and highly scalable data exploration service
Access cloud compute capacity and scale on demand—and only pay for the resources you use
Provision Windows and Linux VMs in seconds
Manage and scale up to thousands of Linux and Windows VMs
Deploy and scale containers on managed Kubernetes
Build and deploy Spring Boot applications with a fully managed service from Microsoft and VMware
Quickly create powerful cloud apps for web and mobile
Execute event-driven serverless code functions with an end-to-end development experience
A dedicated physical server to host your Azure VMs for Windows and Linux
Cloud-scale job scheduling and compute management
Migrate SQL Server workloads to the cloud at lower total cost of ownership (TCO)
Provision unused compute capacity at deep discounts to run interruptible workloads
Develop and manage your containerized applications faster with integrated tools
Deploy and scale containers on managed Kubernetes
Deploy and scale containers on managed Red Hat OpenShift
Build and deploy modern apps and microservices using serverless containers
Execute event-driven serverless code functions with an end-to-end development experience
Run containerized web apps on Windows and Linux
Launch containers with hypervisor isolation
Deploy and operate always-on, scalable, distributed apps
Build, store, secure, and replicate container images and artifacts
Support rapid growth and innovate faster with secure, enterprise-grade, and fully managed database services
Migrate, modernize, and innovate on the modern SQL family of cloud databases
Build or modernize scalable, high-performance apps
Build apps that scale with managed and intelligent SQL database in the cloud
Fully managed, intelligent, and scalable PostgreSQL
Managed, always up-to-date SQL instance in the cloud
Fully managed, scalable MySQL Database
Migrate SQL Server workloads to the cloud at lower total cost of ownership (TCO)
Accelerate apps with high-throughput, low-latency data caching
Accelerate your data migration to Azure
Modernize Cassandra data clusters with a managed instance in the cloud
Deploy applications to the cloud with enterprise-ready, fully managed community MariaDB
Deliver innovation faster with simple, reliable tools for continuous delivery
Services for teams to share code, track work, and ship software
Continuously build, test, and deploy to any platform and cloud
Plan, track, and discuss work across your teams
Get unlimited, cloud-hosted private Git repos for your project
Create, host, and share packages with your team
Test and ship confidently with an exploratory test toolkit
Quickly create environments using reusable templates and artifacts
Use your favorite DevOps tools with Azure
Full observability into your applications, infrastructure, and network
Optimize app performance with high-scale load testing
Streamline development with secure, ready-to-code workstations in the cloud
Deploy Grafana dashboards as a fully managed Azure service
Build, manage, and continuously deliver cloud applications—using any platform or language
Powerful and flexible environment to develop apps in the cloud
A powerful, lightweight code editor for cloud development
World’s leading developer platform, seamlessly integrated with Azure
Comprehensive set of resources to create, deploy, and manage apps
A powerful, low-code platform for building apps quickly
Get the SDKs and command-line tools you need
Build, test, release, and monitor your mobile and desktop apps
Full observability into your applications, infrastructure, and network
Optimize app performance with high-scale load testing
Streamline development with secure, ready-to-code workstations in the cloud
Get Azure innovation everywhere—bring the agility and innovation of cloud computing to your on-premises workloads
Secure, develop, and operate infrastructure, apps, and Azure services anywhere
Cloud-native SIEM and intelligent security analytics
Build and run innovative hybrid apps across cloud boundaries
Extend threat protection to any infrastructure
Experience a fast, reliable, and private connection to Azure
Synchronize on-premises directories and enable single sign-on
Migrate, modernize, and innovate on the modern SQL family of cloud databases
Services for teams to share code, track work, and ship software
Fully managed, intelligent, and scalable PostgreSQL
Extend cloud intelligence and analytics to edge devices
Manage user identities and access to protect against advanced threats across devices, data, apps, and infrastructure
Synchronize on-premises directories and enable single sign-on
Consumer identity and access management in the cloud
Manage your domain controllers in the cloud
Seamlessly integrate on-premises and cloud-based applications, data, and processes across your enterprise
Automate the access and use of data across clouds
Connect across private and public cloud environments
Publish APIs to developers, partners, and employees securely and at scale
Reliable event delivery at massive scale
Connect assets or environments, discover insights, and drive informed actions to transform your business
Connect, monitor, and manage billions of IoT assets
Use IoT spatial intelligence to create models of physical environments
Extend cloud intelligence and analytics to edge devices
Go from proof of concept to proof of value
Create, connect, and maintain secured intelligent IoT devices from the edge to the cloud
Unified threat protection for all your IoT/OT devices
Build intelligent edge solutions with world-class developer tools, long-term support, and enterprise-grade security.
Making embedded IoT development and connectivity easy
Use an enterprise-grade service for the end-to-end machine learning lifecycle
Accelerate edge intelligence from silicon to service
Add location data and mapping visuals to business applications and solutions
Simplify, automate, and optimize the management and compliance of your cloud resources
Build, manage, and monitor all Azure products in a single, unified console
Govern, protect, and manage your data estate
Stay connected to your Azure resources—anytime, anywhere
Streamline Azure administration with a browser-based shell
Your personalized Azure best practices recommendation engine
Simplify data protection with built-in backup management at scale
Monitor, allocate, and optimize cloud costs with transparency, accuracy, and efficiency using Microsoft Cost Management
Implement corporate governance and standards at scale
Keep your business running with built-in disaster recovery service
Improve application resilience by introducing faults and simulating outages
Full observability into your applications, infrastructure, and network
Deploy Grafana dashboards as a fully managed Azure service
Deliver high-quality video content anywhere, any time, and on any device
Encode, store, and stream video and audio at scale
Studio grade encoding at cloud scale
A single player for all your playback needs
Deliver content to virtually all devices with ability to scale
Securely deliver content using AES, PlayReady, Widevine, and Fairplay
Fast, reliable content delivery network with global reach
Simplify and accelerate your migration to the cloud with guidance, tools, and resources
Simplify migration and modernization with a unified platform
Keep your business running with built-in disaster recovery service
Accelerate your data migration to Azure
Appliances and solutions for data transfer to Azure and edge compute
Monitor, allocate, and optimize cloud costs with transparency, accuracy, and efficiency using Microsoft Cost Management
Blend your physical and digital worlds to create immersive, collaborative experiences
Create multi-user, spatially aware mixed reality experiences
Render high-quality, interactive 3D content with real-time streaming
Use IoT spatial intelligence to create models of physical environments
Build for mixed reality using AI sensors
Automatically align and anchor 3D content to objects in the physical world
Build and deploy cross-platform and native apps for any mobile device
Send push notifications to any platform from any back end
Build, test, release, and monitor your mobile and desktop apps
Create cloud-powered mobile apps faster
Add location data and mapping visuals to business applications and solutions
Publish APIs to developers, partners, and employees securely and at scale
Enterprise scale search for app development
Add cognitive capabilities to apps with APIs and AI services
Create multi-user, spatially aware mixed reality experiences
Quickly create powerful cloud apps for web and mobile
Build rich communication experiences with the same secure platform used by Microsoft Teams
Connect cloud and on-premises infrastructure and services to provide your customers and users the best possible experience
Create your own private network infrastructure in the cloud
Deliver high availability and network performance to your apps
Build secure, scalable, highly available web front ends in Azure
Establish secure, cross-premises connectivity
Host your Domain Name System (DNS) domain in Azure
Fast, reliable content delivery network with global reach
Protect your Azure resources from distributed denial-of-service (DDoS) attacks
Rapidly ingest data from space into the cloud with a satellite ground station service
Experience a fast, reliable, and private connection to Azure
Extend Azure management for deploying 5G and SD-WAN network functions on edge devices
Centrally manage virtual networks in Azure from a single pane of glass
Private access to services hosted on the Azure platform, keeping your data on the Microsoft network
Protect your enterprise from advanced threats across hybrid cloud workloads
Cloud-native SIEM and intelligent security analytics
Extend threat protection to any infrastructure
Build secure, scalable, highly available web front ends in Azure
Safeguard and maintain control of keys and other secrets
Establish secure, cross-premises connectivity
Protect your Azure resources from distributed denial-of-service (DDoS) attacks
Fully managed service that helps secure remote access to your virtual machines
A cloud-native web application firewall (WAF) service that provides powerful protection for web apps
Protect your Azure Virtual Network resources with cloud-native network security
Central network security policy and route management for globally distributed, software-defined perimeters
Get secure, massively scalable cloud storage for your data, apps, and workloads
High-performance, highly durable block storage
Scalable, secure data lake for high-performance analytics
Simple, secure and serverless enterprise-grade cloud file shares
Enterprise-grade Azure file shares, powered by NetApp
Simplify data protection with built-in backup management at scale
Massively scalable and secure object storage
Appliances and solutions for data transfer to Azure and edge compute
Industry leading price point for storing rarely accessed data
Build, deploy, and scale powerful web applications quickly and efficiently
Quickly create and deploy mission-critical web apps at scale
Publish APIs to developers, partners, and employees securely and at scale
Fast, reliable content delivery network with global reach
Enterprise scale search for app development
Add real-time web functionalities easily
Easily build real-time messaging web applications using WebSockets and the publish-subscribe pattern
Add location data and mapping visuals to business applications and solutions
Streamlined full-stack development from source code to global high availability
Build rich communication experiences with the same secure platform used by Microsoft Teams
Optimize app performance with high-scale load testing
Easily add real-time collaborative experiences to your apps with Fluid Framework
Empower employees to work securely from anywhere with a cloud-based virtual desktop infrastructure
Enable a secure, remote desktop experience from anywhere
Provision Windows desktops and apps with VMware and Azure Virtual Desktop
Provision Windows desktops and apps on Azure with Citrix and Azure Virtual Desktop
Set up labs for education, training, and other related scenarios
Streamline development with secure, ready-to-code workstations in the cloud
Build, manage, and continuously deliver cloud apps—with any platform or language
Analyze images, comprehend speech, and make predictions using data
Simplify and accelerate your migration and modernization with guidance, tools, and resources
Gather, store, process, analyze, and visualize data of any variety, volume, or velocity
Bring the agility and innovation of the cloud to your on-premises workloads
Connect, monitor, and control devices with secure, scalable, and open edge-to-cloud solutions
Help protect data, apps, and infrastructure with trusted security services
Simplify and accelerate development and testing (dev/test) across any platform.
Bring together people, processes, and products to continuously deliver value to customers and coworkers.
Build secure apps on a trusted platform. Embed security in your developer workflow and foster collaboration with a DevSecOps framework.
Give customers what they want with a personalized, scalable, and secure shopping experience.
Turn your ideas into applications faster using the right tools for the job.
Create reliable apps and functionalities at scale and bring them to market faster.
Reach your customers everywhere, on any device, with a single mobile app build.
Respond to changes faster, optimize costs, and ship confidently.
Build apps faster by not having to manage infrastructure.
Connect modern applications with a comprehensive set of messaging services on Azure.
Accelerate time to market, deliver innovative experiences, and improve security with Azure application and data modernization.
Use business insights and intelligence from Azure to build software as a service (SaaS) apps
Move to a SaaS model faster with a kit of prebuilt code, templates, and modular resources.
Build mission-critical solutions to analyze images, comprehend speech, and make predictions using data.
Uncover latent insights from across all of your business data with AI.
Protect your data and code while the data is in use in the cloud.
Build machine learning models faster with Hugging Face on Azure.
Save money and improve efficiency by migrating and modernizing your workloads to Azure with proven tools and guidance.
Optimize costs, operate confidently, and ship features faster by migrating your ASP.NET web apps to Azure.
Simplify and accelerate development and testing (dev/test) across any platform.
Enhanced security and hybrid capabilities for your mission-critical Linux workloads.
Reduce infrastructure costs by moving your mainframe and midrange apps to Azure.
Bring the intelligence, security, and reliability of Azure to your SAP applications.
Move your SQL Server databases to Azure with few or no application code changes.
Run your Windows workloads on the trusted cloud for Windows Server.
Explore tools and resources for migrating open-source databases to Azure while reducing costs.
Protect your data and code while the data is in use in the cloud.
Accelerate time to market, deliver innovative experiences, and improve security with Azure application and data modernization.
Seamlessly integrate applications, systems, and data for your enterprise.
Run your Oracle® database and enterprise applications on Azure and Oracle Cloud.
Reduce fraud and accelerate verifications with immutable shared record keeping.
Drive faster, more efficient decision making by drawing deeper insights from your analytics.
Accelerate time to insights with an end-to-end cloud analytics solution.
Minimize disruption to your business with cost-effective backup and disaster recovery solutions.
Get fully managed, single tenancy supercomputers with high-performance storage and no data movement.
Bring innovation anywhere to your hybrid environment across on-premises, multicloud, and the edge.
Deliver ultra-low-latency networking, applications and services at the enterprise edge.
Run your mission-critical applications on Azure for increased operational agility and security.
Experience quantum impact today with the world's first full-stack, quantum computing cloud ecosystem.
Deliver ultra-low-latency networking, applications, and services at the mobile operator edge.
Connect devices, analyze data, and automate processes with secure, scalable, and open edge-to-cloud solutions.
Build open, interoperable IoT solutions that secure and modernize industrial systems.
Help safeguard physical work environments with scalable IoT solutions designed for rapid deployment.
Strengthen your security posture with end-to-end security for your IoT solutions.
Meet environmental sustainability goals and accelerate conservation projects with IoT technologies.
Ensure compliance using built-in cloud governance capabilities.
Minimize disruption to your business with cost-effective backup and disaster recovery solutions.
Protect your data and code while the data is in use in the cloud.
Cloud-native network security for protecting your applications, network, and workloads.
Modernize operations to speed response rates, boost efficiency, and reduce costs
Transform customer experience, build trust, and optimize risk management
Build, quickly launch, and reliably scale your games across platforms
Implement remote government access, empower collaboration, and deliver secure services
Boost patient engagement, empower provider collaboration, and improve operations
Improve operational efficiencies, reduce costs, and generate new revenue opportunities
Create content nimbly, collaborate remotely, and deliver seamless customer experiences
Personalize customer experiences, empower your employees, and optimize supply chains
Get started easily, run lean, stay agile, and grow fast with Azure for startups
Accelerate mission impact, increase innovation, and optimize efficiency—with world-class security
Find reference architectures, example scenarios, and solutions for common workloads on Azure
We're in this together—explore Azure resources and tools to help you navigate COVID-19
Search from a rich catalog of more than 17,000 certified apps and services
Plan a clear path forward for your cloud journey with proven tools, guidance, and resources
Get the best value at every stage of your cloud journey
See which services offer free monthly amounts
Only pay for what you use, plus get free services
Find the options that work best for you
Explore special offers, benefits, and incentives
Estimate the costs for Azure products and services
Estimate your total cost of ownership and cost savings
Learn how to manage and optimize your cloud spend
Understand the value and economics of moving to Azure
Find, try, and buy trusted apps and services
Get up and running in the cloud with help from an experienced partner
Find the latest content, news, and guidance to lead customers to the cloud
Build, extend, and scale your apps on a trusted cloud platform
Reach more customers—sell directly to over 4M users a month in the commercial marketplace
Posted on February 14, 2022
“As we head into the fourth calendar year of the Advancing Reliability blog series, empowering organizations to run their workloads reliably on Azure remains one of our top priorities. We continually invest in evolving the Azure platform to help achieve this on a daily basis. Your ability to monitor virtual machine (VM) availability in a robust and comprehensive way is paramount to ensuring that your applications are available and resilient. For today’s post in the series, I have asked Program Manager, Pujitha Desiraju, from our Azure Core Platform Fundamentals Engineering team to talk about the latest observability enhancements for VM availability monitoring, as well as planned investments to deliver the best monitoring experience.”—Mark Russinovich, CTO, Azure
This post was co-authored by Principal Software Engineering Manager, Gaurav Jagtiani.
Flash, as the project is internally known, is a collection of efforts across Azure Engineering, that aims to evolve Azure’s virtual machine (VM) availability monitoring ecosystem into a centralized, holistic, and intelligible solution customers can rely on to meet their specific observability needs. Today, we’re excited to announce the completion of the project’s first two milestones—the preview of VM availability data in Azure Resource Graph, and the private preview of a VM availability metric in Azure Monitor.
Project Flash derives its name from our commitment to building robust and rapid ways to monitor virtual machine (VM) availability as comprehensively as possible—a key prerequisite for efficient application performance. It’s our mission to ensure you can:
With these goals in mind, we’ve divided our execution strategy into two phases—a near-term phase to meet critical current needs, and a long-term phase to deliver the best VM availability monitoring experience. This two-phased approach helps us continually bridge gaps, iterate on service quality, and learn from your feedback at every step along the way.
For the first phase, we are providing different options to enable convenient access to VM availability data to address a range of observability needs. We aim to maintain data consistency with similar rigorous quality standards across all of these existing features and solutions, like Resource Health or Activity Log, to deliver a consistent view agnostic of the solution you choose.
Today, we’re excited to reach our first Project Flash milestone—with the preview release of VM availability states in Azure Resource Graph for at-scale programmatic consumption.
Azure Resource Graph is a service in Azure that is extensively adopted for its efficient ability to query across many subscriptions, all at once and at low latencies. We’re currently emitting VM availability states (Available, Unavailable, and Unknown) to the Health Resources table in Azure Resource Graph, so you can perform complex Kusto Query Language (KQL) queries for sieving through large datasets at once. This functionality is handy for tracking historical changes in VM availability, for building custom dashboards, and for performing detailed investigations across numerous resource properties spread across multiple tables.
Figure 1: Azure Resource Graph Explorer Window with query and results, to demonstrate fetching data from the HealthResources table.
We are planning to add failure details and degraded VM scenarios to the Health Resources table in Azure Resource Graph, later this year. These details will ensure you are properly informed on the cause and impact of any failures—so you can either failover, reboot in place, or take the appropriate mitigations to prevent end-user impact.
Navigate to Azure Resource Graph Explorer on the Azure portal to get started with any of the KQL queries published for the Health Resources table.
We’re also pleased to announce the private preview of an out-of-box VM availability metric in Azure Monitor, for a curated metric alerting and monitoring experience.
Metrics in Azure Monitor are great for monitoring and analyzing time series representations of VM availability for quick and easy debugging, receiving scoped alerts on concerning trends, catching early indicators of degraded availability, correlating with other platform metrics, and more.
The metric allows you to track the pulse of your VMs—during expected behavior, the metric displays a value of 1. In response to any VM availability disruptions, the metric dips to a 0 for the duration of impact. In case of an Azure infrastructure outage, we will emit nulls represented as a dotted line on the portal.
Figure 2: Screenshot of VM availability metric as seen on Metrics Explorer in the Azure portal, with occasional dips to reflect VM availability disruptions.
We released the private preview of the metric as phase one of our rollout plan, and are currently collecting customer feedback, to further improve our offering. We are planning to add failure details such as metric dimensions and platform logs next year, to allow you to precisely alert on failure scenarios that are impactful.
The two monitoring options introduced above are just the beginning for Project Flash! We will continue to build upon our existing solutions by improving data quality and failure attribution. In parallel, we are designing two new monitoring offerings to meet your latency and mitigation needs, while also investing heavily in the underlying platform to make our fault detection more resilient and comprehensive.
Successfully running business-critical applications requires hyper-awareness of any VM availability impacting event, so remediation actions can be triggered instantaneously to prevent end-user impact. To support you in your daily operations, we are planning to design a notification mechanism that leverages the low-latency technology of Azure Event Grid. This will allow you to simply subscribe to an Event Grid system topic, and route scoped events via event handlers to any downstream tooling, instantaneously.
Considering the numerous ongoing investments to improve your VM availability monitoring experience, Project Flash intends to empower you even further by providing you knobs to customize recovery policies triggered by the platform, in response to cases of VM availability disruptions.
One such knob we are designing is the ability to opt-out of Service Healing for single-instance VMs, in response to a specific set of unanticipated Availability disruptions. This knob will be made available via the portal or at the time of VM deployment and can be updated dynamically. Note that leveraging this feature will render the usual Azure Virtual Machine availability SLAs ineffective.
In the future, we will explore introducing knobs to also opt-out of other applicable recovery policies (for example, Live Migration or Tardigrade), to ensure you can easily adapt to your ever-changing mitigation needs.
While the first phase is designed to meet your current observability needs, we remain focused on our long-term goal of delivering a world-class observability experience surrounding VM availability. We are extremely excited for all the data enrichments and technology advancements that will contribute to this experience, so here’s an early look at our roadmap of planned investments:
This list is certainly not exhaustive as we have multiple enrichments planned as part of our long-term strategy. To reiterate, our intention with Project Flash is to make VM availability monitoring extremely intuitive, comprehensive, and seamless—so you are always prepared for and informed about any changes in the health of your workloads, ultimately to maintain your own SLAs and business promises.
We will continue to share updates on Project Flash through blogs like this, to ensure you stay up to date on the latest. Stay tuned!
Azure updates
Ask questions