Information
Active - Virtual Machines and dependent services - Service management issues in multiple regions
Impact statement: As early as 19:46 UTC on 2 February 2026, we are aware of an ongoing issue causing customers to receive error notifications when performing service management operations - such as create, delete, update, scaling, start, stop - for Virtual Machines (VMs) across multiple regions. These issues are also causing impact to services with dependencies on these service management operations - including Azure Arc Enabled Servers, Azure Batch, Azure DevOps, Azure Load Testing, and GitHub. For details on the latter, please see https://www.githubstatus.com.
Current status: We have determined that these issues were caused by a recent configuration change that affected public access to certain Microsoft‑managed storage accounts, used to host extension packages. We are actively working on mitigation, including updating configuration to restore relevant access permissions. We have applied this update in one region so far, and are assessing the extent to which this mitigates customer issues. Our next update will be provided by 22:30 UTC, approximately 60 minutes from now.
Also had a few instance types which won't spin up in some regions/AZs recently. I assume this is capacity issues.
There’s a bunch of hardware, and they can’t run more servers than they have hardware. I don’t see a way around that.
That’s was years ago, wild to see they have the same issues.
It means that any service designed to survive a control plane outage must statically allocate its compute resources and have enough slack that it never relies on auto scaling. True for AWS/GCP/Azure.
That sounds oddly similar to owning hardware.
AWS has never had this type of outage in 20 years. Yet Azure constantly had them.
This is a total failure of engineering and has nothing to do with capacity. Azure is a joke of a cloud.
Which is my point.
The same fault on Azure would be a global (all-regions) fault.
Ran into an issue upgrading an AKS cluster last week. It completely stalled and broke the entire cluster in a way where our hands were tied as we can't see the control plane at all...
I submit a severity A ticket and 5 hours later I get told there was a known issue with the latest VM image that would create issues with the control plane leaving any cluster that was updated in that window to essentially kill itself and require manual intervention. Did they notify anyone? Nope, did they stop anyone from killing their own clusters. Nope.
It seems like every time I'm forced to touch the Azure environment I'm basically playing Russian roulette hoping that something's not broken on the backend.
Being happy means:
- you don't feel the need to automate more manual tasks (you lack laziness)
- you don't feel the need to make your system faster (you lack impatience)
- you don't feel the need to make your system better (you lack hubris)
So basically, happiness is a Sin.
As in why don't they mention Azure by name?
Or as in there shouldn't be isolated silos?
edit: Before someone says something. I do understand that the underlying issue is some issue with Azure.
I don't get how Microsoft views this level of service as acceptable.
Must be nice to be a monopoly that has most of the businesses in the world as their hostages.
Gitlab was generous first, to rise as a valid alternative to GitHub. They never got the comminity aspect right, perhaps aiming for profitability with a focus on the runners instances which is how they make money.
With profitability, the IPO made sense.
GitHub probably had a different strategy..keep it generous, get the entire open source community, keep raising money and one day someone will buys us out for billions. We we are, Microsoft goal is to capture the community, it works. It's sticky.
Which is again even worse.
Since there is no GitHub CEO, (Satya is not bothered anymore) and human employees not looking, Tay and Zoe are at the helm ruining GitHub with their broken AI generated fixes.
"Impact statement: As early as 19:46 UTC on 2 February 2026, we are aware of an ongoing issue causing customers to receive error notifications when performing service management operations - such as create, delete, update, scaling, start, stop - for Virtual Machines (VMs) across multiple regions. These issues are also causing impact to services with dependencies on these service management operations - including Azure Arc Enabled Servers, Azure Batch, Azure DevOps, Azure Load Testing, and GitHub. For details on the latter, please see https://www.githubstatus.com."