Timeline
15:45 UTC on 29 October 2025 – Customer impact began.
16:04 UTC on 29 October 2025 – Investigation commenced following monitoring alerts being triggered.
16:15 UTC on 29 October 2025 – We started to examine configuration changes within AFD.
16:18 UTC on 29 October 2025 – Initial communication posted to our public status page.
16:20 UTC on 29 October 2025 – Targeted communications to impacted customers sent via Azure Service Health.
17:26 UTC on 29 October 2025 – Azure portal was failed away from Azure Front Door.
17:30 UTC on 29 October 2025 – We blocked all new customer configuration changes to prevent further impact.
17:40 UTC on 29 October 2025 – We initiated the deployment of our ‘last known good’ configuration.
18:30 UTC on 29 October 2025 – We started to push the fixed configuration globally.
18:45 UTC on 29 October 2025 – Manual recovery of nodes commenced while gradual routing of traffic to healthy nodes began after the fixed configuration was pushed globally.
23:15 UTC on 29 October 2025 – PowerApps mitigated its dependency, and customers confirmed mitigation.
00:05 UTC on 30 October 2025 – AFD impact confirmed mitigated for customers.
Me: "How do I connect [X] to [Y] using [Z]?"
Copilot: "Please select the AKS cluster you'd like to delete"
Don't forget: it's also extremely insecure. There is a quarterly critical cross-tenant CVE with trivial exploitation for them, and it has been like that for years.
But what we do when things are easy is not who we are. That's a fiction. It's how we show up when we are in the shit that matters. It's discipline that tells you to voluntarily go into all of the multi-tenant mitigations instead of waiting for your boss to notice and move the goalposts you should have moved on your own.
Can't say I've experienced many bugs in there either. It definitely is overpriced but I assume they all are?
> 16:04 UTC on 29 October 2025 – Investigation commenced following monitoring alerts being triggered.
A 19-minute delay in alerting is a joke.
I think if you really wanted to do on call right to avoid gaps you’d want no more than 6 hours on primary per day per shift, and you want six, not four, shifts per day. So you’re only alone for four hours in the middle of your shift and have plenty of time to hand off.
It would be nice though if alert systems made it easy to wire up CD to turn down sensitivity during observed actions. Sort of like how the immune system turns down a bit while you're eating.
The reason is probably because changes to the status page require executive approval, because false positives could lead to bad publicity, and potentially having to reimburse customers for failing to meet SLAs.
We should be lucky MSFT is so consistent!
Hug ops to the Azure team, since management is shredding up talent over there.
Troubleshooting has completed
Troubleshooting was unable to automatically fix all of the issues found. You can find more details below.
>> We initiated the deployment of our ‘last known good’ configuration.
System Restore can help fix problems that might be making your computer run slowly or stop responding.
System Restore does not affect any of your documents, pictures, or other personal data. Recently installed programs and drivers might be uninstalled.
Confirm your restore point
Your computer will be restored to the state it was in before the event in the Description field below.
You don’t want to debug stuff with low blood sugar.
    16:04 Started running around screaming
    16:15 Sat down & looked at logs
Looks like there was no monitoring and no alerts.
Which is kinda weird.
I think it's perhaps a gap in the tools. We apply the same alert criteria at 2 am that we do while someone is actively running deployment or admin tasks and there's a subset that should stay the same, like request failure rate, and others that should be tuned down, like overall error rate and median response times.
And it means one thing if the failure rate for one machine is 90% and something else if the cluster failure rate is 5%, but if you've only got 18 boxes it's hard to discern the difference. And which is the higher priority error may change from one project to another.
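A rough sketch of that idea, assuming a hypothetical evaluate_alerts hook that a CD pipeline toggles into "deployment mode" (all names and thresholds here are made up for illustration):

    # Hypothetical deployment-aware alert evaluation; not tied to any real alerting product.
    from dataclasses import dataclass

    @dataclass
    class Thresholds:
        request_failure_rate: float   # stays strict even during deploys
        overall_error_rate: float     # relaxed during deploys
        median_response_ms: float     # relaxed during deploys

    NORMAL = Thresholds(request_failure_rate=0.01, overall_error_rate=0.02, median_response_ms=300)
    DEPLOYING = Thresholds(request_failure_rate=0.01, overall_error_rate=0.10, median_response_ms=1000)

    def evaluate_alerts(metrics: dict, deployment_in_progress: bool) -> list:
        t = DEPLOYING if deployment_in_progress else NORMAL
        alerts = []
        if metrics["request_failure_rate"] > t.request_failure_rate:
            alerts.append("request failure rate high")
        if metrics["overall_error_rate"] > t.overall_error_rate:
            alerts.append("overall error rate high")
        if metrics["median_response_ms"] > t.median_response_ms:
            alerts.append("median latency high")
        # A single box at 90% failures is a different signal than the whole
        # cluster at 5%, so check the per-host view separately.
        for host, rate in metrics.get("per_host_failure_rate", {}).items():
            if rate > 0.5:
                alerts.append(f"{host}: host-level failure rate {rate:.0%}")
        return alerts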
Very circular way of saying “the validator didn’t do its job”. This is AFAICT a pretty fundamental root cause of the issue.
It’s never good enough to have a validator check the content and hope that finds all the issues. Validators are great and can speed a lot of things up. But because they are independent code paths they will always miss something. For critical services you have to assume the validator will be wrong, and be prepared to contain the damage WHEN it is wrong.
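In other words, treat the validator as a filter rather than a guarantee, and stage the rollout so a config that slips through only hits a small slice of the fleet first. A minimal sketch of that containment idea (all function names hypothetical, and nothing to do with how AFD actually deploys):

    # Hypothetical staged config rollout: validate, then expand in waves,
    # rolling back to the last known good config if health regresses.
    import time

    WAVES = [0.01, 0.05, 0.25, 1.00]   # fraction of the fleet per wave

    def rollout(new_config, validate, apply_to_fraction, fleet_healthy, last_known_good):
        if not validate(new_config):
            raise ValueError("validation failed")      # cheap first line of defense
        for fraction in WAVES:
            apply_to_fraction(new_config, fraction)
            time.sleep(300)                            # bake time before widening the blast radius
            if not fleet_healthy():
                # Assume the validator was wrong and contain the damage.
                apply_to_fraction(last_known_good, 1.00)
                raise RuntimeError(f"health regressed at {fraction:.0%} rollout; rolled back")
        return "rolled out to 100%"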
Azure Portal Access Issues
Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. In addition, customers may experience issues accessing the Azure Portal. Customers can attempt to use programmatic methods (PowerShell, CLI, etc.) to access/utilize resources if they are unable to access the portal directly. We have failed the portal away from Azure Front Door (AFD) to attempt to mitigate the portal access issues and are continuing to assess the situation.
We are actively assessing failover options of internal services from our AFD infrastructure. Our investigation into the contributing factors and additional recovery workstreams continues. More information will be provided within 60 minutes or sooner.
This message was last updated at 16:57 UTC on 29 October 2025
---
Update: 16:35 UTC:
Azure Portal Access Issues
Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.
This message was last updated at 16:35 UTC on 29 October 2025
---
Azure Portal Access Issues
We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.
This message was last updated at 16:18 UTC on 29 October 2025
---
Message from the Azure Status Page: https://azure.status.microsoft/en-gb/status
Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. We suspect that an inadvertent configuration change as the trigger event for this issue. We are taking two concurrent actions where we are blocking all changes to the AFD services and at the same time rolling back to our last known good state.
We have failed the portal away from Azure Front Door (AFD) to mitigate the portal access issues. Customers should be able to access the Azure management portal directly.
We do not have an ETA for when the rollback will be completed, but we will update this communication within 30 minutes or when we have an update.
This message was last updated at 17:17 UTC on 29 October 2025
"This message was last updated at 18:11 UTC on 29 October 2025"
This message was last updated at 19:57 UTC on 29 October 2025
> In 50%+ of cases they just don't report it anywhere, even if it's for 2h+.
I assume you mean publicly. Are you getting the service health alerts?
But, for future reference:
site:microsoft.com csam
Storytelling is how issues get addressed. Help the CSAM tell the story to the higher ups.
Child Sex-Abuse Material?!? Well, a nice case of acronym collision.
No -- the one referencing crime should NEVER have been turned into an acronym.
Crimes should not be described in euphemistic terms (which is exactly what the acronym is)
actual Managers hate that
I'm simplifying a bit, but I don't think it's likely that Azure has a similar race condition wiping out DNS records on _one_ system that then propagates to all others. The similarity might just end at "it was DNS".
They didn't provide any details on latency. It could have been delayed an hour or a day and no one noticed
Edit: Typo!
• https://www.xbox.com/en-US also doesn't fully paint. Header comes up, but not the rest of the page.
• https://www.minecraft.net/en-us is extremely slow, but eventually came up.
The other day during the AWS outage they "reported" OVH down too.
We already had to do it for large files served from Blob Storage since they would cap out at 2MB/s when not in cache of the nearest PoP. If you’ve ever experienced slow Windows Store or Xbox downloads it’s probably the same problem.
I had a support ticket open for months about this and in the end the agent said “this is to be expected and we don’t plan on doing anything about it”.
We’ve moved to Cloudflare and not only is the performance great, but it costs less.
Only thing I need to move off Front Door is a static website for our docs served from Blob Storage, this incident will make us do it sooner rather than later.
Be aware that if you’re using Azure as your registrar, it’s (probably still) impossible to change your NS records to point to CloudFlare’s DNS server, at least it was for me about 6 months ago.
This also makes it impossible to transfer your domain to them either, as CloudFlare’s domain transfer flow requires you set your NS records to point to them before their interface shows a transfer option.
In our case we had to transfer to a different registrar, we used Namecheap.
However, transferring a domain from Azure was also a nightmare. Their UI doesn’t have any kind of transfer option, I eventually found an obscure document (not on their Learn website) which had an az command which would let you get a transfer code which I could give to Namecheap.
Then I had to wait over a week for the transfer timeout to occur because there is no way on Azure side that I could find to accept the transfer immediately.
I found CloudFlare’s way of building rules quite easy to use, different from Front Door but I’m not doing anything more complex than some redirects and reverse proxying.
I will say that Cloudflare’s UI is super fast, with Front Door I always found it painfully slow when trying to do any kind of configuration.
Cloudflare also doesn’t have the problem that Front Door has where it requires a manual process every 6 months or so to renew the APEX certificate.
They quickly updated the message to REMOVE the link. Comical at this point.
https://news.ycombinator.com/item?id=32031639
https://news.ycombinator.com/item?id=32032235
Edit: wow, I can't believe we hadn't put https://news.ycombinator.com/item?id=32031243 in https://news.ycombinator.com/highlights. Fixed now.
Long before that, the first raid array anyone set up for my (teams’) usage, arrived from Sun with 2 dead drives out of 10. They RMA’d us 2 more drives and one of those was also DOA. That was a couple years after Sun stopped burning in hardware for cost savings, which maybe wasn’t that much of a savings all things considered.
I was an intern but everyone seemed very stressed.
dang saying it's temporary: https://news.ycombinator.com/item?id=32031136
    $ dig news.ycombinator.com
    ; <<>> DiG 9.10.6 <<>> news.ycombinator.com
    ;; global options: +cmd
    ;; Got answer:
    ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 54819
    ;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 1
    ;; OPT PSEUDOSECTION:
    ; EDNS: version: 0, flags:; udp: 512
    ;; QUESTION SECTION:
    ;news.ycombinator.com.  IN A
    ;; ANSWER SECTION:
    news.ycombinator.com. 1 IN A 209.216.230.207
    ;; Query time: 79 msec
    ;; SERVER: 100.100.100.100#53(100.100.100.100)
    ;; WHEN: Wed Oct 29 13:59:29 EDT 2025
    ;; MSG SIZE  rcvd: 65
Bunch of on-call peeps over there that definitely know the instant something major goes down
But they won't be.
I've seen it multiple times at various stores; only once did I see them taking cash and writing things down (probably to enter into the system later when it came back up).
Time and time again it's shown that AWS is far more expensive than other solutions, just easier for the Execs to offshore the blame.
they think that they are 'eliminating a single point of failure', but in reality, they end up adding multiple, complicated points of mostly failure.
However netatmo does need to have a server to store data, as you need to consolidate across devices; plus you can query for a year's data, and that won't and can't be held locally.
I always go everywhere adequately prepared for beverages and food. Thanks to your comment, I have a new reason to do so. Take out coffees are actually far from guaranteed. Payment systems could go down, my bank account could be hacked or maybe the coffee shop could be randomly closed. Heck, I might even have an accident crossing the road. Anything could happen. Hence, my humble flask might not have the top beverage in it but at least it works.
We all design systems with redundancy, backups and whatnot, but few of us apply this thinking to our food and drink. Maybe get a kettle for the office and a backup kettle, in case the first one fails?
Here in The Netherlands, almost all trains were first delayed significantly, and then cancelled for a few hours because of this, which had real impact because today is also the day we got to vote for the next parliament (I know some who can't get home in time before the polls close, and they left for work before they opened).
If it’s a multi day event, it’s probably that way for a reason. Partially the same as the solution to above.
The description of voting in the Netherlands is that you can see your ballot physically go into a clear box and stay to see that exact box be opened and all ballots tallied.
Dropping a ballot in a box in your neighborhood helps ensure nothing with regard to the actual ballot count.
> You can stay there and wait for the count at the end of the day if you want to.
And if you watch the election night news, you'll see footage of multiple people counting the votes from the ballot boxes, again with various people observing to check that nothing dodgy is going on.
Having everyone just put their ballots in a postbox seems like a good way to remove public trust from the electoral system, because no one's standing around waiting for the postie to collect the mail, or looking at what happens in the mail truck, or the rest of the mail distribution process.
I'm sure I've seen reports in the US of people burning postboxes around election time. Things like this give more excuses to treat election results as illegitimate, which I believe has been an issue over there.
(Yes, we do also have advanced voting in NZ, but I think they're considered "special votes" and are counted separately .. the elections are largely determined on the day by in-person votes, with the special votes being confirmed some days later)
It is a small but distinct difference between mail/early voting and putting the votes directly into the ballot box.
(AI generated explanation) How the double-envelope system works:
Inner “secrecy” envelope – You mark your ballot, fold it, and slip it into an unmarked inner envelope. No name or identifying info is on this envelope, so your choices stay anonymous.
Outer declaration envelope – The inner envelope goes inside a larger outer envelope that carries:
– A ballot ID/barcode unique to you.
– A signature line that must match the one on file with your election office.
In many states, a detachable privacy flap or perforated strip hides the signature until election officials open the outer envelope, keeping the ballot secret.
There's so much more you have to trust.
If you wish, you can write a phrase on your ballot. The phrases and their corresponding vote are broadcast (on tv, internet, etc). So if you want to validate that your vote was tallied correctly, write a unique phrase. Or you could pick a random 30 digit number, collisions should be zero-probability, right?
I mean, this would be annoying because people would write slurs and advertisements, and the government would have to broadcast them. But, it seems pretty robust.
I’d suggest the state handle the number issuing, but then they could record who they issued which numbers to, and the winning party could go about rounding up their opposition, etc.
Googling around a bit, it sounds like there are systems that let you verify that your ballot made it, but not necessarily that it was counted correctly. (For this reason, I guess?)
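On the "zero-probability" point, a quick birthday-bound check does back up the intuition, assuming everyone picks a uniformly random 30-digit number (toy numbers below):

    # Back-of-envelope birthday bound: P(any collision) ≈ n^2 / (2N)
    n = 10**8    # ~number of voters
    N = 10**30   # size of the 30-digit number space
    print(n * n / (2 * N))   # ≈ 5e-15, i.e. effectively zero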
When I vote in person, I know all the officials there from various parties are just like...looking at the box for the whole day to make sure everything is counted. It's much easier to understand and trust.
Sure you got a notification! That doesn't mean anything. Even with human counted ballots or electronic ballots.
Following the chain of custody from vote to verification, in some way, would be nice.
Here in Latvia the "election day" is usually (always?) on weekend, but the polling stations are open for some (and different!) part of every weekday leading up. Something like couple hours on monday morning, couple hours on tuesday evening, couple around midday wednesday, etc. In my opinion, it's a great system. You have to have a pretty convoluted schedule for at least one window not to line up for you.
Here is the form to register for postal voting in the Republic of Ireland - https://www.dublincity.ie/sites/default/files/2024-01/pv4-wo...
Instructions on how to submit the form / register for mail-in votes is on page 4.
Hope that helps anyone else out who needs it in Ireland.
> You may use this form to apply for a postal vote if, due to the circumstances of your work/service or your full-time study in the State, you cannot go to your polling station on polling day.
Which seems to indicate that's only for people who can't go to the polling station, otherwise you do have to go there.
As someone who spent the first 30 years of my life in Ireland but is now part of that diaspora, it's frustrating but I get it. I don't get to vote, but neither do thousands of plastic paddys who have very little genuine connection to Ireland.
That said, I'm sure they could expand the voting window to a couple of days at least without too much issue.
But I still prefer the paper vote, and I'm usually apathetic about blockchain.
We've been closing a lot of polling places recently:
https://abcnews.go.com/US/protecting-vote-1-5-election-day-p...
Here's the President of the United States on Sunday: https://truthsocial.com/@realDonaldTrump/posts/1154418712892...
"No mail-in or 'Early' Voting, Yes to Voter ID! Watch how totally dishonest the California Prop Vote is! Millions of Ballots being 'shipped.' GET SMART REPUBLICANS, BEFORE IT IS TOO LATE!!!"
Mail in voting is just better all around for a geographically diverse place as the US and I wish would be adopted by all states.
So excited to see how the right-wing pedants here disagree with this.
If so, I see a lot to dislike. The point I was making is that you can't anticipate what might come up. Just because it's worked thus far doesn't mean it's designed for resilience. There are a lot of ways you could miss out in that type of situation. It seems silly to make sure everything else is redundant and fault tolerant in the name of democracy when the democratic process itself isn't doing the same.
That’s just ridiculous in my opinion. Makes me wonder how many well-intentioned would-be voters end up missing out each election, because shit happens and voting is pretty optional.
What is the that group's deviation from the general voting population's preferences?
What are the margins of the votes on those ballot questions?
In most countries, in an election you vote for the member of parliament you want. Presidential elections and city council elections are held separately, but are equally simple. In any one election you cast your vote for one person, and that's it.
With this kind of election, many countries manage to hold elections on paper ballots, count them all by hand, and publish results by midnight.
But on an American ballot, you vote for, for example:
    - US president
    - US senator  
    - US member of congress  
    - state governor  
    - state senator  
    - state member of congress  
    - several votes for several different state judge positions  
    - several other state officer positions  
    - several votes for several local county officers  
    - local sheriff  
    - local school board member  
    - several yes/no votes for several proposed laws, whether they should be passed or not
Here in Indonesia, in a city of 2 million people there are over 7000 voting stations. While we vote on 5 ballots (President, plus Legislative at the National, Province, and City/Regency levels), we still use paper ballots and count them by hand.
There is a ballot tracking system as well, I can see and be notified as my ballot moves through the counting system. It's pretty cool.
I actually just got back from dropping off my local elections ballot 15m ago, quick bike trip maybe a mile or so away and back.
Of course, because it makes it easy for people to vote, the republicans want to do away with it. If you have to stand in line for several hours (which seems to be very normal in most cities) and potentially miss work to do it that's going to all but guarantee that working people and the less motivated will not vote.
So yes in places that only do in person voting, national or state holiday.
https://nltimes.nl/2025/10/29/ns-hit-microsoft-cloud-outage-...
It should be noted that the article isn't complete: while the travel planner and ticket machines were the first to fail, trains were cancelled soon after; it took a few hours before everything restarted.
Based on what the conductors said, I would speculate that the train drivers' digital schedules were not working, so they didn't know where to go next.
This list doesn't have anything that looks relevant: https://www.rijdendetreinen.nl/en/disruptions/archive?date_b...
The day does not appear as an outlier in the monthly statistics: https://www.rijdendetreinen.nl/en/statistics/2025/10
I don't find a detailed statistic on the overall delays, but the per-station statistics for Amsterdam Centraal say 5% of trains were cancelled and 17% were delayed by 5 minutes or more (mostly by 10 minutes): https://www.rijdendetreinen.nl/en/train-archive/2025-10-29/a...
I do need a human to provision a few servers and configure e.g. load balancing and when to spin up additional servers under load. But that is far less of a PITA than having my systems tied to a specific provider or down whenever a cloud precipitates.
The moment you choose to use S3 instead of hosting your own object store, though, you either use AWS because S3 and IAM already have you, or spend more time on the care and feeding of your storage system as opposed to actually doing the thing your customers are paying you to do.
It's not impossible, just complicated and difficult for any moderately complex architecture.
One very important thing is that I can authorise specific web clients (users) to access specific resources from S3, such as a document that one user can download but that others with the link cannot.
Thank you!
Another way you can do it is generating pre-signed URLs in your backend on each request to download something... but the URL that is generated when you do that is only valid for some small time period, so not a stable URL at all.
In my use case, I needed stable URLs, so I went the proxy route.
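For anyone weighing the two approaches, the pre-signed URL route is a one-liner with boto3 (bucket and key names below are made up); the catch is exactly the expiry mentioned above:

    import boto3

    s3 = boto3.client("s3")
    url = s3.generate_presigned_url(
        "get_object",
        Params={"Bucket": "example-private-bucket", "Key": "docs/report.pdf"},
        ExpiresIn=900,  # valid for 15 minutes, so not a stable URL
    )
    print(url)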
I really do feel the only viable future for clouds is hybrid or agnostic clouds.
Horses were famously tamed in 2007 after AWS released S3 to the public, this is the best of times.
Old trains had paper tickets, the locomotive was its own power source, the conductor had a flashlight, and the conductor could sell tickets for cash.
And if everything else failed, the conductor would just let you ride for free.
Now everything's so interconnected that any one part failing brings everything to a halt.
Personally I am thinking more and more about hetzner; yes, I know it's not an apples-to-apples comparison. But it's honestly so good.
Someone had created a video where they showed the underlying hardware etc. I am wondering if there is something like https://vpspricetracker.com/ but with geek-benchmarks as well.
The video was affiliated with scalahosting, but I still don't think there was too much bias from them, and at around 3:37 they show a graph comparing prices: https://www.youtube.com/watch?v=9dvuBH2Pc1g
Now it shows how contabo has better hardware, but I am pretty sure there might be some other issues, and honestly I feel a sense of trust with hetzner that I am not sure about with others.
Either hetzner, or self hosting stuff personally, or just having a very cheap VPS and moving to hetzner if need be (though hetzner is already pretty cheap), or I might use some free services that I know are good as well.
https://blog.cloudflare.com/rearchitecting-workers-kv-for-re...
Personally I just trust cloudflare more than google, given how their focus is on security whereas google feels googly...
I have heard some good things about Google Cloud Run, and Google's interface feels the best out of AWS, Azure, and GCloud, but I would still just prefer cloudflare/hetzner.
Another question: has there ever been a list of all major cloud outages? I'm interested in how many times Google Cloud and the other providers have gone majorly down. Is there a website/git project that tracks this?
I have never had much confidence in Azure as a cloud provider. The vertical integration of all the things for a Microsoft shop was initially very compelling. I was ready to fight that battle. But, this fantasy was quickly ruined by poor execution on Microsoft's part. They were able to convince me to move back to AWS by simply making it difficult to provision compute resources. Their quota system & availability issues are a nightmare to deal with compared to EC2.
At this point I'd rather use GCP over Azure and I have zero seconds of experience with it. The number of things Microsoft gets right in 2025 can be counted single-handedly. The things they do get right are quite good, but everything else tends to be extremely awful.
I remember I at one point had expanded enough menus that it covered the entirety of the screen.
Never before have I felt so lost in a cloud product.
Yeah, that had some fun ideas but was way more confusing than it needed to be. But also that was quite a few years back now. The Portal ditched that experience relatively quickly. Just long enough to leave a lot of awful first impressions, but not long enough for it to be much more than a distant memory at this point, several redesigns later.
[0] The name "Blades" for that came from the early years of the Xbox 360, maybe not the best UX to emulate for a complex control panel/portal.
Like, AWS, and GCP to a lesser extent, has a principled approach where simple click-ops goals are simple. You can access the richer metadata/IAM object model at any time, but the wizards you see are dumb enough to make easy things easy.
With Azure, those blades allow tremendously complex “you need to build an X Container and a Container Bucket to be able to add an X” flows to coexist on the same page. While this exposes the true complexity, and looks cool/works well for power users, it is exceedingly unintuitive. Inline documentation doesn’t solve this problem.
I sometimes wonder if this is by design: like QuickBooks, there’s an entire economy of consultants who need to be Certified and thus will promote your product for their own benefit! Making the interface friendly to them and daunting to mere mortals is a feature, not a bug.
But in Azure’s case it’s hard to tell how much this is intentional.
I don't want to pay for or lock myself into, "Azure Insights".
I just want to see the logs that I know are available, if only I can remember the right buttons to click.
The worst place to try is "Monitoring > Logs", this is where you get faced up front with a query designer. I've never worked out how to do a simple "list by time" on that query designer, but it doesn't matter, because if you suffer through that UX, you find out that's not actually where the logs are anyway.
You have to go down a different path. Don't be distracted by "Log Stream", that's not it either, it sounds useful but it's not. By default it doesn't log anything. If you do configure it to log, then it still doesn't actually log everything.
What you have to actually do, and I've had to open the portal to check this, is click "Diagnose and Solve Problems" and then look for "Diagnostic tools" and then a small link to "Application Event Logs".
Finally you get to your logs, although it's still a bad way to try to view logs, it's at least marginally better than the real windows event viewer, an application that feels like it hasn't been updated since NT4. ( Although some might suggest that's a good thing. )
By bringing those eyeballs onto your cloud console, you're creating infinitely more opportunities for branded interaction and discovery of your other cloud products - you could even quantify these eyeballs as you would ad inventory! There should have been an arms race for each cloud provider to have the best log-tailing and log-searching and log-aggregation system imaginable. OTel could have been killed before it began, because Honeycomb and its other originators would have been acquired years ago and made specific locked-in value-adds for each cloud.
But nobody had this foresight, and thus comments like yours are absolutely correct. OTel is a blessing and I love the tools coming out. But from a cloud provider's perspective, it's a massive missed opportunity that continues to be missed.
I think that's what Application Insights has always been, Azure's free-to-start, suggest-out-of-the-box Honeycomb. App Insights had a long slow road away from Microsoft-specific log and metrics ingesters that weren't OTel, but it is hard to argue that standard ingestors are a bad idea. App Insights still downplays that it can be "just a Honeycomb" using only OTel sources and still encourages "secret sauce" ingestors in addition to OTel ones. App Insights is a small moat (around a data lake; to mix metaphors). That said, it's also a standards-supporting tool now as well.
It's not been as clear of an arms race because AWS and GCP didn't invest in it in a similar way and it mostly impacted what are often called "dark matter" teams (Microsoft shops doing "boring" stuff that rarely makes HN headlines), but I have worked in teams that absolutely favored Azure over AWS/GCP with one of the reasons being Application Insights was an easy install and powerful first-party supported tool rather than an extra third party vendor relationship like Grafana/Honeycomb/Dynatrace/etc.
Here's a somewhat ancient Stack Overflow screenshot I found: https://i.sstatic.net/yCseI.png
(I think that's from near the transition because it has full "windowing" controls of minimize/maximize/close buttons. I recall a period with only close buttons.)
All that blue space you could keep filling with more "blades" as you clicked on things until the entire page started scrolling horizontally to switch between "blades". Almost everything you could click opened in a new blade rather than in place in the existing blade. (Like having "Open in New Window" as your browser default.)
It was trying to merge the needs of a configurable Dashboard and a "multi-window experience". You could save collections of blades (a bit like Niri workspaces) as named Dashboards. Overall it was somewhere between overkill and underthought.
(Also someone reminded me that many "blades" still somewhat exist in the modern Portal, because, of course, Microsoft backwards compatibility. Some of the pages are just "maximized Blades" and you can accidentally unmaximize them and start horizontally scrolling into new blades.)
depending on the resource you're accessing, you can get 5+ sections each with their own ui/ux on the same page/tab and it can be confusing to understand where you're at in your resources
if you're having trouble visualizing it, imagine an url where each new level is a different application with its own ui/ux and purpose all on the same webpage
I never understood why a clear and consistent UI and improved UX isn't more of a priority for the big three cloud providers. Even though you talk mostly via platform SDKs, I would consider a better UI, especially initially, a good way to bind new customers and get them to pick your platform over others.
I guess with their bottom line they don't need it (or cynically, you don't want to learn and invest in another cloud if you did it once).
For some reason this applies to all AWS, GCP and Azure. Seems like the result of dozens of acquisitions.
Any time something is that unintuitive to get started, I automatically assume that if I encounter a problem that I’ll be unable to solve it. That thought alone leads me to bounce every time.
AWS Is a complete mess. Everything is obscured behind other products, and they're all named in the most confusing way possible.
MSFT : Hold my beer...
TBH, GCP is very good! More people should use it.
https://cloud.google.com/resource-manager/docs/project-suspe...
I'd hope you can create a Google Cloud account under a completely different email address, but I do as little business with Google as I can get away with, so I have no idea.
>TBH, GCP is very good! More people should use it.
These takes couldn't be further apart. Gotta love HN comments.
I feel like compliance is the entire point of using these cloud providers. You get a huge head start. Maintaining something like PCI-DSS when you own the real estate is a much bigger headache than if it's hosted in a provider who is already compliant up through the physical/hardware/networking layers. Getting application-layer checkboxes ticked off is trivial compared to "oops we forgot to hire an armed security team". I just took a look and there are currently 316 certifications and attestations listed under my account.
Microsoft really wants you to use their PaaS offerings, and so things on Azure are priced accordingly. For a Microsoft shop just wanting to lift-and-shift, Azure isn't the best choice unless the org has that "nobody ever got fired for buying Microsoft" attitude.
They think they have the market captured, but I think what their dwindling quality and ethics are really going to drive is adoption of self hosting and distributed computing frameworks. Nerds are the ones who drove adoption of these platforms, and we can eventually end it if we put in the work.
Seriously with container technology, and a bit more work / adoption on distributed compute systems and file storage (IPFS,FileCoin) there is a future where we dont have to use big brothers compute platform. Fuck these guys.
I really hope this pushes the internet back to how it used to be, self hosted, privacy, anonymity. I truly hope that's where we're headed, but the masses seem to just want to stay comfortable as long as their show is on TV
if all companies focused on fixing each and every social issue that exists in the world, how would they make any money?
From 2000-2016 most tech marketing/branding was aimed at some kind of social benefit.
I would link to that article, but that one does seem down ;)
> They're stating they're working with the Azure teams, so I suspect this is related.
Credit card information would be recorded by the POS, synced to a mini-server in the back office (using store-and-forward to handle network issues) and then in a batch process overnight, sent to HQ where the payment was processed.
It wasn't until chip-and-PIN was rolled out that they started supporting "online" (i.e. processed then and there) card transactions, and even then the old method still worked if there was a network issue or power failure (all POSes had their own UPS).
The only real risk at the time was that someone tried to pay with a cancelled credit card - the bank would always honour the payment otherwise. But that was pretty uncommon back then, as you'd have to phone your bank to do it, not just press a button in an app.
Chick-fil-a has this.
One of the tech people there was on HN a few years ago describing their system. Credit card approval slows down the line, so the cards are automatically "approved" at the terminal, and the transaction is added to a queue.
The loss from fraudulent transactions turns out to be less than the loss from customers choosing another restaurant because of the speed of the lines.
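A toy sketch of the store-and-forward / deferred-authorization idea described above (purely illustrative, nothing like either company's actual POS software): the sale is recorded locally no matter what, and forwarding happens whenever the network allows, e.g. in an overnight batch.

    # Toy store-and-forward queue for payment records.
    import json, pathlib

    QUEUE = pathlib.Path("pending_transactions.jsonl")

    def record_sale(txn: dict) -> None:
        with QUEUE.open("a") as f:
            f.write(json.dumps(txn) + "\n")   # durable before the customer walks away

    def forward_batch(send) -> int:
        """Forward queued transactions; send() returns False when the network is down."""
        if not QUEUE.exists():
            return 0
        lines = QUEUE.read_text().splitlines()
        sent = 0
        for line in lines:
            if send(json.loads(line)):
                sent += 1
            else:
                break                          # keep the rest queued and retry later
        remaining = lines[sent:]
        QUEUE.write_text("\n".join(remaining) + ("\n" if remaining else ""))
        return sent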
I go there daily because it's a nice 30min round trip walk and I wfh. I go up there to get a diet coke or something else just to get out of the house. It amazes me when I see a handwritten sign on the door "closed, system is down". I've gotten to know the cashiers so I asked, and it's because the internet connection goes down all the time. That store has to be one of the most poorly run things I've ever seen, yet it stays in business somehow.
Your responses imply that you think people are questioning whether you would lose money on the deal while we are instead saying you’ll get laughed out of the store, or possibly asked never to come back.
It seems like an easy problem to fix and a retail store being closed for a whole weekday because of inet access sounds crazy to me.
1: I doubt they're "with it" enough to put together a backup arrangement for internet.
2: Their internet problems are probably due to a cheapo router, loose wire, etc.
3: The employees probably like the break.
Good luck if you make this work for you, it would be exciting to hear about if you're able to get them to work with you.
EDIT: their last quarterly was 36%. they lost $3.7bn in 24Q4 -- the christmas quarter. sold to PE in Q1.
Why doesn't someone in the store at least have one of those manual kachunk-kachunk carbon copy card readers in the back that they can resuscitate for a few days until the technology is turned back on? Did they throw them all away?
And that was the day Visa had a full on outage. We would walk into one shop, try to buy stuff, get declined, then go into the next and get accepted because they were running in offline mode.
Got a nice big bill from my cellphone carrier for making the call to visa to ask them wtf as well.
How aptly descriptive.
The stores are in the hood or middle of nowhere. The customers don’t have many options.
Last week I couldn't pay for flowers for grandma's grave because the smartphone-sized card terminal refused to work; it was stuck in a charging/booting loop, so I had to get cash. Though my partner thinks the seller actually wanted cash without a receipt, to keep it for herself off the books.
Whereas the smaller, owner-run stores have more leeway; the local tiny grocery "sold" all freezer/refrigerator food for cheap/free during a power failure. The big Walmart closed and threw everything away the next day.
God help me if I hand someone $25 for a $14.75 total. I’m getting small bills back.
I wonder what they teach in Germany.
Its not the we are not capable. Its, is the business willing to assume the risk?
There's a fairly large supermarket near me that has both kinds of outages.
Occasionally it can't take cards because the (fiber? cable?) internet is down, so it's cash only.
Occasionally it can't take cash because the safe has its own cellular connection, and the cell tower is down.
I was at Frank's Pizza in downtown Houston a few weeks ago and they were giving slices of pizza away because the POS terminal died, and nobody knew enough math to take cash. I tried to give them a $10 and told them to keep the change, but "keep the change" is an unknown phrase these days. They simply couldn't wrap their brains around it. But hey, free pizza!
I feel pretty justified in my previous decisions to move away from Azure. Using it feels like building on quicksand…
At this point I dont believe that any one of them is any better or reliable than the others.
I felt this way about AWS last week
And microsoft.com too - that's gotta hurt
- on a US tenant I am unable to access login.microsoftonline.com and the login flow stalls on any SSO authentication attempt.
- on a European tenant, probably germany-west, I am able to login and access the Azure portal.
Luckily, we moved off Azure Front Door about a year ago. We’d had three major incidents tied to Front Door and stopped treating it as a reliable CDN.
They weren’t global outages, more like issues triggered by new deployments. In one case, our homepage suddenly showed a huge Microsoft banner about a “post-quantum encryption algorithm” or something along those lines.
Kinda wild that a company that big can be so shaky on a CDN, which should be rock solid.
Error: visual-studio-code: Download failed on Cask 'visual-studio-code' with message: Download failed: https://update.code.visualstudio.com/1.105.1/darwin-arm64/st...
The root zone and www. do not: https://dnschecker.org/#A/microsoft.com (all resolvers return records)
And querying https://www.microsoft.com/ results in HTTP 200 on the root document, but the page elements return errors (a 504 on the .css/.js documents, a 404 on some fonts, Name Not Resolved on scripts.clarity.ms, Connection Timed Out on wcpstatic.microsoft.com and mem.gfx.ms). That many different kinds of errors is actually kind of impressive.
I'm gonna say this was a networking/routing issue. The CDN stayed up, but everything else non-CDN became unroutable, and different requests traveled through different paths/services, but each eventually hit the bad network path, and that's what created all the different responses. Could also have been a bad deploy or a service stopped running and there's different things trying to access that service in different ways, leading to the weird responses... but that wouldn't explain the failed DNS propagation.
I wonder if this is microsoft "learning" to "prevent" such an issue and instead triggered it...
"One often meets his destiny on the path he takes to avoid it" -- Master Oogway
2028: the year of migrating from a managed provider to the cloud
2029: the year of migrating from the cloud to your own metal in a rack
People keep thinking the solution to their problems is to do something new (that they don't fully understand).
TIL it's called Nirvana Fallacy
We used to call it "The grass is always greener on the other side of the fence."
> There are currently no active events. Use Azure Service Health to view other issues that may be impacting your services.
Links to a page on Azure Portal which is down...
"We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly."
Moving a website quickly is never fun.
It's only after the fact they are transparent about the impact
It acts as a GSLB controller inside Kubernetes — doing DNS-level health checks, region awareness, and automatic failover between clusters when one goes down.
It integrates with ExternalDNS and supports multiple DNS providers (Infoblox, Route53, Azure DNS, NS1, etc.), so it can handle failover across both on-prem and cloud clusters.
It’s not a silver bullet for every architecture, but it’s one of the few OSS projects that make multi-region failover actually manageable in practice.
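Conceptually (not the project's actual code), that DNS-level failover boils down to a loop like this: probe each cluster's health endpoint and only publish records for the clusters that answer. Endpoint names and IPs below are hypothetical.

    # Toy GSLB-style failover: publish DNS records only for healthy clusters.
    import requests

    CLUSTERS = {
        "eu": {"health": "https://eu.example.internal/healthz", "ip": "203.0.113.10"},
        "us": {"health": "https://us.example.internal/healthz", "ip": "198.51.100.20"},
    }

    def healthy(url: str) -> bool:
        try:
            return requests.get(url, timeout=2).status_code == 200
        except requests.RequestException:
            return False

    def desired_records(hostname: str = "app.example.com"):
        ips = [c["ip"] for c in CLUSTERS.values() if healthy(c["health"])]
        # Hand this off to whatever owns the zone (ExternalDNS, Route53,
        # Azure DNS, ...) and keep TTLs short so failover is quick.
        return [(hostname, ip) for ip in ips]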
How did we get here? Is it because of scale? Going to market in minutes by using someone else's computers instead of building out your own, like co-location or dedicated servers, like back in the day.
Now, they go down a lot less frequently, but when they do, it's more widespread.
I work on a product hosted on Azure. That's not the case. Except for front door, everything else is running fine. (Front door is a reverse proxy for static web sites.)
The product itself (an iot stormwater management system) is running, but our customers just can't access the website. If they need to do something, they can go out to the sites or call us and we can "rub two sticks together" and bypass the website. (We could also bypass front door if someone twisted our arms.)
Most customers only look at the website a few times a year.
---
That being said, our biggest point of failure is a completely different iot vendor who you probably won't hear about on Hacker News when they, or their data networks, have downtime.
Now I will admit I am more of a point-and-click person; but if I had to I could have figured out how to use the command line.
> Big Tech lobbying is riding the EU’s deregulation wave by spending more, hiring more, and pushing more, according to a new report by NGO’s Corporate Europe Observatory and LobbyControl on Wednesday (29 October).
> Based on data from the EU’s transparency register, the NGOs found that tech companies spend the most on lobbying of any sector, spending €151m a year on lobbying — a 33 percent increase from €113m in 2023.
Gee whizz, I really do wonder how they end up having all the power!
I think the response lies in the surrounding ecosystem.
If you have a company it's easier to scale your team if you use AWS (or any other established ecosystem). It's way easier to hire 10 engineers that are competent with AWS tools than it is to hire 10 engineers that are competent with the IBM tools.
And from the individuals perspective it also make sense to bet on larger platforms. If you want to increase your odds of getting a new job, learning the AWS tools gives you a better ROI than learning the IBM tools.
Pick your point on the scale
But the cloud compute market is basically centralized into 2.5 companies at this point. The point of paying companies like Azure here is that they've in theory centralized the knowledge and know-how of running multiple, distributed datacenters, so as to be resilient.
But when we keep seeing outages encompassing more than a single failure domain, it should be fair game for engineers / customers to ask "what am I paying for, again?"
Moreover, this seems to be a classic case of large barriers to entry (the huge capital costs associated with building out a datacenter) barring new entrants into the market, coupled with "nobody ever got fired for buying IBM" level thinking. Are outages like these truly factored into the napkin math that says externalizing this is worth it?
In our highly interconnected world, decentralization paradoxically requires a central authority to enforce decentralization by restricting M&A, cartels, etc.
Stonks
You name them. Other good providers you have experience with?
There is no reason for an expensive cloud. Never has been, but decision makers tried to keep their pants dry.
If Azure goes down and nobody feels it, does Azure really matter?
If Azure goes down, it's mostly affecting internal stuff at big old enterprises. Jane in accounting might notice, but the customers don't. Contrast with AWS which runs most of the world's SaaS products.
People not being able to do their jobs internally for a day tends not to make headlines like "100 popular internet services down for everyone" does.
Even the national digital id service is down.
Can't help but smirk as my country is ramming through "Digital ID" right now
What a time to be alive.
The Microsoft status page mostly referenced the portal outage, but it was more than that.
- Cloudflare for R2 (object storage) and CDN (Fastly + Backblaze also available).
- Two VPS/server providers with a decent reputation and mid-size (use a comparison site like https://serversearcher.com, or look directly into providers like Hetzner or Latitude).
- PlanetScale or Neon for the database if you don't co-locate it (though better to use someone like DigitalOcean, Vultr or Latitude who offer databases too).
But then who do we blame when things are down? If we manage our own infrastructure we have to stay late to fix it when it breaks instead of saying “sorry, Microsoft, nothing we can do” and magically our clients accepting that…
Lol
And more importantly nobody loses any reputation except AWS/Azure/Google.
The real reason is that outages are not your fault. It's the new version of "nobody ever got fired for buying IBM" - later it became MS, and now it's any big cloud provider.
On the merits though, I agree, haven’t had any serious issues with Hetzner.
[1] https://azure.microsoft.com/en-us/products/frontdoor
[2] https://learn.microsoft.com/en-us/azure/frontdoor/front-door...
It's CDN and FrontDoor at least.
And it's very clear from these updates that they're more focused on the portal than the product, their updates haven't even mentioned fixing it yet, just moving off of it, as if it's some third party service that's down.
Unsubstantiated idea: the support contract likely says there is a window between each reporting step, and the status page is the last step, the one in the legal documents, giving them several more hours before the clauses trigger.
Oh, that'll be why Scan & Go was down yesterday evening. I thought it was another instance of an iOS 26 update breaking their crappy code.
[0] https://corporate.asda.com/newsroom/2025/22/09/asda-announce...
1. Mandatory
2. "Voluntary"
3. Voluntary
And I suspect that very little of what the NSA does falls into category 3. As Sen Chuck Schumer put it "you take on the intelligence community, they have six ways from Sunday at getting back at you"
https://microsoft.com/deviceloginus
Seems like they migrated the non-Gov login but not the Gov one. C'mon Microsoft, I've got a deadline in a few days.
Best of luck to the teams responding to this incident.
Azure Portal Access Issues
Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.
This message was last updated at 16:35 UTC on 29 October 2025
----
Azure Portal Access Issues
We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.
This message was last updated at 16:18 UTC on 29 October 2025
-- From the Azure status page
When you find an honest vendor, cherish them. They are rare, and they work hard to earn and keep your confidence.
So if we look at these companies' bottom lines, all those big wigs are actually doing something right. Sales and lobbying capacity is way more effective than reliability or good engineering (at least in the short term).
You know nobody is migrating off of AWS or Azure because of these.
That's certainly not the right conclusion.
I guess the GCP is next.
There's no way to tell, and after about 30 minutes, the release process on VS Code Marketplace failed with a cryptic message: "Repository signing for extension file failed.". And there's no way to restart/resume it.
I have been having issues with GitHub and the winget tool for updates throughout the day as well. I imagine things are pulling from the same locations on Azure for some of the software I needed to update (NPM dependencies, and some .NET tooling).
Much of Xbox is behind that too.
"We’re investigating an issue impacting Azure Front Door services. Customers may experience intermittent request failures or latency. Updates will be provided shortly."
This mom’s son was asking Tesla’s Grok AI chatbot about soccer. It told him to send nude pics, she says
xAI, the company that developed Grok, responds to CBC: 'Legacy Media Lies'
It'd be interesting to understand the cause here. Pretty big impact on services we use.
(couldn't resist adding it. i acknowledge this comment adds no value to the discussion)
For example when I try to log into our payroll provider Brightpay, it sends me here:
https://bpuk1prod1environment.blob.core.windows.net/host-pro...
Edit: As of 9:19 AM Pacific time, I'm now getting successful A responses but they can take several seconds. The web server at that address is not responding.
Microsoft CDN
There, that's it. You're selling it to (hopefully) technical people
But seriously I thought it would be the console, not a CDN.
This message was last updated at 16:35 UTC on 29 October 2025”
And so is Microsoft: http://www.microsoft.com/
The actual stuff I was working on (App Insights, Function App) that was still open was operational.
Doesn't seem to be too bad of an outage unless you were relying on Azure Front Door.
>What is required to be able to use MyGet? ... MyGet runs its operations from the Microsoft Azure in the West Europe region, near Amsterdam, the Netherlands.
We had to bypass the Frontdoor
There's a lot of outages this month!
Any guess on what's causing it?
In hindsight, I guess the foresight of some organizations to go multi-cloud was correct after all.
It's not easy though.
I'm curious—at what point did you decide the overhead was worth it? Was it after experiencing an outage, or did you architect for it from day one?
As someone launching a product soon (more on the builder/product side than infra-engineer), I keep wrestling with this. The pragmatist in me says "start simple, prove the concept, then layer in resilience." But then you see events like this week and think "what if this happens during launch?"
How did you handle the operational complexity? Did you need dedicated DevOps folks, or are there patterns/tools that made it manageable for a smaller team?
I would recommend focusing on multi-region within a single CSP instead (both for workloads AND your tooling), which covers the vast majority of incidents and lays some of the architectural foundation for multi-cloud down the road. Develop failover plans for each service in your architecture (eg. planned/tested runbooks to migrate to Traffic Manager in the event AFD goes down)
Also choose your provider wisely. We experience 3-5x the number of service-impacting incidents on Azure that we do on AWS. I'm sure others have different experiences, but I would never personally start a company on Azure. AWS has its own issues, of course, but reliability has not been a major one (relatively speaking) over the past 10 years. Last week's incident with DynamoDB in us-east-1 had zero impact on our AWS workloads in other regions.
I also got a weird notification in VS2022 that my license key was upgraded to Enterprise, but we did not purchase anything.
That said, I don't hear about GCP outages all that often. I do think AWS might be leading in outages, but that's a gut feeling, I didn't look up numbers.
This isn't GCP's fault, but the outage ended up taking down Cloudflare too, so in total impact I think that takes the cake.
Few customers....few voices to complain as well.
Institutional knowledge matters. Just has to be the right institution is all.
That is a pass.
To be clear, they should get criticism. They should be held liable for any damage they cause.
But that they remain the biggest cloud offering out there isn't something you'd expect to change from a few outages that, by most all evidence, potential replacements have, as well? More, a lot of the outages potential replacements have are often more global in nature.
I thought one of the major selling points of the big cloud providers was that they were more reliable than running your own stuff (by which I mean anything from a VPS to multiple data centres, depending on your scale). Compared to those alternatives they seem to be less reliable in practice!
The solution is to have a multi-region, or even multi-cloud setup, but then bang goes the "they do all the work for you" argument (which i doubt anyway).
You are further asserting that these outages prove they are not still more reliable than home spun. Is that the case? More than a few people aren't ready for a single hard drive to crash on the stuff they are doing.
if that's true then it's a sign that Azure's control / data plane separation is doing its job! at least for now
This is not the first or second time this has happened; multiple hyperscalers have failed one by one.
FD and CDN are global resources and are experiencing issues. Probably some other global resources as well.
Hate to say it, but DNS is looking like it's still the undisputed champ.
(Coder is currently at the top of the experiment list. Any other suggestions?)
    HTTPSConnectionPool(host='schemas.xmlsoap.org', port=443): Max retries exceeded with url: /soap/encoding/ (Caused by SSLError(CertificateError("hostname 'schemas.xmlsoap.org' doesn't match '*.azureedge.net'")))
160k+ results on GitHub: https://github.com/search?q=http%3A%2F%2Fschemas.xmlsoap.org...
> An inadvertent tenant configuration change within Azure Front Door (AFD) triggered a widespread service disruption affecting both Microsoft services and customer applications dependent on AFD for global content delivery. The change introduced an invalid or inconsistent configuration state that caused a significant number of AFD nodes to fail to load properly, leading to increased latencies, timeouts, and connection errors for downstream services.
> As unhealthy nodes dropped out of the global pool, traffic distribution across healthy nodes became imbalanced, amplifying the impact and causing intermittent availability even for regions that were partially healthy. We immediately blocked all further configuration changes to prevent additional propagation of the faulty state and began deploying a ‘last known good’ configuration across the global fleet. Recovery required reloading configurations across a large number of nodes and rebalancing traffic gradually to avoid overload conditions as nodes returned to service. This deliberate, phased recovery was necessary to stabilize the system while restoring scale and ensuring no recurrence of the issue.
> The trigger was traced to a faulty tenant configuration deployment process. Our protection mechanisms, to validate and block any erroneous deployments, failed due to a software defect which allowed the deployment to bypass safety validations. Safeguards have since been reviewed and additional validation and rollback controls have been immediately implemented to prevent similar issues in the future.
So, so far they're saying it's a combination of bad config + their config-validator had a bug. Would love more details.
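The 'last known good' piece is a pattern worth copying at any scale: only promote a config to LKG after it has actually served traffic healthily, so there is always a proven rollback target. A rough sketch, assuming hypothetical apply/health-check hooks:

    # Hypothetical last-known-good (LKG) config handling: the LKG snapshot only
    # advances after the new config has proven itself in production.
    import copy, time

    class ConfigManager:
        def __init__(self, initial_config, apply, healthy):
            self.lkg = copy.deepcopy(initial_config)
            self.apply = apply        # pushes a config to the fleet
            self.healthy = healthy    # returns True if the fleet looks good

        def deploy(self, new_config, bake_seconds=600):
            self.apply(new_config)
            time.sleep(bake_seconds)                   # let it serve real traffic
            if self.healthy():
                self.lkg = copy.deepcopy(new_config)   # promote to last known good
                return True
            self.apply(self.lkg)                       # otherwise revert
            return False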
edit: it worked once, then died again. So I guess - some resolvers, or FD servers may be working!
Edit: nope looks like there's actually a spike on GCP as well
Funnily enough, AI has been training on its own data as generated by users writing AI conversations back to the internet - there's a feedback loop at play.
QNBQ-5W8
In other words, people reporting outages at AWS are probably having trouble with microsoft-run DNS services or caching proxies. It's not that the issues aren't there, it's that the internet is full of intermingled complexity. Just that amount of organic false-positives can make it look like an unrelated major service is impacted.
I noticed that winget is also down eg.
  winget upgrade fabric
  Failed in attempting to update the source: winget
  An unexpected error occurred while executing the command:
  InternetOpenUrl() failed.
  0x80072ee7 : unknown error
There is no way it’s DNS
It was DNS
I can at least login to Azure. But several MS sites are down.
How can one of the richest companies in the world not offer a better service?
Better service costs money.
Except that it is not!
Interesting times...
What terrible advice.
But what if I don't want AI brought to me?
Although judging by the available transports it will likely be colonized by nazis.
    > “Microsoft is being recognized and rewarded at levels never seen before,” Nadella wrote. “And yet, at the same time, we’ve undergone layoffs. This is the enigma of success in an industry that has no franchise value.”
     
    > Nadella explained the disconnect between thriving financials and layoffs by stating that “progress isn’t linear” and that it is “sometimes dissonant, and always demanding.”
    > These decisions are among the most difficult we have to make. They affect people we’ve worked alongside, learned from, and shared countless moments with—our colleagues, teammates, and friends.
Unless that's a euphemism for "vibe coding", no.
> We have confirmed that an inadvertent configuration change as the trigger event for this issue.
Save the speculation for Reddit. HN is better than that.