Hacker News
Bluesky April 2026 Outage Post-Mortem
threecheese
That’ll do it.
jandrese
tapoxi
pfraze
The simple answer is that atproto works like the web & search engines, where the apps aggregate from the distributed accounts. So the proper analogy here would be like yahoo going down in 1999.
isodev
drewg123
floating-io
Goroutines are actually better AFAIK because they distribute work on a thread pool that can be much smaller than the number of active goroutines.
If my quick skim created a correct understanding, then the problem here looks more like architecture. Put simply: does the memcached client really require a new TCP connection for every lookup? I would think you would pool those connections just like you would a typical database and keep them around for approximately forever. Then they wouldn't have spammed memcache with so many connections in the first place...
(edit: ah, it looks like they do use a pool, but perhaps the pool does not have a bounded upper size, which is its own kind of fail.)
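To make the "bounded upper size" point concrete, here is a minimal sketch of a connection pool with a hard cap. It is not the client from the post-mortem: `Conn`, `BoundedPool`, `TryGet`, and the `dial` callback are all hypothetical names, and a real memcached client would wrap a `net.Conn` and handle broken connections. The idea is only that once `max` connections exist, callers either wait or get an error instead of opening more sockets.

```go
package main

import (
	"context"
	"errors"
)

// Conn stands in for a real memcached connection (hypothetical type).
type Conn struct{ ID int }

// ErrPoolExhausted is returned by TryGet when every connection is in use.
var ErrPoolExhausted = errors.New("pool exhausted")

// BoundedPool never lets more than max connections exist at once.
type BoundedPool struct {
	slots chan struct{} // one token per connection that has been dialed
	idle  chan *Conn    // connections waiting to be reused
	dial  func() (*Conn, error)
}

// NewBoundedPool creates a pool capped at max connections.
func NewBoundedPool(max int, dial func() (*Conn, error)) *BoundedPool {
	return &BoundedPool{
		slots: make(chan struct{}, max),
		idle:  make(chan *Conn, max),
		dial:  dial,
	}
}

// dialOrRelease dials a new connection; the caller must already hold a slot.
func (p *BoundedPool) dialOrRelease() (*Conn, error) {
	c, err := p.dial()
	if err != nil {
		<-p.slots // give the slot back so a later call can retry
		return nil, err
	}
	return c, nil
}

// Get reuses an idle connection, dials if under the cap, and otherwise
// blocks until a connection is returned or ctx is cancelled.
func (p *BoundedPool) Get(ctx context.Context) (*Conn, error) {
	select {
	case c := <-p.idle: // prefer reuse over dialing
		return c, nil
	default:
	}
	select {
	case c := <-p.idle:
		return c, nil
	case p.slots <- struct{}{}:
		return p.dialOrRelease()
	case <-ctx.Done():
		return nil, ctx.Err()
	}
}

// TryGet is the non-blocking variant: it fails fast when the cap is hit.
func (p *BoundedPool) TryGet() (*Conn, error) {
	select {
	case c := <-p.idle:
		return c, nil
	default:
	}
	select {
	case p.slots <- struct{}{}:
		return p.dialOrRelease()
	default:
		return nil, ErrPoolExhausted
	}
}

// Put returns a connection obtained from this pool. It cannot block as long
// as only connections handed out by this pool are returned.
func (p *BoundedPool) Put(c *Conn) {
	p.idle <- c
}
```

With a cap of 2, a third `TryGet` while both connections are checked out returns `ErrPoolExhausted` rather than dialing again, so a burst of lookups turns into queued or rejected requests instead of a flood of new TCP connections.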
tombert
There are certainly plenty of projects where garbage collection is too slow, but I don't know that they're the majority, and more people would likely prefer memory safety by default.
mwkaufma
goekjclo
I expect this is common.
pembrook
Email and the internet don't have "downtime." Certain key infra providers do of course. ISPs can go down. DNS providers can go down. But the internet and email itself can't go down absent a global electricity outage.
You haven't built a decentralized network until you reach that standard imo. Otherwise it's just "distributed protocol" cosplay. Nice costume. Kind of like how everybody has been amnesia'd into thinking Obsidian is open source when it really isn't.
jonstaab
pfraze
nout
If tens of servers go down, then some people may start noticing a bit of inconvenience. If hundreds of servers go down, then some people may need to coordinate out of band on which relays to use, but generally speaking it still works ok.
jmclnx
The article does work in lynx, at least I can read it.