Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

12:18 PST, still down. Someone must be getting his ass chewed about this.


Fired is more like it. Though I doubt Jeff Bezos is the kind of guy to do something like that.


I used to work for Amazon. Unless this was willful I doubt anyone is going to get fired for it. After all, you'd be firing the guy who's least likely to make such a mistake ever again.


It's so silly to fire people for mistakes that no organization that exerts any mental capacity toward human resources would do such a thing. Even Apple didn't fire the guy who left a prototype at the bar.

In a factory in 1965, maybe, but no good employer is going to fire someone for making a mistake, no matter how costly.


> In a factory in 1965, maybe

Depends on the factory, and I doubt it's much more or less likely in the 21st century than it was in 1965. In all times and places, you have enlightened and unenlightened people. In all times and places you have good and bad leaders.

"In every time, in every place, the deeds of men remain the same."


> "In every time, in every place, the deeds of men remain the same."

Citation needed. I'm pretty sure there are fewer war deaths and fewer kittens burned for entertainment per head now than 200 years ago.


It's a quote. I leave that as homework.

Yes, there is less bad stuff, but the quote says nothing of frequency.


The story of the plane servicer who never mis-serviced a plane again after almost killing the pilot he worked for is cute, but screwing up big-time doesn't turn a (presumably) sloppy engineer into one that never messes up again.


I'm of the mind right now that the word "cute" really shouldn't be used in any other context than physical description. It's just condescending and rude.

The problem with your attitude is that it's based upon a premise that is almost never true: that screwups are caused by incompetence, and that they have singular (or overwhelmingly singular) sources.

Neither of these assumptions bear out in reality, and certainly not in our industry.

The vast majority of downtime events trace back to systemic failures, not a freak event, and are more often catalyzed by momentary lapses than long-standing incompetence. Do we penalize the tech who clicked the wrong link on a dashboard, or the guy who wrote the dashboard such that a critical action contains no safeties or confirmations? Or do we penalize the manager for not having any established documentation on protocols surrounding triggering critical actions?

The only reasonable stance here is to collectively take responsibility for the failure. It may feel good to hang someone out to dry, but in all likelihood their failure was only the final link in a long chain of failures that extended well beyond themselves.

You root cause what led to the event (going deeper than "a tech clicked on the wrong thing"), and you fix the root cause, and you move on.


To some extent I agree with you, but it is a slippery slope. If we always deny that one person really is a problem, we may retain a truly bad employee while building excessive safeguards that hinder productivity for others. In my experience this possibility is all too real.

A team of good people should learn from their mistakes and reduce hazards along the way. But bumper bowling is no fun for experienced players. It's a balance, and it does tend to shift as a company grows.


12:22 PST still down... Looking for Mushroom cloud to the NW


12:26 PST and no problems accessing from Seattle.


12:34 PST and OK from Vegas.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: