I had a production system go down this week - one minute no problem, the next, critical functionality stopped working. Worse, it didn't die because something broke - it went down by design. And did so without warning.
The premise that systems should "fail fast" is pretty well established - the idea has it's own wikipedia page, and any number of books talk about it as a fundamental premise.
For example, Release It! from the Pragmatic Bookshelf makes several references to Fail Fast in Chapter 5: Stability patterns.
Recent comments
3 days 21 hours ago
3 weeks 1 day ago
3 weeks 2 days ago
4 weeks 5 days ago
4 weeks 6 days ago
9 weeks 3 days ago
10 weeks 2 days ago
10 weeks 2 days ago
10 weeks 3 days ago
12 weeks 6 days ago