A = MTTF / (MTTF + MTTR)

Where "A" stands for availability, expressed as a percentage. "MTTF" is mean time to failure, which is the time that the system is actually operational, and "MTTR" is the time it takes to repair, restore, or recover the system.

Normally you only consider the period of time when the system was required to be operational. So, if your system is down for 30 minutes during a full seven-day, 168-hour week, you would divide 167.5 (the uptime) by 168 (the total available time) and get 99.7%. If the requirements were that the system be up only five days a week, eight hours a day, that same 30-minute outage would bring your availability number down to 39.5/40, or 98.75%.

In your question, you are not telling me how long the system was up. What you are asking for is a way to calculate availability based on a recovery time objective. If, in your example, the system were to recover in 30 minutes once in a week, your availability figure would be the 99.7% we discussed above. If instead the system went down for 30 minutes every hour, but never for longer than 30 minutes, you would still meet the Service Level Agreement you describe, but you'd only achieve 50% availability.

When you do use historical data to calculate uptime percentages, you should base them on the operational hours. So, I would use 90-hour weeks (6 * 15 hours) in my calculations. And I would count any outage, scheduled or not, that occurred during those periods.

If I misunderstood your question, or if you have other questions, don't hesitate to ask.

Evan L. Marcus

Editor's note: Do you agree with this expert's response? If you have more to share, post it in one of our discussion forums.
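The availability arithmetic in the answer above can be sketched in a few lines of Python. This is only an illustration, not part of the original response: the function name `availability` and its parameters are hypothetical, and the last case (a single 30-minute outage in a 90-hour operational week) is an assumed example rather than a figure from the answer.

```python
def availability(required_hours: float, downtime_hours: float) -> float:
    """Return availability as a percentage of the required operational hours.

    Hypothetical helper: uptime / (uptime + downtime), where the denominator
    is the time the system was required to be operational.
    """
    if required_hours <= 0:
        raise ValueError("required_hours must be positive")
    if not 0 <= downtime_hours <= required_hours:
        raise ValueError("downtime_hours must be between 0 and required_hours")
    uptime_hours = required_hours - downtime_hours
    return 100.0 * uptime_hours / required_hours


if __name__ == "__main__":
    cases = [
        ("24x7 week, one 30-minute outage", 168, 0.5),
        ("5 days x 8 hours, one 30-minute outage", 40, 0.5),
        ("down 30 minutes in every hour", 1, 0.5),
        # Assumed example: 6 days x 15 hours with one 30-minute outage.
        ("6 days x 15 hours, one 30-minute outage", 6 * 15, 0.5),
    ]
    for label, required, downtime in cases:
        print(f"{label}: {availability(required, downtime):.2f}%")
```

The same 30-minute outage produces a different percentage depending on the operational window chosen as the denominator, so the window should be agreed on before any figures are compared. Scheduled outages that fall inside the window count toward `downtime_hours` like any other outage.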
This was first published in May 2004