Uptime and risk management – the data centre prerogative
The data centre is the heart of the modern organisation. Irrespective of where it is located, on-premises, in a collocated environment or part of a hyper-scale cloud service, companies rely on it to ensure they have access to their applications, that their data is securely stored and that should anything go wrong that they will be able to recover quickly and efficiently.
While organisations architect their IT strategies to ensure they are protected, key criteria for all companies remain the data centre itself, and how resilient its infrastructure is. All the planning in the world would have no impact if the physical infrastructure at the data centre is unable to deliver the required availability.
Stephane Duproz, CEO of Africa Data Centres, explains that data centres are typically graded according to certifications issued by the US-based Uptime Institute, with level three and level four being the two primary levels that organisations focus on. Data centres certified at these levels need to guarantee 99.98% and 99.995% availability respectively - equating to 1.6 hours and 20 minutes of downtime a year.
“The primary difference between the two certification levels is that a level three data centre operates on an n+1 basis – where there is one standby unit for all critical systems -whereas level four data centres require all systems to have an equivalent backup system – providing full redundancy.
“The expectation from the clients, however, is that their systems are available 24/7, but the cost implications of building the infrastructure to provide full redundancy are significant and as such, the decision to make that investment is driven by the risk appetite of the companies concerned. We typically see that it’s only those organisations that would see a significant financial impact from any downtime – such as banks – who choose to make that commitment.”
As official levels three and four installations are dependent on achieving certification, they have become shorthand for data centres that aim to meet the specific criteria, even if they have not been officially certified. On the other hand, even data centres that are certified are continually striving to improve their service levels.
Risk management strategy
“Thus data centre operators that have level three and four facilities, albeit not officially certified, aim to ensure the service level across both is identical; ie, zero downtime. For companies the decision then becomes not one about availability but rather about risk mitigation,” says Duproz.
With the change in the way people work, companies are having to re-examine their priorities, focusing on how to support a workforce that is no longer centrally located. This has resulted in IT teams having to re-evaluate their risk profiles, assessing what elements of their IT infrastructure are mission-critical and what the impact would be of any downtime.
“We’ve seen our clients starting to look at the benefits that a level four data centre would provide, but they need it to be affordable and this is where the efforts to deliver greater uptime pay off,” he says. “Delivering this is, however, not simple. The ability to ensure that you get the required availability in a data centre environment depends heavily on the skills of the team. Downtime is much more likely to be the result of human error than of equipment failure.”
Looking to the future
While many companies are looking to the cloud as insurance in an uncertain time, Duproz points out that there will always be applications that simply are not suited to the cloud environment. Where the need for a tightly controlled environment, be it for operational, regulatory or performance issues, dictate that it operates from its own dedicated infrastructure. In these situations, the requirement for guaranteed availability will drive companies towards those facilities that ensure operational risk is minimised, be it an enhanced level three data centre of a full level four installation.