GEARSHIFT: Guaranteeing availability requirements in SLAs using hybrid fault tolerance
Chapter, Conference object, Peer reviewed
MetadataShow full item record
Original versionFahmy, Sonia; Li, Baochun; John, Lui [Eds.] 2015 IEEE Conference on Computer Communications (INFOCOM) p. 1373-1381, IEEE Press, 2015 10.1109/INFOCOM.2015.7218514
The dependability of ICT systems is vital for today's society. However, operational systems are not fault free. Providers and customers have to define clear availability requirements and penalties on the delivered services by using SLAs. Fulfilling the stipulated availability may be expensive. The lack of mechanisms that allow a fine control of the SLA risk may lead to over-dimension the provided resources. Therefore, a relevant question for ICT service providers is: How to guarantee the SLA availability in a cost efficient way? This paper studies how to combine different fault tolerant techniques with different costs and properties, in order to economically fulfill a given SLA requirement. GEARSHIFT is a mechanism that enables ICT providers to set the fault tolerance technique (gear ratio) needed, depending on the current service conditions and requirements. We illustrate how to use the proposed model in a backbone network scenario, using measurements from a production national network. Finally, we show that the total costs of delivering an ICT service follow a simple convex function, which allows an easy selection of the optimal risk by tuning properly the combination of fault tolerant techniques.