dc.contributor.author | Zahid, Feroz | |
dc.contributor.author | Taherkordi, Amirhosein | |
dc.contributor.author | Gran, Ernst Gunnar | |
dc.contributor.author | Skeie, Tor | |
dc.contributor.author | Johnsen, Bjørn Dag | |
dc.date.accessioned | 2019-04-23T08:28:43Z | |
dc.date.available | 2019-04-23T08:28:43Z | |
dc.date.created | 2018-12-11T13:28:04Z | |
dc.date.issued | 2018 | |
dc.identifier.citation | IEEE Transactions on Parallel and Distributed Systems. 2018, 29 (12), 2658-2671. | nb_NO |
dc.identifier.issn | 1045-9219 | |
dc.identifier.uri | http://hdl.handle.net/11250/2594982 | |
dc.description.abstract | Clouds offer flexible and economically attractive compute and storage solutions for enterprises. However, the effectiveness of cloud computing for high-performance computing (HPC) systems remains questionable. When clouds are deployed on lossless interconnection networks, like InfiniBand (IB), challenges related to load-balancing, low-overhead virtualization, and performance isolation hinder full utilization of the underlying interconnect. Moreover, cloud data centers are highly dynamic environments, which renders the static network reconfigurations typically used in IB systems infeasible. In this paper, we present a framework for a self-adaptive network architecture for HPC clouds based on lossless interconnection networks, demonstrated by means of our implemented IB prototype. Our solution, based on a feedback control and optimization loop, enables the lossless HPC network to adapt dynamically to varying traffic patterns, current resource availability, and workload distributions, in accordance with service provider-defined policies. Furthermore, we present IBAdapt, a simplified rule-based language for service providers to specify the adaptation strategies used by the framework. Our self-adaptive IB network prototype is demonstrated using state-of-the-art industry software. The results obtained on a test cluster demonstrate the feasibility and effectiveness of the framework in improving Quality-of-Service compliance in HPC clouds. | nb_NO |
dc.language.iso | eng | nb_NO |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | nb_NO |
dc.title | A Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementation | nb_NO |
dc.type | Journal article | nb_NO |
dc.type | Peer reviewed | nb_NO |
dc.description.version | acceptedVersion | nb_NO |
dc.source.pagenumber | 2658-2671 | nb_NO |
dc.source.volume | 29 | nb_NO |
dc.source.journal | IEEE Transactions on Parallel and Distributed Systems | nb_NO |
dc.source.issue | 12 | nb_NO |
dc.identifier.doi | 10.1109/TPDS.2018.2842224 | |
dc.identifier.cristin | 1641678 | |
dc.description.localcode | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | nb_NO |
cristin.unitcode | 194,63,30,0 | |
cristin.unitname | Institutt for informasjonssikkerhet og kommunikasjonsteknologi | |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
cristin.qualitycode | 2 | |