AuthorsE. Tasoulas, E. G. Gran, T. Skeie and B. D. Johnsen
EditorsA. Pellegrini, A. Gkoulalas-Divanis, P. Di Sanzo and D. R. Avresky
TitleFast Hybrid Network Reconfiguration for Large-Scale Lossless Interconnection Networks
AfilliationCommunication Systems
Project(s)ERAC: Efficient and Robust Architecture for the Big Data Cloud
Publication TypeProceedings, refereed
Year of Publication2016
Conference Name15th IEEE International Symposium on Network Computing and Applications (NCA 2016)
KeywordsCloud computing, Fat-Tree, HPC, InfiniBand, Network Reconfiguration, scalability

Reconfiguration of high performance lossless interconnection networks is a cumbersome and time-consuming task. For that reason reconfiguration in large networks are typically limited to situations where it is absolutely necessary, for instance when severe faults occur. On the contrary, due to the shared and dynamic nature of modern cloud infrastructures, performance-driven reconfigurations are necessary to ensure efficient utilization of resources. In this work we present a scheme that allows for fast reconfigurations by limiting the task to sub-parts of the network that can benefit from a local reconfiguration. Moreover, our method is able to use different routing algorithms for different sub-parts within the same subnet. We also present a Fat-Tree routing algorithm that reconfigures a network given a user-provided node ordering. Hardware experiments and large scale simulation results show that we are able to significantly reduce reconfiguration times from 50% to as much as 98.7% for very large topologies, while improving performance.

Citation Key24836