Authors | B. Bogdanski, B. D. Johnsen, S. Reinemo and F. O. Sem-Jacobsen |
Editors | H. Shen, Y. Sang, Y. Li, D. Qian and A. Y. Zomaya |
Title | Discovery and Routing of Degraded Fat-Trees |
Afilliation | Communication Systems, Communication Systems |
Status | Published |
Publication Type | Proceedings, refereed |
Year of Publication | 2012 |
Conference Name | 2012 13th International Conference on Parallel and Distributed Computing, Applications and Technologies |
Pagination | 689-694 |
Date Published | December |
Publisher | IEEE Computer Society |
Place Published | Los Alamitos |
Keywords | Conference |
Abstract | The fat-tree topology has become a popular choice for InfiniBand enterprise systems due to its deadlock freedom, fault-tolerance and full bisection bandwidth. In the HPC domain, InfiniBand fabric is used in almost 42% of the systems on the latest Top 500 list, and many of those systems are based on the fat-tree topology. Despite the popularity of the fat-tree topology, little research has been done to compare the behavior of InfiniBand routing algorithms on degraded fat-tree topologies. In this paper, we identify the weaknesses of the current fat-tree routing and propose enhancements that liberalize the restrictions imposed on the routed fabric. Furthermore, we present a thorough analysis of non-proprietary routing algorithms that are implemented in the InfiniBand Open Subnet Manager. Our results show that even though the performance of a fat-tree routed network deteriorates predictably with the number of failed links, fat-tree routing algorithm is still the best choice for severely degraded fat-tree fabrics. |
Citation Key | Simula.simula.1554 |