Visible to the public Towards Tightly-coupled Datacenter with Free-space Optical Links

TitleTowards Tightly-coupled Datacenter with Free-space Optical Links
Publication TypeConference Paper
Year of Publication2017
AuthorsHu, Yao, Hara, Hiroaki, Fujiwara, Ikki, Matsutani, Hiroki, Amano, Hideharu, Koibuchi, Michihiro
Conference NameProceedings of the 2017 International Conference on Cloud and Big Data Computing
Date PublishedSeptember 2017
PublisherACM
Conference LocationNew York, NY, USA
ISBN Number978-1-4503-5343-4
Keywordsclean slate, Collaboration, datacenter, free-space optics, Human Behavior, human factor, human factors, interconnection network, job scheduling, Metrics, policy governance, Policy-Governed Secure Collaboration, pubcrawl, Rackscale architecture, resilience, Resiliency
Abstract

Clean slate design of computing system is an emerging topic for continuing growth of warehouse-scale computers. A famous custom design is rackscale (RS) computing by considering a single rack as a computer that consists of a number of processors, storages and accelerators customized to a target application. In RS, each user is expected to occupy a single or more than one rack. However, new users frequently appear and the users often change their application scales and parameters that would require different numbers of processors, storages and accelerators in a rack. The reconfiguration of interconnection networks on their components is potentially needed to support the above demand in RS. In this context, we propose the inter-rackscale (IRS) architecture that disaggregates various hardware resources into different racks according to their own areas. The heart of IRS is to use free-space optics (FSO) for tightly-coupled connections between processors, storages and GPUs distributed in different racks, by swapping endpoints of FSO links to change network topologies. Through a large IRS system simulation, we show that by utilizing FSO links for interconnection between racks, the FSO-equipped IRS architecture can provide comparable communication latency between heterogeneous resources to that of the counterpart RS architecture. A utilization of 3 FSO terminals per rack can improve at least 87.34% of inter-CPU/SSD(GPU) communication over Fat-tree and improve at least 92.18% of that over 2-D Torus. We verify the advantages of IRS over RS in job scheduling performance.

URLhttp://doi.acm.org/10.1145/3141128.3141130
DOI10.1145/3141128.3141130
Citation Keyhu_towards_2017