Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Exploring the benefits of using co-packaged optics in data center and AI supercomputer networks: a simulation-based analysis [Invited]

Not Accessible

Your library or personal account may give you access

Abstract

We investigate the advantages of using co-packaged optics in next-generation data center and AI supercomputer networks. The increased escape bandwidth offered by co-packaged optics provides multiple possibilities for building 50T switches and beyond, expanding the opportunities in both the data center and supercomputing domains. This provides network architects with the opportunity to expand their design space and develop simplified networks with enhanced network locality properties. Co-packaging at the switch and server points enables networks with double capacity while reducing the switch count by 64% compared to state-of-the-art systems. We evaluate these concepts through discrete-event simulations using all-to-all and all-reduce traffic patterns that simulate collective communications commonly found in network-bound applications. Initially, we investigate the all-to-all overhead involved in distributing the virtual machines of the applications across multiple leaf switches and compare it to the scenario in which all VMs are placed under a single switch. Subsequently, we evaluate the performance of an AI supercomputing cluster by simulating both patterns for different message sizes, while also varying the number of participating nodes. The results suggest that networks with improved locality properties become increasingly important as the network stack operates at higher speeds; for a stack latency of 1.25 µs, placing the applications under multiple switches can result in up to 68% higher completion times than placing them under a single switch. For AI supercomputers, significant improvements are observed in the mean server throughput, reaching more than 90% for configurations involving 256 nodes and message sizes of at least 128 KiB.

© 2024 Optica Publishing Group

Full Article  |  PDF Article
More Like This
Toward higher-radix switches with co-packaged optics for improved network locality in data center and HPC networks [Invited]

Pavlos Maniotis, Laurent Schares, Daniel M. Kuchta, and Bengi Karacali
J. Opt. Commun. Netw. 14(6) C1-C10 (2022)

Optics enabled networks and architectures for data center cost and power efficiency [Invited]

Marc Taubenblatt, Pavlos Maniotis, and Asser Tantawi
J. Opt. Commun. Netw. 14(1) A41-A49 (2022)

Toward lower-diameter large-scale HPC and data center networks with co-packaged optics

Pavlos Maniotis, Laurent Schares, Benjamin G. Lee, Marc A. Taubenblatt, and Daniel M. Kuchta
J. Opt. Commun. Netw. 13(1) A67-A77 (2021)

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (9)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (2)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.