The OpenFabrics Alliance (OFA) is committed to accelerating the development of high performance fabrics. The annual OFA Workshop, located this year
in Boulder, CO, is a premier means of fostering collaboration among those who develop fabrics, deploy fabrics, and create applications that rely on fabrics. It is the only event of its kind where fabric developers and users can discuss emerging fabric technologies, collaborate on future industry requirements, and address problems that exist today.
[SPONSORED GUEST CONTENT] Scheduled Ethernet is emerging as a viable alternative to InfiniBand for AI networking. Why? Because of its ability to offer comparable performance but with greater flexibility and cost-effectiveness.
When building large-scale AI GPU clusters for training or inference, the backend network should be high-performance, lossless, and predictable to ensure maximum GPU utilization. This is hard to achieve when using Ethernet for the back-end network. This guide showcases a high-level reference design for an 8,192 GPU cluster, describing how it can be achieved with […]