In this video from the 2016 Stanford HPC Conference, Christian Kniep from Gaikai presents: Multi-Host Containerized Clusters – Using Docker Networking To Spin-Up SLURM.
“With Docker v1.9 a new networking system was introduced, which allows multi-host networking to work out-of-the-box in any Docker environment. This talk provides an introduction on what Docker networking provides, followed by a demo that spins up a full SLURM cluster across multiple machines. The demo is based on QNIBTerminal, a Consul backed set of Docker Images to spin up a broad set of software stacks.”
Christian Kniep has gained his System Operations experience working in the R&D departments of German automotive manufactures. He was operating a 3000-node InfiniBand interconnected cluster. This position led to his B.Sc. thesis, in which he proposed an InfiniBand Monitoring Suite named QNIB (Qualified Networking with InfiniBand) which aimed to provide a better understanding of Interconnect fabrics. This single-layer approach grew during the last couple of years to form QNIBTerminal, a holistic workbench to test-drive different HPC stacks using Docker containerization. Since 2015 Christian works at Gaikai Inc., a Sony company, as a Release Engineer looking into new technologies like Docker and other infrastructure related aspects.