mpi4cloud

mpi4cloud is a service for running MPI workloads on Azure, AWS, and GCP using low-cost, unreliable VMs (spot/preemptible instances). It also makes it simple to monitor the resource usage and cost of your jobs.

mpi4cloud is not a fully managed service: everything runs in your own cloud provider account.

The components that mpi4cloud provides are:

  • a CLI for interacting with clusters
  • a web application for tracking and predicting your spending, monitoring resource usage and logs, and controlling clusters
  • a Python library with utilities for managing unreliable workloads
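
To give a feel for what managing an unreliable workload involves, here is a minimal checkpoint-and-resume sketch in plain Python. It does not use the mpi4cloud library and says nothing about its actual API. The names save_checkpoint, load_checkpoint, and run, the JSON checkpoint format, and the stop_after parameter (which simulates a preemption) are all assumptions made for illustration.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Write state atomically so a preemption mid-write cannot corrupt it."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    # os.replace is an atomic rename when source and target share a filesystem.
    os.replace(tmp_path, path)


def load_checkpoint(path):
    """Return the saved state, or a fresh state if no checkpoint exists."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)
    return {"step": 0, "total": 0}


def run(path, n_steps, checkpoint_every=10, stop_after=None):
    """Sum the integers 0..n_steps-1, resuming from the last checkpoint.

    stop_after (hypothetical, for demonstration only) simulates a preemption:
    the call returns None after that many steps without saving again.
    """
    state = load_checkpoint(path)
    done_this_run = 0
    while state["step"] < n_steps:
        if stop_after is not None and done_this_run >= stop_after:
            return None  # work since the last checkpoint is lost
        state["total"] += state["step"]
        state["step"] += 1
        done_this_run += 1
        if state["step"] % checkpoint_every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)
    return state["total"]
```

Calling run again with the same checkpoint path after an interrupted call picks up from the last saved step, so only the work done since that checkpoint is repeated. A real MPI job would also need to coordinate checkpoints across ranks, which this single-process sketch leaves out.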