SPOTINST ELASTIGROUP & HIGH PERFORMANCE COMPUTING
Boost your HPC performance while reducing costs by 80%
High-Performance Computing (HPC) allows scientists and engineers to solve complex, compute-intensive problems. HPC applications often require high network performance, fast storage, large amounts of memory, very high compute capabilities or all of these.
Spotinst enables customers to increase the speed of research and reduce time-to-results by running or bursting HPC to the cloud. Spotinst helps to reduce infrastructure costs by up to 80% less using CPU, GPU, and FPGA servers from cloud excess capacity (aka “Spot Instances”), and reducing the overhead of managing the infrastructure, saving scientists and engineers time to focus on what’s next.
Spotinst Elastigroup deploys a controller (agent) that monitors HPC schedulers’ queues. You can configure Elastigroup to monitor specific Job’s Queue, and Elastigroup finds jobs that are pending for execution, it calculates the number of resources that these jobs need and launches the required compute to run these jobs.
As you submit Jobs and Tasks Elastigroup is learning what resources the tasks need and how long they will take. Elastigroup uses this information to densely pack tasks, without oversubscribing, to ensure fast and cost-effective runs. By combining task awareness (time & resources) with Spot Instances prediction of price & availability, Elastigroup schedules tasks where they won’t be interrupted.
Simplify management & orchestration using Elastigroup – both provisioning & autoscaling of Cloud infrastructure
Elasitgroup uses its proven prediction algorithm to match Spot Instances that can run for the desired amount of time for the required Task or job.
By configuring heterogeneous clusters and mixing nodes of multiple instance types, Elastigroup delivers maximum efficiency to run the relevant job on the right machine type to achieve greater data processing performance.
Michael Waltz | Principal DevOps Engineer, Demandbase
Elastigroup automates the Instance lifecycle and provides a combination of EC2 Spot, Existing RI reservations and On-Demand based on EC2 Spot capacity, availability & pricing trends
By configuring heterogeneous clusters and mixing nodes of multiple instance types, Elastigroup delivers maximum efficiency to run the relevant job on the right machine type to achieve greater data processing performance.
Elastigroup balances the required performance, cost, and SLA requirements when launching, scaling and terminating EC2 Spot Instances. By combining Elastigroup’s proven prediction algorithm for EC2 Spot availability.
Elastigroup drains nodes, approximately 15 minutes before a Spot termination notification arrives from the cloud provider, in order to gracefully terminate existing Tasks and prevent from scheduling new jobs on these nodes.