Overview

Poolside officially tests and supports a defined set of deployment configurations for enterprise environments. Use this page to compare supported deployment paths, review the minimum platform, compute, database, and inference requirements, and find the configuration that best matches your infrastructure.
Standard technical support and SLA commitments cover the deployment paths documented on this page. For custom hardening, alternative software versions, or other variations, your Poolside account team can provide best-effort support. Contact your account team to discuss your environment requirements.

Platform requirements

For architecture, installation, and operational guidance for supported cloud deployment paths, see Cloud deployment. For on-premises configurations, see On-premises deployment.
If you already run the Terraform-based AWS deployment bundle, see Amazon EKS with Terraform (legacy) and the migration guide.
Deployment | Description
Amazon EKS | Amazon Elastic Kubernetes Service (EKS) environments
OpenShift 4.16+ | Red Hat OpenShift environments
Kubernetes 1.29+ | Self-managed Kubernetes environments such as RKE2 or Charmed Kubernetes
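
As a rough illustration, the minimum versions listed above can be checked with a short script. This is a minimal sketch, not a Poolside tool: `MINIMUM_VERSIONS`, `parse_major_minor`, and `meets_minimum` are hypothetical names, and the sketch assumes version strings of the form `major.minor[.patch]`, optionally prefixed with `v`. Amazon EKS is omitted because no minimum version is listed for it.

```python
# Hypothetical helper, not part of any Poolside tooling.
# Minimum versions are taken from the platform requirements listed above.
MINIMUM_VERSIONS = {
    "openshift": (4, 16),
    "kubernetes": (1, 29),
}


def parse_major_minor(version: str) -> tuple[int, int]:
    """Parse a string such as 'v1.29.4' or '4.16' into (major, minor)."""
    parts = version.strip().lstrip("v").split(".")
    return int(parts[0]), int(parts[1])


def meets_minimum(platform: str, version: str) -> bool:
    """Return True if the version is at or above the documented minimum."""
    return parse_major_minor(version) >= MINIMUM_VERSIONS[platform]


if __name__ == "__main__":
    print(meets_minimum("kubernetes", "v1.30.2"))  # True
    print(meets_minimum("openshift", "4.9"))       # False
```

Comparing integer tuples rather than raw strings matters here: as strings, "4.9" sorts after "4.16", which would wrongly accept an older OpenShift release.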

Compute requirements

The following minimum requirements apply to the Poolside Platform plane. These figures exclude model inference and remote execution sandboxes, which you size separately based on your workload.
Resource | Minimum
CPU | 24 cores
Memory | 64 GB
Storage | 200 GB

Database requirements

You provision and manage the PostgreSQL database. Poolside does not provision it on your behalf. The following sizing is a recommended starting point:
Resource | Recommended
CPU | 4 cores
Memory | 16 GB
Storage | 100 GB
If your environment supports storage autoscaling, enable it. Without autoscaling, provision at least 100 GB to account for trajectory data growth.

Inference requirements

Poolside models have different minimum requirements. Use this table to size your inference nodes.
Model | Quantization | Minimum GPU memory | Minimum CPU | Minimum host memory
Laguna M.1 | FP8 | 384 GB | 128 cores | 1 TB
Laguna XS.2 | FP8 | 96 GB | 44 cores | 512 GB
Malibu 2.2 | FP8 | 192 GB | 128 cores | 1 TB
Malibu 2.2 | INT4 | 96 GB | 44 cores | 512 GB
Point | FP8 | 96 GB | 44 cores | 512 GB
For on-premises deployments, H200-based nodes require 10 TB of NVMe scratch storage and RTX6000-based nodes require 4 TB of NVMe storage. For cloud deployments, storage requirements depend on whether S3-compatible storage is colocated on the inference node.
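
To illustrate how the inference minimums might be used during sizing, the following sketch lists which model and quantization pairs a given node could host. It is a minimal example under stated assumptions: `InferenceMinimum`, `INFERENCE_MINIMUMS`, and `models_that_fit` are hypothetical names, 1 TB is treated as 1024 GB, and storage is not checked.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InferenceMinimum:
    model: str
    quantization: str
    gpu_memory_gb: int
    cpu_cores: int
    host_memory_gb: int


# Values copied from the inference requirements above.
# Assumption: 1 TB of host memory is treated as 1024 GB.
INFERENCE_MINIMUMS = [
    InferenceMinimum("Laguna M.1", "FP8", 384, 128, 1024),
    InferenceMinimum("Laguna XS.2", "FP8", 96, 44, 512),
    InferenceMinimum("Malibu 2.2", "FP8", 192, 128, 1024),
    InferenceMinimum("Malibu 2.2", "INT4", 96, 44, 512),
    InferenceMinimum("Point", "FP8", 96, 44, 512),
]


def models_that_fit(gpu_memory_gb: int, cpu_cores: int, host_memory_gb: int):
    """Return (model, quantization) pairs whose minimums the node meets."""
    return [
        (m.model, m.quantization)
        for m in INFERENCE_MINIMUMS
        if gpu_memory_gb >= m.gpu_memory_gb
        and cpu_cores >= m.cpu_cores
        and host_memory_gb >= m.host_memory_gb
    ]


if __name__ == "__main__":
    # Example node: 192 GB total GPU memory, 64 cores, 512 GB host memory.
    for model, quantization in models_that_fit(192, 64, 512):
        print(model, quantization)
```

In this example the node has enough GPU memory for Malibu 2.2 at FP8 but not enough CPU cores or host memory, so only the 96 GB configurations are returned.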

Certified GPUs

All models are continuously certified on the following NVIDIA GPU types: RTX6000, H100, H200, GB200, GB300.

Standard on-premises configurations

For the certified single-node on-premises software stack and version inventory, see Certified stacks.
Deployment | Recommended scale | GPU configuration | Host OS
BYO hardware | Large enterprise teams | 8× NVIDIA H200 (HGX) (4× H200 with validation) | Ubuntu 22.04 LTS, Ubuntu 24.04 LTS, or RHEL 9.6
Turnkey HGX rack | Large enterprise teams | 8× NVIDIA H200 (HGX) | Ubuntu 22.04 LTS, Ubuntu 24.04 LTS, or RHEL 9.6
GPU workstation tower | Small teams and individual groups | 4× NVIDIA RTX 6000 | Ubuntu 22.04 LTS, Ubuntu 24.04 LTS, or RHEL 9.6
GPU workstation rack | Mid-sized teams | 8× NVIDIA RTX 6000 | Ubuntu 22.04 LTS, Ubuntu 24.04 LTS, or RHEL 9.6
For on-premises hardware deployments:
  • Inference and platform services run on the same node. The node must meet both the inference requirements above and the platform compute requirements.
  • Multi-node inference is supported across a single Kubernetes cluster, but it does not provide high availability. It distributes independent inference replicas across nodes but does not support cross-node tensor parallelism.
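
Because inference and platform services share a node in these configurations, it can help to work out a combined CPU and memory target. The sketch below is illustrative only. This page does not state whether the two sets of minimums are additive; the example assumes they are, which is the conservative reading. `combined_node_minimum` is a hypothetical helper, and storage is left out because the platform and inference storage figures describe different kinds of storage.

```python
# Platform plane minimums from "Compute requirements".
PLATFORM_CPU_CORES = 24
PLATFORM_MEMORY_GB = 64


def combined_node_minimum(inference_cpu_cores: int, inference_host_memory_gb: int) -> dict:
    """Sum inference and platform minimums for a node that hosts both.

    Assumption: the requirements are additive. Confirm the actual sizing
    for your environment with your Poolside account team.
    """
    return {
        "cpu_cores": inference_cpu_cores + PLATFORM_CPU_CORES,
        "memory_gb": inference_host_memory_gb + PLATFORM_MEMORY_GB,
    }


if __name__ == "__main__":
    # Laguna XS.2 (FP8): 44 cores and 512 GB host memory.
    print(combined_node_minimum(44, 512))
    # {'cpu_cores': 68, 'memory_gb': 576}
```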
If you deploy on Red Hat Enterprise Linux (RHEL), pin the host release to RHEL 9.6 before you run package updates. RHEL can upgrade the host to a newer minor release when new updates become available. Prevent that upgrade because NVIDIA GPU Operator driver containers are pinned to specific RHEL releases. For the required commands, see Install Poolside on-premises.

Support and compatibility

For questions about integration with specific enterprise tooling or deployment workflows, contact a Poolside representative.