Skip to main content

Training Cluster

TIR Training Cluster is a dedicated GPU compute environment for distributed AI model training. Powered by Slurm-native scheduling, it gives you fixed-price node allocations, elastic scaling, high availability, and the flexibility to run pre-built framework images or any custom container via Enroot — with full job visibility from the TIR dashboard and no per-job charges.

Slurm-Native SchedulingUbuntu Slurm ImagesNeMo (Megatron-Bridge) ImagesFixed PricingElastic Scaling

Quick Start

Explore Training Cluster

API Reference

REST API

</>Training Cluster API Reference

Programmatically create, manage, and monitor TIR Training Clusters. Automate cluster provisioning, scale nodes, and control the cluster lifecycle via REST.

Explore REST APIs
Authentication & Endpoints
Request and Response Schemas
Open API Reference →
tir.e2enetworks.com / api / v1
GET/projects/{id}/distributed_jobs_v2/cluster/plans/List available cluster plans
GET/projects/{id}/distributed_jobs_v2/cluster/List training clusters
POST/projects/{id}/distributed_jobs_v2/cluster/Create a training cluster
GET/projects/{id}/distributed_jobs_v2/cluster/{id}/Get cluster details
PUT/projects/{id}/distributed_jobs_v2/cluster/{id}/Perform a cluster action
DELETE/projects/{id}/distributed_jobs_v2/cluster/{id}/Delete a training cluster

Billing & Plans

Billing & Credits

Training Clusters are billed at a fixed rate based on your cluster plan. Pricing does not vary with resource utilization, and all jobs running on the cluster incur no additional charges.

View Billing Docs →

Fixed-rate billing

Billed per hour based on your cluster plan — cost does not change with GPU utilization.

No per-job charges

All Slurm jobs running on the cluster are included at no extra cost.

On-Demand or Committed

Start with On-Demand hourly pricing and convert to a Committed plan when ready.