Training Cluster
A Training Cluster is a dedicated environment with a predefined allocation of RAM, CPU, and GPU resources. Pricing is fixed: you pay for the allocated resources regardless of how much of them you actually use. Creating deployments within the Training Cluster incurs no extra charges.
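As a rough illustration of what fixed pricing means in practice, the sketch below computes the cost of a cluster over a period and the effective cost per hour of capacity actually used. The rate, duration, and function names are hypothetical values chosen for the example, not actual plan prices or part of any platform API.

```python
def fixed_cluster_cost(hourly_rate: float, hours: float) -> float:
    """Cost of a fixed-price cluster: rate x time, independent of utilization."""
    return hourly_rate * hours


def effective_cost_per_utilized_hour(hourly_rate: float, utilization: float) -> float:
    """Cost per hour of capacity actually used, for utilization in (0, 1]."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in the range (0, 1]")
    return hourly_rate / utilization


# Hypothetical figures for illustration only; real rates depend on the plan.
HYPOTHETICAL_HOURLY_RATE = 100.0
HOURS = 24

# The bill is the same whether the cluster is idle or fully loaded.
print(fixed_cluster_cost(HYPOTHETICAL_HOURLY_RATE, HOURS))  # 2400.0

# At 25% utilization, each utilized hour effectively costs four times the rate.
print(effective_cost_per_utilized_hour(HYPOTHETICAL_HOURLY_RATE, 0.25))  # 400.0
```

Because the bill does not change with load, higher utilization lowers the effective cost of each unit of work run on the cluster.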
Create Training Cluster
- To create a new Training Cluster, click the CREATE CLUSTER button under Training Cluster.
- Enter the Cluster Name, then set the Cluster Configuration by choosing a machine type with the required CPU and GPU resources and an available plan.
- On this page, you can view the details of the selected plan. Depending on whether you choose an Hourly-Based Plan or a Committed Plan, the summary section will display the corresponding details and associated costs.
Manage Training Cluster
Overview
You can view the details of the selected Training Cluster, including the Cluster Name, Number of Nodes, Plan Name, and the Cluster Node Configuration, which shows the number of GPUs and CPUs and the amount of RAM allocated within the cluster.
Monitoring
You can view the Disk Usage and Memory Usage for the selected Node within the Training Cluster. The following metrics are also available: GPU Utilization, GPU Temperature, CPU Utilization, Memory Utilization, Disk Total Read Bytes, and Disk Total Write Bytes.
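To cross-check the disk and memory figures from inside a node, a minimal sketch using only the Python standard library might look like the following. This is an assumption-laden illustration, not how the platform collects its metrics: the function names are hypothetical, and the memory calculation assumes a Linux-style `/proc/meminfo` layout, shown here with a sample string so the example runs anywhere.

```python
import shutil


def disk_usage_percent(path: str = ".") -> float:
    """Percentage of the filesystem containing `path` that is in use."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def memory_usage_percent(meminfo_text: str) -> float:
    """Memory in use as a percentage, from /proc/meminfo-style text.

    Computed as (MemTotal - MemAvailable) / MemTotal.
    """
    fields = {}
    for line in meminfo_text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts:
            fields[key.strip()] = int(parts[0])  # values are reported in kB
    total = fields["MemTotal"]
    available = fields["MemAvailable"]
    return 100.0 * (total - available) / total


# Sample text standing in for the contents of /proc/meminfo (hypothetical values).
SAMPLE_MEMINFO = "MemTotal:       16000000 kB\nMemAvailable:    4000000 kB\n"

print(memory_usage_percent(SAMPLE_MEMINFO))  # 75.0
print(disk_usage_percent("."))  # varies by machine
```

On a Linux node, the sample string can be replaced with the text read from `/proc/meminfo`. The values may differ slightly from the dashboard, since the platform's exact definitions of these metrics are not documented here.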