Choose a GPU Card
Use this guide when the create-node flow asks you to pick a GPU card and plan. The portal is always the final source for currently available GPUs, plans, locations, and prices — use this page to understand what each card is for before you select one.
A GPU card is not the same thing as a plan. The card describes the NVIDIA accelerator (architecture, memory, fabric), while the plan describes the exact vCPU, RAM, GPU memory, CUDA version, disk space, IOPS, and number of GPU cards bundled into the node. The portal shows plan details for the selected card in a format similar to:
<vCPU> – <RAM> – <GPU Memory> – <CUDA (Version)> – <Disk Space> – <IOPS (R/W)>
GPU availability is region-gated. The GPU tab is hidden for some regions and accounts. If you do not see the card you want, switch regions in the location selector at the top of MyAccount, or contact cloud-platform@e2enetworks.com.
Quick Comparison
| Card | Memory | Architecture | Best fit | Avoid when |
|---|---|---|---|---|
| NVIDIA H100 | 80 GB HBM3 | Hopper | LLM training and inference, FP8, Transformer Engine. | You do not need FP8 or Hopper Transformer Engine. |
| NVIDIA A100 80GB | 80 GB HBM2e | Ampere | Multi-tenant LLM inference, training, HPC. | You can fit your model in 40 GB and want a cheaper plan. |
| NVIDIA A100 40GB | 40 GB HBM2 | Ampere | Mainstream LLM training and inference. | The model state exceeds 40 GB without sharding. |
| NVIDIA L40S | 48 GB GDDR6 | Ada Lovelace | Cost-efficient inference, generative AI, video, fine-tuning. | Largest-model training across many cards. |
| NVIDIA A40 | 48 GB GDDR6 | Ampere (RTX) | Visualization, rendering, mid-tier inference and training. | Pure datacenter Tensor Core workloads where L40S/A100 fit the budget. |
| NVIDIA A30 | 24 GB HBM2 | Ampere | Mid-range AI inference, classical ML. | Large models that do not fit in 24 GB. |
| NVIDIA L4 | 24 GB GDDR6 | Ada Lovelace | Low-power inference, video transcoding, generative AI at scale. | Training workloads or anything memory-bound above 24 GB. |
| NVIDIA V100 | 16 / 32 GB HBM2 | Volta | Legacy compatibility with Volta-tuned pipelines. | You are starting a new project. Prefer Ampere, Ada, or Hopper. |
| NVIDIA T4 | 16 GB GDDR6 | Turing | Cost-sensitive inference and lab work. | You need Tensor Float (TF32), FP8, or modern Tensor Core throughput. |
Do not rely on documentation alone. Always check the live MyAccount portal for the exact set of GPU cards available in your region.
How to Choose
Start with the workload bottleneck:
| If your main concern is | Start with |
|---|---|
| Mixed-precision training and FP8 inference | H100 |
| Large-memory LLM inference | A100 80GB |
| Mid-tier training and inference | A100 40GB or L40S |
| Cost-efficient generative AI inference and fine-tuning | L40S |
| Visualization, rendering, OpenGL/Vulkan | A40 |
| Classical ML, smaller AI inference | A30 |
| Low-power inference and video transcoding at scale | L4 |
| Legacy CUDA pipelines that target Volta or Turing | V100 or T4 |
Then check that the GPU memory column is large enough for your model state, batch size, and KV cache. Most "out of memory" issues at run time start with a card chosen on price rather than memory.
For LLM inference, target a card whose memory comfortably fits the model weights plus the KV cache for your maximum context length and concurrency. Sharding across multiple cards costs latency.
GPU Card Profiles
NVIDIA H100
Hopper, 80 GB HBM3, FP8 Transformer Engine, fourth-generation NVLink. The current mainstream training and inference card.
- Use it for: training 7B–70B LLMs, FP8 inference, fine-tuning, scientific computing.
- Reference: NVIDIA H100.
NVIDIA A100 80GB and A100 40GB
Ampere, third-generation Tensor Cores, TF32 / BF16 / FP16 / INT8 / FP64 precisions. The 80GB variant uses HBM2e with up to ~2.0 TB/s of memory bandwidth; the 40GB variant uses HBM2 at ~1.55 TB/s.
- Use them for: LLM training and inference, HPC, recommender systems.
- Reference: NVIDIA A100 80GB.
- Reference: NVIDIA A100 40GB.
NVIDIA L40S
Ada Lovelace, 48 GB GDDR6, fourth-generation Tensor Cores, third-generation RT Cores. A strong cost-per-inference card with generative-AI features.
- Use it for: stable diffusion, fine-tuning, mid-tier LLM inference, video AI, virtual workstations.
- Reference: NVIDIA L40S.
NVIDIA A40
Ampere RTX, 48 GB GDDR6. Designed for professional visualization and mixed AI workloads.
- Use it for: rendering, 3D, VDI with NVIDIA RTX vWS, mid-tier inference.
- Reference: NVIDIA A40.
NVIDIA A30
Ampere, 24 GB HBM2. Lower-cost datacenter card.
- Use it for: classical ML, recommendation, smaller-model inference.
- Reference: NVIDIA A30.
NVIDIA L4
Ada Lovelace, 24 GB GDDR6, low-profile, low-power (72 W TDP class). Designed for inference and video AI at scale.
- Use it for: high-volume small-model inference, video transcoding, generative AI on a budget.
- Reference: NVIDIA L4.
NVIDIA V100
Volta, 16 GB or 32 GB HBM2. Legacy datacenter card. Choose it only for compatibility with Volta-tuned pipelines.
- Use it for: legacy compatibility with Volta-tuned pipelines.
- Reference: NVIDIA V100.
NVIDIA T4
Turing, 16 GB GDDR6, 70 W. Legacy inference card. Use it for cost-sensitive inference or lab work, but prefer L4 or L40S for any new project.
- Use it for: cost-sensitive inference and lab work.
- Reference: NVIDIA T4.
Pricing and Billing
GPU plans are billed per minute. The portal Summary panel shows the live price for the selected plan, region, and billing mode at launch. Committed plans are explicitly confirmed; on-demand plans show the hourly rate and a monthly estimate.
Do not rely on static documentation for prices or savings percentages. Use the live portal Summary, API response, or approved pricing page for the current amount. Pricing is also gated by region.
For a side-by-side billing view, see the E2E GPU pricing page and the committed billing notes.
Related Resources
| Category | Resource | Use it for |
|---|---|---|
| Launch | Create a GPU node | Full step-by-step GPU launch flow. |
| Connect | Connect to a Linux GPU node | SSH access and nvidia-smi verification. |
| Managed AI | TIR AI/ML Platform | Launch notebooks, endpoints, and training jobs without managing a GPU node. |