Alert Management
Alert Management enables proactive monitoring of cloud infrastructure by triggering notifications when resource metrics meet defined conditions. Alerts can be configured for Instance, Inference, and Training Cluster services, with support for Email, Slack, and Webhook notification channels.
Quick Start
Set Up a Monitoring Integration
Configure email, Slack, or Webhook destinations so your team gets notified when an alert fires.
Create an Alert
Define a metric threshold rule for Instance, Inference, or Training Cluster services.
Attach Alert to a Resource
Bind a configured alert to a running resource and start receiving live notifications.
Configure Notification Channels
Deliver alerts to Slack or any HTTP endpoint using built-in webhook integration.
Explore Alert Management
How It Works
Concepts, prerequisites & metrics
Monitoring Integrations
Email, Slack & Webhook destinations
Alert Configuration
Create, manage & attach alerts
Notification Channels
Slack & Webhook setup guides
Troubleshooting
Common issues & resolutions