Faults and Alerts

Viewing faults and alerts in the DuploCloud Portal

Faults that happen in the system, be it Infrastructure creation, container deployments, Application health checks, or any Triggered Alarms can be tracked in the DuploCloud portal under Faults Menu.

Viewing Faults

You can look at Tenant-specific faults under Observability -> Faults or all the faults in the system under Administrator -> Faults.

The Faults page showing faults for the DEV01 Tenant

Configuring Tenant Fault Settings

To configure faults for a Tenant, navigate to Administrator -> Tenants and select the Tenant from the NAME column. In the Settings tab, click Add. Select or enter the appropriate feature on the Add Tenant Feature pane.

DuploCloud provides tenant-level settings that control fault behavior across different workloads and services. Some settings specifically affect Kubernetes pods and jobs, while others apply to the Tenant more generally.

For instructions on adding or updating these settings, see the Tenant Settings page for your cloud provider:

Tenant Fault Settings

Tenant Setting
Description

raise_fault_on_last_state_pod_failure_reasons

Generates a fault when a Kubernetes pod is terminated with a specified failure reason. Enter the failure reasons you want to monitor, e.g., OOMKilled. Multiple reasons can be comma-separated.

enable_k8s_job_fault_logging

Generates faults for Kubernetes Job failures by default when set to True.

tenant_instances_stopped

Mutes faults for Tenants that have been stopped when set to True.

Creating Alerts

Alerts in DuploCloud are created for individual resources and their metrics. To create an alert:

  1. Navigate to the resource type you want to monitor:

    • For Kubernetes services: Kubernetes Services

    • For RDS databases: Cloud Services Databases RDS

    • For other resources, follow a similar pattern

  2. Select the resource from the NAME column to open its details page.

  3. Click the Alerts tab.

  4. Click Add. The Create Alert pane displays.

    Create Alert pane
  5. Enter the desired threshold, conditions, and notification options.

  6. Click Create to save the alert.

Viewing Alerts

General Alerts

General alerts show all alerts across your account and all resources, providing a high-level view of system-wide issues.

  1. Navigate to Observability -> Alerts.

General Alerts page under Observability in the DuploCloud Portal

Resource-Specific Alerts

Resource-specific alerts display only the alerts associated with a particular resource, such as a Service, Host, or Database, allowing you to focus on issues affecting that specific component.

  1. Navigate to the resource’s details page (see the steps in Creating Alerts above).

  2. Select the Alerts tab.

Alerts tab under Cloud Services -> Hosts in the DuploCloud Portal

Last updated

Was this helpful?