apigee-prometheus-proxy
apigee-prometheus-agg
apigee-stackdriver-exporter
Resource types | k8s_container |
Metric | kubernetes.io/container/restart_count |
Filter By | namespace_name = apigee and container_name =~ .*cassandra.* |
Group By | cluster_name, namespace_name, pod_name, container_name, and all k8s_container resource type labels |
Aggregator | sum |
Alert consideration | If a container restarts frequently, investigate the root cause. A container can restart for several reasons, including OOMKilled, a full data disk, and configuration issues. |
Alert threshold | Depends on the SLO for the installation. For example, in production, trigger an event notification if a container restarts more than 5 times within 30 minutes. |
Cloud Monitoring dashboard MQL query:
fetch k8s_container
| metric 'kubernetes.io/container/restart_count'
| filter (resource.container_name =~ '.*cassandra.*' && resource.namespace_name == 'apigee')
| align rate(1m)
| every 1m
| group_by [resource.cluster_name, resource.namespace_name, resource.pod_name, resource.container_name], [value_restart_count_aggregate: aggregate(value.restart_count)] |
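The example threshold above (more than 5 restarts within 30 minutes) can be sketched as a sliding-window check. This is a minimal illustration, not the way Cloud Monitoring evaluates alerting policies. It assumes that restart_count samples for a single container are cumulative and that the counter does not reset inside the window. The function name `restarts_exceed_threshold` and the `Sample` type are hypothetical names introduced only for this sketch.

```python
from datetime import datetime, timedelta
from typing import Iterable, Tuple

# Hypothetical sample shape: (sample time, cumulative restart_count value).
Sample = Tuple[datetime, int]


def restarts_exceed_threshold(
    samples: Iterable[Sample],
    max_restarts: int = 5,
    window: timedelta = timedelta(minutes=30),
) -> bool:
    """Return True if any span of `window` or less contains more than
    `max_restarts` restarts, assuming a counter that never resets."""
    ordered = sorted(samples)
    start = 0
    for end in range(len(ordered)):
        # Drop samples that fall outside the window ending at `end`.
        while ordered[end][0] - ordered[start][0] > window:
            start += 1
        # The counter only grows, so the oldest sample still in the
        # window gives the largest restart delta for this end point.
        if ordered[end][1] - ordered[start][1] > max_restarts:
            return True
    return False
```

With one sample per minute, a container that restarts once every 4 minutes exceeds the threshold, while one that restarts once every 10 minutes does not. The right values for `max_restarts` and `window` depend on the SLO for the installation.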
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-10-16 UTC.