Alerting metrics summary
Kubernetes
PVC usage (%)
100 - (kubelet_volume_stats_available_bytes / kubelet_volume_stats_capacity_bytes) * 100
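To show how the expression above could be turned into an alert, here is a minimal sketch of a rule file. The group name, alert name, 80% threshold, and 5m hold time are illustrative assumptions, not values from these notes; the `namespace` and `persistentvolumeclaim` labels are the ones the kubelet volume stats metrics normally carry.

```yaml
groups:
  - name: kubernetes-storage                     # hypothetical group name
    rules:
      - alert: PersistentVolumeClaimUsageHigh    # hypothetical alert name
        # Same usage expression as above, compared against an assumed 80% threshold.
        expr: 100 - (kubelet_volume_stats_available_bytes / kubelet_volume_stats_capacity_bytes) * 100 > 80
        for: 5m                                  # assumed hold time before firing
        labels:
          severity: warning
        annotations:
          summary: "PVC usage high ({{ $labels.namespace }}/{{ $labels.persistentvolumeclaim }})"
          description: "PVC usage is above 80%\n VALUE = {{ $value }}"
```

The threshold should probably be tuned per workload; volumes that fill quickly may need a lower value or a shorter `for` window.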
Disk usage (%)
100 - (node_filesystem_avail_bytes / node_filesystem_size_bytes * 100)
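As with the PVC metric, the disk usage expression can be wrapped in an alerting rule. The following is a sketch under assumptions: the group name, alert name, 80% threshold, and 5m hold time are hypothetical, while `instance` and `mountpoint` are labels that node_exporter filesystem metrics normally expose.

```yaml
groups:
  - name: host-disk                  # hypothetical group name
    rules:
      - alert: HostDiskUsageHigh     # hypothetical alert name
        # Same usage expression as above, compared against an assumed 80% threshold.
        expr: 100 - (node_filesystem_avail_bytes / node_filesystem_size_bytes * 100) > 80
        for: 5m                      # assumed hold time before firing
        labels:
          severity: warning
        annotations:
          summary: "Disk usage high (instance {{ $labels.instance }}, mountpoint {{ $labels.mountpoint }})"
          description: "Filesystem usage is above 80%\n VALUE = {{ $value }}"
```

As written, the expression evaluates every filesystem node_exporter reports, including pseudo filesystems such as tmpfs. Adding an `fstype` or `mountpoint` matcher to both metrics is one way to narrow it, if that noise turns out to be a problem.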
Host and hardware monitoring
Available memory
Less than 10% of the host's memory is available
- alert: HostOutOfMemory
  expr: node_memory_MemAvailable_bytes / node_memory_MemTotal_bytes * 100 < 10
  for: 5m
  labels:
    severity: warning
  annotations:
    summary: "Host out of memory (instance {{ $labels.instance }})"
    description: "Node memory is filling up (< 10% left)\n VALUE = {{ $value }}\n LABELS: {{ $labels }}"