n9e categraf k8s监控配置 -cadvisor

1、创建用户角色权限

bash 复制代码
vi  z-categraf-monitor-rbac.yaml
bash 复制代码
apiVersion: v1
kind: ServiceAccount
metadata:
  name: z-categraf-monitor
  namespace: z-monitor
  labels:
    app: categraf
    purpose: cadvisor-monitor
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: z-categraf-monitor-clusterrole
  labels:
    app: categraf
    purpose: cadvisor-monitor
rules:
  - nonResourceURLs:
      - "/metrics/cadvisor"
      - "/stats/summary"
      - "/pods"
      - "/nodes/proxy/metrics/cadvisor"
    verbs: ["get"]
  - apiGroups: [""]
    resources:
      - "nodes"
      - "nodes/metrics"
      - "nodes/stats"
      - "nodes/proxy"
    verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: z-categraf-monitor-clusterrolebinding
  labels:
    app: categraf
    purpose: cadvisor-monitor
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: z-categraf-monitor-clusterrole
subjects:
  - kind: ServiceAccount
    name: z-categraf-monitor
    namespace: z-monitor
bash 复制代码
kubectl apply -f z-categraf-monitor-rbac.yaml

2、获取token

bash 复制代码
# 提取Token
SECRET_NAME=$(kubectl get sa z-categraf-monitor -n z-monitor -o jsonpath='{.secrets[0].name}')
VALID_TOKEN=$(kubectl get secret $SECRET_NAME -n z-monitor -o jsonpath='{.data.token}' | base64 -d)

#  将Token写入文件
echo "$VALID_TOKEN" > /data/zz/z-categraf-monitor.token

3、每个节点配置 categraf/conf/input.cadvisor/cim-pord-cadvisor.toml

bash 复制代码
vi  cim-pord-cadvisor.toml
bash 复制代码
# # collect interval
interval = 15

[[instances]]
url = "https://127.0.0.1:10250"
type = "kubelet"

bearer_token_string = "eyxxxxx"
ignore_label_keys = ["id","name", "container_label*"]
insecure_skip_verify = true
use_tls = true
container_exclude = ["^POD$"]
# ignore_label_keys = ["id","name", "container_label*"]
## choose_label_keys = ["id"]

timeout = "15s"# # collect interval
 
相关推荐
米高梅狮子1 天前
03.网络类服务实践
linux·运维·服务器·网络·kubernetes·centos·openstack
ElevenS_it1881 天前
Zabbix+Prometheus+云监控告警统一接入实战:用Webhook+事件总线搭建多源告警归一化平台
kubernetes·zabbix·prometheus
万里侯1 天前
GitOps实战:用Git管理基础设施
微服务·容器·k8s
STDD1 天前
cert-manager:Kubernetes 自动 TLS 证书管理
云原生·容器·kubernetes
阿里云云原生1 天前
【5.29北京】智驭运维,Agentic Ops可观测工作坊限时报名!
云原生·agent
卧室小白1 天前
docker容器
运维·docker·容器
Benszen1 天前
Docker容器化解决方案
运维·docker·容器
LT10157974441 天前
2026年云原生RPA选型指南:云端协同与弹性部署适配
云原生·rpa
姚不倒1 天前
Go 语言基础入门:从零到实战,一篇文章掌握核心语法
云原生·golang
仙柒4152 天前
Namespace
运维·docker·容器