n9e categraf k8s监控配置 -kube-state-metrics

1、k8s集群中创建命名空间/角色/绑定关系/容器-kube-state-metrics

下载镜像 kube-state-metrics-v2.5.0.tar

bash 复制代码
docker load -i kube-state-metrics-v2.5.0.tar
docker images |grep kube-state-metrics
kubectl create namespace z-monitor
vi   kube-state-metrics.yaml
bash 复制代码
#Create by:ember.zhang
#为什么不问问神奇的海螺呢丶
apiVersion: v1
kind: ServiceAccount
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间 
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: kube-state-metrics
rules:
  - apiGroups: [""]
    resources:
      - configmaps
      - secrets
      - nodes
      - pods
      - services
      - resourcequotas
      - replicationcontrollers
      - limitranges
      - persistentvolumeclaims
      - persistentvolumes
      - namespaces
      - endpoints
    verbs: ["list", "watch"]
  - apiGroups: ["apps"]
    resources:
      - statefulsets
      - daemonsets
      - deployments
      - replicasets
    verbs: ["list", "watch"]
  - apiGroups: ["batch"]
    resources:
      - cronjobs
      - jobs
    verbs: ["list", "watch"]
  - apiGroups: ["autoscaling"]
    resources:
      - horizontalpodautoscalers
    verbs: ["list", "watch"]
  - apiGroups: ["policy"]
    resources:
      - poddisruptionbudgets
    verbs: ["list", "watch"]
  - apiGroups: ["certificates.k8s.io"]
    resources:
      - certificatesigningrequests
    verbs: ["list", "watch"]
  - apiGroups: ["storage.k8s.io"]
    resources:
      - storageclasses
      - volumeattachments
    verbs: ["list", "watch"]
  - apiGroups: ["apiextensions.k8s.io"]
    resources:
      - customresourcedefinitions
    verbs: ["list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: kube-state-metrics
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: kube-state-metrics
subjects:
  - kind: ServiceAccount
    name: kube-state-metrics
    namespace: z-monitor  # 命名空间
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间
  labels:
    app: kube-state-metrics
spec:
  replicas: 1  # 副本数
  selector:
    matchLabels:
      app: kube-state-metrics
  template:
    metadata:
      labels:
        app: kube-state-metrics
    spec:
      serviceAccountName: kube-state-metrics
      containers:
        - name: kube-state-metrics
          image: registry.k8s.io/kube-state-metrics/kube-state-metrics:v2.5.0  # 镜像版本
          imagePullPolicy: IfNotPresent  # 强制使用本地镜像
          ports:
            - name: http-metrics
              containerPort: 8080
            - name: telemetry
              containerPort: 8081
          resources:
            # 资源限制
            limits:
              cpu: 200m
              memory: 256Mi
            requests:
              cpu: 100m
              memory: 128Mi
          readinessProbe:
            httpGet:
              path: /healthz
              port: 8080
            initialDelaySeconds: 5
            timeoutSeconds: 5
          livenessProbe:
            httpGet:
              path: /healthz
              port: 8080
            initialDelaySeconds: 15
            timeoutSeconds: 5
---
apiVersion: v1
kind: Service
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间
  labels:
    app: kube-state-metrics
spec:
  type: NodePort  
  ports:
    - name: http-metrics
      port: 8080
      targetPort: 8080
      nodePort: 30880
      protocol: TCP
    - name: telemetry
      port: 8081
      targetPort: 8081
      nodePort: 30881
      protocol: TCP
  selector:
    app: kube-state-metrics
bash 复制代码
kubectl apply -f z-kube-state-metrics.yaml

2、categraf/conf/input.prometheus/ 增加配置文件

bash 复制代码
vi cim-prod.toml
bash 复制代码
interval = 15

[[instances]]
urls = [
    "http://10.12.8.xx:30880/metrics"
]

url_label_key = "instance"
url_label_value = "{{.Host}}"


interval_times = 1

labels = { cluster = "k8s-cim-prod" }


ignore_metrics = [ "go_*", "process_*", "http_*" ]

ignore_label_keys = []


timeout = "10s"

重启 categraf

bash 复制代码
service  categraf restart
相关推荐
Xzh04231 小时前
AI Agent 学习路线(Java 后端方向)
java·人工智能·学习
正经教主2 小时前
【docker基础】第五课:Docker网络详解
运维·docker·容器
艾利克斯冰2 小时前
Java 设计模式-行为型模式(更新中)
java·开发语言·设计模式
倒霉蛋小马2 小时前
Java新特性:record关键字
java·开发语言
折哥的程序人生 · 物流技术专研2 小时前
《Java 100 天进阶之路》第95篇:消息队列基础(RocketMQ/Kafka)(2026版)
java·面试·kafka·rocketmq·java-rocketmq·求职招聘
budingxiaomoli3 小时前
Spring日志
java·开发语言
IT空门:门主3 小时前
Spring 注入三剑客:@Resource、@Autowired、@RequiredArgsConstructor 到底该用哪个?
java·后端·spring
Sam_Deep_Thinking3 小时前
Spring Boot 的启动原理是什么?
java·spring boot·后端
南部余额3 小时前
Spring WebClient 从入门到精通
java·后端·spring
CodeStats3 小时前
从 CPU 指令到 JVM 进程:彻底讲透 Java 执行 main 方法时,类加载、主线程、栈帧入栈的完整底层逻辑
java·linux·开发语言