n9e categraf k8s监控配置 -kube-state-metrics

1、k8s集群中创建命名空间/角色/绑定关系/容器-kube-state-metrics

下载镜像 kube-state-metrics-v2.5.0.tar

bash 复制代码
docker load -i kube-state-metrics-v2.5.0.tar
docker images |grep kube-state-metrics
kubectl create namespace z-monitor
vi   kube-state-metrics.yaml
bash 复制代码
#Create by:ember.zhang
#为什么不问问神奇的海螺呢丶
apiVersion: v1
kind: ServiceAccount
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间 
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: kube-state-metrics
rules:
  - apiGroups: [""]
    resources:
      - configmaps
      - secrets
      - nodes
      - pods
      - services
      - resourcequotas
      - replicationcontrollers
      - limitranges
      - persistentvolumeclaims
      - persistentvolumes
      - namespaces
      - endpoints
    verbs: ["list", "watch"]
  - apiGroups: ["apps"]
    resources:
      - statefulsets
      - daemonsets
      - deployments
      - replicasets
    verbs: ["list", "watch"]
  - apiGroups: ["batch"]
    resources:
      - cronjobs
      - jobs
    verbs: ["list", "watch"]
  - apiGroups: ["autoscaling"]
    resources:
      - horizontalpodautoscalers
    verbs: ["list", "watch"]
  - apiGroups: ["policy"]
    resources:
      - poddisruptionbudgets
    verbs: ["list", "watch"]
  - apiGroups: ["certificates.k8s.io"]
    resources:
      - certificatesigningrequests
    verbs: ["list", "watch"]
  - apiGroups: ["storage.k8s.io"]
    resources:
      - storageclasses
      - volumeattachments
    verbs: ["list", "watch"]
  - apiGroups: ["apiextensions.k8s.io"]
    resources:
      - customresourcedefinitions
    verbs: ["list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: kube-state-metrics
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: kube-state-metrics
subjects:
  - kind: ServiceAccount
    name: kube-state-metrics
    namespace: z-monitor  # 命名空间
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间
  labels:
    app: kube-state-metrics
spec:
  replicas: 1  # 副本数
  selector:
    matchLabels:
      app: kube-state-metrics
  template:
    metadata:
      labels:
        app: kube-state-metrics
    spec:
      serviceAccountName: kube-state-metrics
      containers:
        - name: kube-state-metrics
          image: registry.k8s.io/kube-state-metrics/kube-state-metrics:v2.5.0  # 镜像版本
          imagePullPolicy: IfNotPresent  # 强制使用本地镜像
          ports:
            - name: http-metrics
              containerPort: 8080
            - name: telemetry
              containerPort: 8081
          resources:
            # 资源限制
            limits:
              cpu: 200m
              memory: 256Mi
            requests:
              cpu: 100m
              memory: 128Mi
          readinessProbe:
            httpGet:
              path: /healthz
              port: 8080
            initialDelaySeconds: 5
            timeoutSeconds: 5
          livenessProbe:
            httpGet:
              path: /healthz
              port: 8080
            initialDelaySeconds: 15
            timeoutSeconds: 5
---
apiVersion: v1
kind: Service
metadata:
  name: kube-state-metrics
  namespace: z-monitor  # 命名空间
  labels:
    app: kube-state-metrics
spec:
  type: NodePort  
  ports:
    - name: http-metrics
      port: 8080
      targetPort: 8080
      nodePort: 30880
      protocol: TCP
    - name: telemetry
      port: 8081
      targetPort: 8081
      nodePort: 30881
      protocol: TCP
  selector:
    app: kube-state-metrics
bash 复制代码
kubectl apply -f z-kube-state-metrics.yaml

2、categraf/conf/input.prometheus/ 增加配置文件

bash 复制代码
vi cim-prod.toml
bash 复制代码
interval = 15

[[instances]]
urls = [
    "http://10.12.8.xx:30880/metrics"
]

url_label_key = "instance"
url_label_value = "{{.Host}}"


interval_times = 1

labels = { cluster = "k8s-cim-prod" }


ignore_metrics = [ "go_*", "process_*", "http_*" ]

ignore_label_keys = []


timeout = "10s"

重启 categraf

bash 复制代码
service  categraf restart
相关推荐
时艰.2 小时前
Java 线程池 — ThreadPoolExecutor
java·开发语言·python
Coder_Boy_2 小时前
企业级项目高并发监控场景-Spring Boot 集成 Graphite & InfluxDB 实战文档
java·spring boot·后端·系统架构
lang201509282 小时前
Java EE并发工具:JSR 236详解
java·java-ee
EverydayJoy^v^10 小时前
RH134学习进程——十二.运行容器(1)
linux·运维·容器
java_logo10 小时前
OpenProject Docker 容器化部署指南:从快速启动到生产环境配置
docker·容器·openproject·openproject部署·openproject部署手册·openproject部署方案·openproject部署教程
毕设源码-朱学姐11 小时前
【开题答辩全过程】以 基于JavaWeb的网上家具商城设计与实现为例,包含答辩的问题和答案
java
C雨后彩虹12 小时前
CAS与其他并发方案的对比及面试常见问题
java·面试·cas·同步·异步·
java1234_小锋13 小时前
Java高频面试题:SpringBoot为什么要禁止循环依赖?
java·开发语言·面试
2501_9445255413 小时前
Flutter for OpenHarmony 个人理财管理App实战 - 账户详情页面
android·java·开发语言·前端·javascript·flutter