3.7 自动伸缩 · k8s

[TOC] # Horizontal Pod Autoscaling **HPA**使Pod水平自动缩放,不再需要手动扩容，实现自动化，还可以自定义指标。 Horizontal Pod Autoscaling仅适用于Deployment和ReplicaSet,由API server和controller共同实现。 `Horizontal Pod Autoscaling，kubernetes`能够根据监测到的 CPU 利用率（或者在 alpha 版本中支持的应用提供的 metric）自动的扩容 replication controller，deployment 和 replica set。 ### 收集指标插件收集指标信息的插件有metrics-server和heapster，`heapster`不久就会被遗弃，所以我选择使用`metrics-server` > 注意：通过 Metrics API，只能获取 node 或 pod 当前的使用情况。k8s并不会存储这个数据，因此想要获取10分钟前的资源使用量是不行的。 ### 安装首先安装metrics 在 k8s 官网上找到metrics的项目地址：[metrics项目地址](https://github.com/kubernetes-incubator/metrics-server) 将项目下载到本地并安装 ~~~ git clone https://github.com/kubernetes-incubator/metrics-server.git #进入1.8+目录，查看所需镜像 cat ./* | grep image ~~~ ![](https://img.kancloud.cn/1a/a2/1aa2ad3ace02cef2362a9a7771a0e3ed_727x81.png) 由于`metrics-server`部署的时候，其中镜像下载的规则设置为了`Always`,因为墙的原因，不管本地是否存在有该镜像，都会从远端重新pull一次，导致部署`pull error`失败。修改 metrics-server-deployment.yaml 设置为`IfNotPresent` ![](https://img.kancloud.cn/7e/fc/7efc1bf61f26f5a8206bfb5894137cff_495x149.png) 安装 ![](https://img.kancloud.cn/63/62/6362eb5e724dc791ce4f69b15e85e564_745x170.png) #### 验证可能会出现红框中的提示信息，出现这种情况有两种可能 * `metrics-server`还在启动中，等待一会儿就好（如下图） ![](https://img.kancloud.cn/f7/6b/f76b3171df01034599bc19d68d52aab8_465x235.png) * `metrics-server`中的地址解析有问题（如下图） ![](https://img.kancloud.cn/e7/39/e739bcae1f3674ac447f82194c40d035_689x164.png) 如果出现了无法解析的情况，需要在`metrics-server-deployment.yaml`文件中添加以下信息 ![](https://img.kancloud.cn/ad/a0/ada07deda40466e16f07890f892f011b_605x207.png) 使用IP地址进行通信，同时忽略443端口造成的证书问题[参考issue](https://github.com/kubernetes-incubator/metrics-server/issues/143) ### Metrics支持的指标 * CPU 指标 * 内存指标 * 自定义指标（我不懂这个啊，找不到什么资料，以后再添加） ### 查看资源使用情况 ~~~ kubectl top node kubectl top pod ~~~ ### autoscale命令 ~~~ kubectl autoscale (-f FILENAME | TYPE NAME | TYPE/NAME) [--min=MINPODS] --max=MAXPODS [--cpu-percent=CPU] [flags] [options] ~~~ ### 示例 #### 基于 CPU 和内存使用的自动缩放为Deployment foo创建一个autoscaler，当Pod的CPU利用率达到80%的时候，RC的replica数量在2到5之间。 ~~~ kubectl autoscale deployment foo --min=2 --max=5 --cpu-percent=80 ~~~ 定义一个保持最少两个副本的 HPA ，如果 CPU 平均使用量超过 80% 或内存超过 200Mi，则最高可扩展到 10 个副本 ~~~ apiVersion: autoscaling/v2beta1 kind: HorizontalPodAutoscaler metadata: name: podinfo spec: scaleTargetRef: apiVersion: extensions/v1beta1 kind: Deployment name: podinfo minReplicas: 2 maxReplicas: 10 metrics: - type: Resource resource: name: cpu targetAverageUtilization: 80 - type: Resource resource: name: memory targetAverageValue: 200Mi ~~~ #### 基于自定义指标的自动扩容 (这段是在网上转载的，留着以后复盘用) ##### 开启自定义功能 * 将kube-controller-manager的启动参数中--horizontal-pod-autoscaler-use-rest-clients设置为true，并指定--master为API server地址，如`--master=http://172.20.0.113:8080` * 修改kube-apiserver的配置文件apiserver，增加一条配置--requestheader-client-ca-file=/etc/kubernetes/ssl/ca.pem --requestheader-allowed-names=aggregator --requestheader-extra-headers-prefix=X-Remote-Extra- --requestheader-group-headers=X-Remote-Group --requestheader-username-headers=X-Remote-User --proxy-client-cert-file=/etc/kubernetes/ssl/kubernetes.pem --proxy-client-key-file=/etc/kubernetes/ssl/kubernetes-key.pem，用来配置aggregator的CA证书。 ##### 使用示例 ~~~ $ kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/default/pods/*/http_requests" | jq . ~~~ ~~~ { "kind": "MetricValueList", "apiVersion": "custom.metrics.k8s.io/v1beta1", "metadata": { "selfLink": "/apis/custom.metrics.k8s.io/v1beta1/namespaces/default/pods/%2A/http_requests" }, "items": [ { "describedObject": { "kind": "Pod", "namespace": "default", "name": "podinfo-6b86c8ccc9-kv5g9", "apiVersion": "/__internal" }, "metricName": "http_requests", "timestamp": "2018-01-10T16:49:07Z", "value": "901m" #m表示毫秒，例如，901m 表示 901 毫次/每秒。 }, { "describedObject": { "kind": "Pod", "namespace": "default", "name": "podinfo-6b86c8ccc9-nm7bl", "apiVersion": "/__internal" }, "metricName": "http_requests", "timestamp": "2018-01-10T16:49:07Z", "value": "898m" } ] } ~~~ 创建一个 HPA，如果请求数超过每秒 100 次将扩大 podinfo 副本数,，最大10个，最少2个： ~~~ apiVersion: autoscaling/v2beta1 kind: HorizontalPodAutoscaler metadata: name: podinfo spec: scaleTargetRef: apiVersion: extensions/v1beta1 kind: Deployment name: podinfo minReplicas: 2 maxReplicas: 10 metrics: - type: Pods pods: metricName: http_requests targetAverageValue: 100 ~~~ ### HAP的滚动更新目前在Kubernetes中，可以通过直接管理 replication controller 或使用 deployment 对象来执行滚动更新，该 deployment 对象为您管理基础 replication controller。 Horizontal Pod Autoscaler 仅支持后一种方法：Horizontal Pod Autoscaler 被绑定到 deployment 对象，它设置 deployment 对象的大小，deployment 负责设置底层 replication controller 的大小。 Horizontal Pod Autoscaler 不能使用直接操作 replication controller 进行滚动更新，即不能将 Horizontal Pod Autoscaler 绑定到 replication controller，并进行滚动更新（例如使用kubectl rolling-update）。这不行的原因是，当滚动更新创建一个新的 replication controller 时，Horizontal Pod Autoscaler 将不会绑定到新的 replication controller 上。