Exported From Confluence (Fri, 29 Mar 2024)
The CaaS sub-system provides a single performance management solution, exposed to applications through the HPA (Horizontal Pod Autoscaler) API of the Kubernetes platform. Applications can use both core metrics and custom metrics to scale themselves horizontally. The former are based on CPU and memory usage, while the latter can use practically any metric the developer exposes to the API Aggregator via an HTTP server.
The following sections give a short overview of the components of the performance management and elasticity sub-system of CaaS.
The key difference between core and custom metrics is that core metrics cover only CPU and memory, whereas custom metrics can cover practically any kind of metric. In the first case Kubernetes offers the metrics out of the box; in the second case users have to implement the metrics provider HTTP server themselves.
Note that in both solutions the database behind the performance management system is not persistent; metric values are stored in an in-memory time-series database.
~]$ kubectl api-versions
...
custom.metrics.k8s.io/v1beta1
...
metrics.k8s.io/v1beta1
...
~]$ kubectl top node
NAME            CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%
172.24.16.104   1248m        62%    5710Mi          74%
172.24.16.105   1268m        63%    5423Mi          71%
172.24.16.107   1215m        60%    5191Mi          68%
172.24.16.112   253m         6%     846Mi           11%
The printout shows the node names (here the IP addresses of the nodes), CPU usage both in millicores and as a percentage of the node's CPU capacity (two CPUs for the first nodes in the example), and memory usage both in Mi (MiB) and as a percentage.
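The relationship between the millicore readings and the percentages above can be verified with a short calculation. This is only a sketch; the 2-CPU (2000m) capacity is taken from the example above, and the helper names are illustrative:

```python
# Convert kubectl-style CPU readings and check them against node capacity.
# Assumes a 2-CPU node (2000m), as described for the example above.

def millicores_to_cores(value: str) -> float:
    """Parse a CPU reading such as '1248m' into cores."""
    return int(value.rstrip("m")) / 1000.0

def usage_percent(used: float, capacity: float) -> int:
    """Usage as a whole percentage, as `kubectl top` reports it."""
    return round(used / capacity * 100)

cpu = millicores_to_cores("1248m")   # 1.248 cores
print(usage_percent(cpu, 2.0))       # 62, matching CPU% in the printout
```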
~]$ kubectl top pod --namespace=kube-system | grep elasticsearch
NAME                     CPU(cores)   MEMORY(bytes)
elasticsearch-data-0     71m          1106Mi
elasticsearch-data-1     65m          1114Mi
elasticsearch-data-2     75m          1104Mi
elasticsearch-master-0   4m           1068Mi
elasticsearch-master-1   7m           1076Mi
elasticsearch-master-2   3m           1075Mi
Console output shows pod names and their CPU and memory consumption in t= he same format.
When using custom metrics, the developer has to expose the metrics from the application in Prometheus format. Prometheus client libraries are available for Go, Python and other languages for creating the HTTP server and the metrics for this purpose.
from prometheus_client import start_http_server, Histogram
import random
import time

function_exec = Histogram('function_exec_time',
                          'Time spent processing a function',
                          ['func_name'])

def func():
    if random.random() < 0.02:
        time.sleep(2)
        return
    time.sleep(0.2)

start_http_server(9100)
while True:
    start_time = time.time()
    func()
    function_exec.labels(func_name="func").observe(time.time() - start_time)
This application imports start_http_server and the Histogram metric type from the Prometheus client library and measures the execution time of the func() function. Prometheus can scrape the resulting metrics from port 9100.
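The exposition format itself is plain text, so the same idea can be sketched with only the Python standard library. The metric names below are illustrative, not part of the example above; real applications should prefer the prometheus_client library:

```python
# Minimal sketch of the Prometheus text exposition format using only
# the Python standard library -- illustrative, not a replacement for
# the prometheus_client library used in the example above.
from http.server import BaseHTTPRequestHandler, HTTPServer

def render_metrics(requests_total: int, uptime_seconds: float) -> str:
    """Render a counter and a gauge in the Prometheus text format."""
    return (
        "# HELP app_requests_total Total HTTP requests served.\n"
        "# TYPE app_requests_total counter\n"
        f"app_requests_total {requests_total}\n"
        "# HELP app_uptime_seconds Seconds since the process started.\n"
        "# TYPE app_uptime_seconds gauge\n"
        f"app_uptime_seconds {uptime_seconds:.1f}\n"
    )

class MetricsHandler(BaseHTTPRequestHandler):
    """Serves the rendered metrics on any GET request."""
    def do_GET(self):
        body = render_metrics(1, 0.0).encode()
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4")
        self.end_headers()
        self.wfile.write(body)

# To expose the endpoint on port 9100, as in the example above:
#   HTTPServer(("", 9100), MetricsHandler).serve_forever()
```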
~]$ kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1" | jq .
{
  "kind": "APIResourceList",
  "apiVersion": "v1",
  "groupVersion": "custom.metrics.k8s.io/v1beta1",
  "resources": [
    {
      "name": "pods/go_memstats_heap_released_bytes",
      "singularName": "",
      "namespaced": true,
      "kind": "MetricValueList",
      "verbs": [
        "get"
      ]
    },
    {
      "name": "jobs.batch/http_requests",
      "singularName": "",
      "namespaced": true,
      "kind": "MetricValueList",
      "verbs": [
        "get"
      ]
    }
  ]
}
The command lists the custom metrics available in the system; each metric can then be queried individually for more details:
~]$ kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/kube-system/pods/*/http_requests" | jq .
{
  "kind": "MetricValueList",
  "apiVersion": "custom.metrics.k8s.io/v1beta1",
  "metadata": {
    "selfLink": "/apis/custom.metrics.k8s.io/v1beta1/namespaces/kube-system/pods/%2A/http_requests"
  },
  "items": [
    {
      "describedObject": {
        "kind": "Pod",
        "namespace": "kube-system",
        "name": "podinfo-bd494c88d-lmt2j",
        "apiVersion": "/v1"
      },
      "metricName": "http_requests",
      "timestamp": "2019-02-14T10:21:19Z",
      "value": "898m"
    },
    {
      "describedObject": {
        "kind": "Pod",
        "namespace": "kube-system",
        "name": "podinfo-bd494c88d-lxng7",
        "apiVersion": "/v1"
      },
      "metricName": "http_requests",
      "timestamp": "2019-02-14T10:21:19Z",
      "value": "898m"
    }
  ]
}

~]$ curl http://$(kubectl get service podinfo --namespace=kube-system -o jsonpath='{ .spec.clusterIP }'):9898/metrics
…
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.005"} 2040
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.01"} 2040
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.025"} 2040
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.05"} 2072
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.1"} 2072
http_request_duration_seconds_bucket{method="GET",path="healthz",status="200",le="0.25"} 2072
…
# HELP http_requests_total The total number of HTTP requests.
# TYPE http_requests_total counter
http_requests_total{status="200"} 5593
…
The cURL command is a plain HTTP request; it shows the custom metrics exposed by the HTTP server of an application running in a Kubernetes pod.
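The value "898m" returned by the custom metrics API is a Kubernetes milli-quantity, i.e. roughly 0.9 (here likely requests per second, as the metric is a rate). A sketch of decoding such values (the function name is illustrative, and only the plain and "m" forms are handled; the full Kubernetes quantity grammar also has suffixes such as Ki, Mi, k, M):

```python
# Decode milli-suffixed quantity strings as returned by the custom
# metrics API, e.g. "898m" -> 0.898. Sketch only: the real Kubernetes
# quantity grammar supports further binary/decimal suffixes.

def parse_quantity(q: str) -> float:
    if q.endswith("m"):
        return int(q[:-1]) / 1000.0
    return float(q)

print(parse_quantity("898m"))   # 0.898
print(parse_quantity("10"))     # 10.0
```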
php-apache-hpa.yml

apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  name: php-apache-hpa
spec:
  scaleTargetRef:
    apiVersion: extensions/v1beta1
    kind: Deployment
    name: php-apache-deployment
  minReplicas: 1
  maxReplicas: 5
  targetCPUUtilizationPercentage: 50
In this example the HPA scales based on the CPU consumption of php-apache-deployment. The minimum number of replicas is one and the maximum is five. The HPA scales the deployment out when average CPU utilization rises above 50%, and scales it back in when utilization drops below that target.
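The scale-out/scale-in decision follows the HPA's documented formula, desiredReplicas = ceil(currentReplicas × currentMetric / targetMetric), clamped to the configured bounds. A sketch of that calculation (the function name is illustrative):

```python
import math

def desired_replicas(current: int, metric: float, target: float,
                     min_replicas: int, max_replicas: int) -> int:
    """HPA replica calculation: ceil(current * metric/target), clamped."""
    desired = math.ceil(current * metric / target)
    return max(min_replicas, min(max_replicas, desired))

# Two pods at 90% average CPU against the 50% target -> scale out to 4.
print(desired_replicas(2, 90, 50, 1, 5))   # 4
# Two pods at 20% -> scale in, clamped at minReplicas=1.
print(desired_replicas(2, 20, 50, 1, 5))   # 1
```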
podinfo-hpa-custom.yaml

apiVersion: autoscaling/v2beta1
kind: HorizontalPodAutoscaler
metadata:
  name: podinfo
  namespace: kube-system
spec:
  scaleTargetRef:
    apiVersion: extensions/v1beta1
    kind: Deployment
    name: podinfo
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Pods
    pods:
      metricName: http_requests
      targetAverageValue: 10
In the second example the HPA uses a custom metric to manage performance. The podinfo application contains an HTTP server which exposes its metrics in Prometheus format. The minimum number of replicas is two and the maximum is ten. The custom metric is the rate of HTTP requests (http_requests) averaged over the pods, as reported by the exposed metrics.
~]$ kubectl create -f podinfo-hpa-custom.yaml --namespace=kube-system
Starting a core-metrics HPA works with the same command, using the corresponding manifest file.
~]$ kubectl describe hpa podinfo --namespace=kube-system
Name:               podinfo
Namespace:          kube-system
Labels:             <none>
Annotations:        <none>
CreationTimestamp:  Tue, 19 Feb 2019 10:08:21 +0100
Reference:          Deployment/podinfo
Metrics:            ( current / target )
  "http_requests" on pods:  901m / 10
Min replicas:       2
Max replicas:       10
Deployment pods:    2 current / 2 desired
Conditions:
  Type            Status  Reason            Message
  ----            ------  ------            -------
  AbleToScale     True    ReadyForNewScale  recommended size matches current size
  ScalingActive   True    ValidMetricFound  the HPA was able to successfully calculate a replica count from pods metric http_requests
  ScalingLimited  True    TooFewReplicas    the desired replica count is increasing faster than the maximum scale rate
Events:             <none>
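The replica count in the describe output is consistent with the HPA formula desiredReplicas = ceil(currentReplicas × currentMetric / targetMetric), clamped to the configured bounds. A quick check against the figures above (2 current pods, an average http_requests of 901m, a target of 10, minReplicas 2, maxReplicas 10):

```python
import math

# 2 current pods at an average http_requests of 901m (0.901) vs a target of 10.
raw = math.ceil(2 * 0.901 / 10)   # raw recommendation: 1 replica
desired = max(2, min(10, raw))    # clamped to minReplicas=2
print(desired)                    # 2, matching "2 current / 2 desired"
```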
Note that the HPA API supports scaling based on both core and custom metrics within the same HPA object.
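As a sketch of what such a combined object could look like (illustrative names, assuming the autoscaling/v2beta1 API used earlier), a single HPA can list a Resource metric and a Pods metric side by side; the controller then scales to the highest replica count proposed by any of its metrics:

```yaml
# Illustrative only: one HPA combining a core (Resource) metric and a
# custom (Pods) metric for the same target deployment.
apiVersion: autoscaling/v2beta1
kind: HorizontalPodAutoscaler
metadata:
  name: podinfo-combined
  namespace: kube-system
spec:
  scaleTargetRef:
    apiVersion: extensions/v1beta1
    kind: Deployment
    name: podinfo
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      targetAverageUtilization: 50
  - type: Pods
    pods:
      metricName: http_requests
      targetAverageValue: 10
```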
Example Kubernetes manifests for testing with custom metrics: