Kubernetes configuration

The concepts in this section apply to any Kubernetes clusters created using Cluster API, i.e. the HA cluster in an HA deployment and tenant clusters.

The variable names differ slightly for the two cases.

Images

When building a cluster, Cluster API requires that an image exists in the target cloud that is accessible to the target project and has the correct version of kubelet and kubeadm available.

Suitable images are uploaded as part of an Azimuth deployment using the Community images functionality and the IDs are automatically propagated where they are needed, i.e. for the Azimuth HA cluster and the Kubernetes cluster templates.

If required, e.g. for custom Kubernetes templates, the IDs of these images can be referenced using the community_images_image_ids variable:

environments/my-site/inventory/group_vars/all/variables.yml
kube_1_25_image_id: "{{ community_images_image_ids.kube_1_25 }}"
kube_1_26_image_id: "{{ community_images_image_ids.kube_1_26 }}"
kube_1_27_image_id: "{{ community_images_image_ids.kube_1_27 }}"
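
Tip

You can verify that the community images have been uploaded to the target cloud using the OpenStack CLI (the exact image names depend on your deployment configuration):

openstack image list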

Docker Hub mirror

Docker Hub imposes rate limits on image downloads, which can cause issues for both the Azimuth HA cluster and, in particular, tenant clusters. This can be worked around by mirroring the images to a local registry.

If you have a Docker Hub mirror available, this can be configured using the following variables:

environments/my-site/inventory/group_vars/all/variables.yml
#### For the HA cluster ####

capi_cluster_registry_mirrors:
  docker.io: ["https://docker-hub-mirror.example.org/v2"]

#### For tenant clusters ####

azimuth_capi_operator_capi_helm_registry_mirrors:
  docker.io: ["https://docker-hub-mirror.example.org/v2"]
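
Tip

To check that a mirror is reachable before configuring it, you can query the standard Docker registry API base endpoint directly (substituting your mirror's URL) - a 200 or 401 response indicates the registry is responding:

curl -I https://docker-hub-mirror.example.org/v2/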

Tip

If you do not have a Docker Hub mirror available, one can be deployed as part of an Azimuth deployment. This mirror will be automatically configured for tenant Kubernetes clusters.

This mirror cannot be used for the Azimuth HA cluster as it is deployed on that cluster.

Multiple external networks

In the case where multiple external networks are available, you must tell Azimuth which one to use:

environments/my-site/inventory/group_vars/all/variables.yml
#### For the HA cluster ####

# The ID of the external network to use
capi_cluster_external_network_id: "<network id>"

#### For tenant clusters ####

azimuth_capi_operator_external_network_id: "<network id>"

Note

This does not currently respect the portal-external tag.
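
Tip

You can see the available external networks, and their IDs, using the OpenStack CLI:

openstack network list --external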

Volume-backed instances

Flavors with 100GB root disks are recommended for Kubernetes nodes, both for the Azimuth deployment and for tenant clusters.

If flavors with large root disks are not available on the target cloud, it is possible to use volume-backed instances instead.
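
Tip

You can check the root disk sizes of the available flavors using the OpenStack CLI - the Disk column shows the root disk size in GB:

openstack flavor list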

etcd and spinning disks

The configuration options in this section should be used subject to the advice in the prerequisites about using Cinder volumes with Kubernetes.

etcd on a separate block device

If you only have a limited amount of SSD (or, even better, local disk) available, consider placing etcd on a separate block device to make the best use of the limited capacity.

To configure Kubernetes clusters to use volume-backed instances (i.e. use a Cinder volume as the root disk), the following variables can be used:

environments/my-site/inventory/group_vars/all/variables.yml
#### For the HA cluster ####

# The size of the root volumes for Kubernetes nodes
capi_cluster_root_volume_size: 100
# The volume type to use for root volumes for Kubernetes nodes
capi_cluster_root_volume_type: nvme

#### For tenant clusters ####

azimuth_capi_operator_capi_helm_root_volume_size: 100
azimuth_capi_operator_capi_helm_root_volume_type: nvme

Tip

You can see the available volume types using the OpenStack CLI:

openstack volume type list

Etcd configuration

As discussed in the prerequisites, etcd is extremely sensitive to write latency.

Azimuth is able to configure Kubernetes nodes, both for the HA cluster and tenant clusters, so that etcd is on a separate block device. This block device can be of a different volume type to the root disk, allowing efficient use of SSD-backed storage. When supported by the flavor, the etcd block device can also use local disk even if the root volume is from Cinder.

Use local disk for etcd whenever possible

Using local disk wherever possible minimises the write latency for etcd and also eliminates network instability as a source of latency problems.

The following variables are used to configure the etcd block device for the HA cluster:

environments/my-site/inventory/group_vars/all/variables.yml
# Specifies the size of the etcd block device in GB
# This is typically between 2GB and 10GB - Amazon recommends 8GB for EKS
# Defaults to 0, meaning etcd stays on the root device
capi_cluster_etcd_blockdevice_size: 8

# The type of block device that will be used for etcd
# Specify "Volume" (the default) to use a Cinder volume
# Specify "Local" to use local disk (the flavor must support ephemeral disk)
capi_cluster_etcd_blockdevice_type: Volume

# The Cinder volume type to use for the etcd block device
# Only used if "Volume" is specified as block device type
# If not given, the default volume type for the cloud will be used
capi_cluster_etcd_blockdevice_volume_type: nvme

# The Cinder availability zone to use for the etcd block device
# Only used if "Volume" is specified as block device type
# Defaults to "nova"
capi_cluster_etcd_blockdevice_volume_az: nova

The equivalent variables for tenant clusters are:

environments/my-site/inventory/group_vars/all/variables.yml
azimuth_capi_operator_capi_helm_etcd_blockdevice_size:
azimuth_capi_operator_capi_helm_etcd_blockdevice_type:
azimuth_capi_operator_capi_helm_etcd_blockdevice_volume_type:
azimuth_capi_operator_capi_helm_etcd_blockdevice_volume_az:
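
For example, applying the same settings as the HA cluster example above to tenant clusters might look like this:

azimuth_capi_operator_capi_helm_etcd_blockdevice_size: 8
azimuth_capi_operator_capi_helm_etcd_blockdevice_type: Volume
azimuth_capi_operator_capi_helm_etcd_blockdevice_volume_type: nvme
azimuth_capi_operator_capi_helm_etcd_blockdevice_volume_az: nova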

Load-balancer provider

If the target cloud uses OVN networking, and the OVN Octavia provider is enabled, then Kubernetes clusters should be configured to use the OVN provider for any load-balancers that are created:

environments/my-site/inventory/group_vars/all/variables.yml
openstack_loadbalancer_provider: ovn

Tip

You can see the available load-balancer providers using the OpenStack CLI:

openstack loadbalancer provider list

Availability zones

By default, an Azimuth installation assumes that there is a single availability zone (AZ) called nova - this is the default setup and is common for small-to-medium sized clouds. If this is not the case for your target cloud, you may need to set additional variables specifying the AZs to use, both for the HA cluster and for tenant Kubernetes clusters.

The default behaviour when scheduling Kubernetes nodes using Cluster API is:

  • All available AZs are considered for control plane nodes, and Cluster API will attempt to spread the nodes across multiple AZs, if available.
  • Worker nodes are scheduled into the nova AZ explicitly. If this AZ does not exist, scheduling will fail.

If this default behaviour does not work for your target cloud, the following options are available.

Note

Cluster API refers to "failure domains" which, in the OpenStack provider, correspond to availability zones (AZs).

Ignore availability zones

It is possible to configure Cluster API clusters in such a way that AZs are not specified at all for Kubernetes nodes. This allows other placement constraints such as flavor traits and host aggregates to be used, and a suitable AZ to be selected by OpenStack.

environments/my-site/inventory/group_vars/all/variables.yml
#### For the HA cluster ####

# Indicate that the failure domain should be omitted for control plane nodes
capi_cluster_control_plane_omit_failure_domain: true
# Specify no failure domain for worker nodes
capi_cluster_worker_failure_domain: null

#### For tenant clusters ####
azimuth_capi_operator_capi_helm_control_plane_omit_failure_domain: true
azimuth_capi_operator_capi_helm_worker_failure_domain: null

Tip

This is the recommended configuration for new deployments, unless you have a specific need to target particular availability zones.

Use specific availability zones

To use specific availability zones for Kubernetes nodes, the following variables can be used:

environments/my-site/inventory/group_vars/all/variables.yml
#### For the HA cluster ####

# A list of failure domains that should be considered for control plane nodes
capi_cluster_control_plane_failure_domains: [az1, az2]
# The failure domain for worker nodes
capi_cluster_worker_failure_domain: az1

#### For tenant clusters ####

azimuth_capi_operator_capi_helm_control_plane_failure_domains: [az1, az2]
azimuth_capi_operator_capi_helm_worker_failure_domain: az1
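
Tip

You can see the available availability zones using the OpenStack CLI:

openstack availability zone list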
