Lorenzo Soligo - GitOps 101: where we come from, where we are going
Kubernetes is a (Game) Loop!
while True:
    current_state = get_current_state()      # observe the world
    desired_state = get_desired_state()      # read the declared intent
    reconcile(current_state, desired_state)  # converge one toward the other
OK, this is a super high-level description of the thing, but it helps set the context for what Kubernetes is.
The simple loop above is often called a reconciliation loop, a sort of pattern.
This lets you be "declarative": just state what you need from the system, not how to reach the "desired" condition.
And it enables Kubernetes to be "intent-based": match the resources with the users' "desire".
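To make the idea concrete, here is a toy, hedged sketch of what reconcile could look like for a replica count (State, start_pod and stop_pod are invented stand-ins, not the Kubernetes API):

from dataclasses import dataclass

@dataclass
class State:
    replicas: int

def start_pod():
    print("starting a pod")   # stand-in for the real API call

def stop_pod():
    print("stopping a pod")   # stand-in for the real API call

def reconcile(current: State, desired: State) -> None:
    # Converge the observed state toward the declared intent.
    while current.replicas < desired.replicas:
        start_pod()
        current.replicas += 1
    while current.replicas > desired.replicas:
        stop_pod()
        current.replicas -= 1
    # If they already match this is a no-op: reconciliation is idempotent.

reconcile(State(replicas=1), State(replicas=3))  # starts two pods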
New ideas in Software Engineering and Information Technology.
New ways of working.
Internet, hardware disaggregation, virtualization.
Decouple and isolate business logic from "infrastructural" stuff.
Use of "Dependency Inversion" for enabling pluggability.
A thing can have multiple contexts (bounded contexts), or multiple points of view.
For example, a "Customer" in a CRM can have some attributes relevant to the "Sales" department but not so much to the "Support" department.
A complex application can contain many areas of business logic.
Break the monolith to enable horizontal scaling.
"Sales" and "Support" can be separated and talk via an interface.
"Sales" and "Support" can be developed by different teams.
The application deployment is part of the application and should be easy. The Twelve-Factor App.
Migration from in-memory interfaces to Web/REST Application Programming Interfaces.
Concept of "Resource".
Concept of "Single Source of Truth".
Too much to say. I'll stop here. Just a mention of Team Topologies (2019): build teams around projects, not in functional silos.
Infrastructure as code.
Automation.
Cloud Computing.
Distributed systems are hard. Many things can go wrong; see Designing Data-Intensive Applications, chapter 8.
Distributed systems are built upon cheap but unreliable components.
Write a platform where lessons learned while implementing distributed systems become code that helps operate and monitor a system.
Possibility to plug in or extend the resources under control.
What is the techie stuff that made Kubernetes even possible?
Operating system capabilities to isolate resources.
Namespace | Isolates |
---|---|
Cgroup | Cgroup root directory |
IPC | System V IPC, POSIX message queues |
Network | Network devices, stacks, ports, etc. |
Mount | Mount points |
PID | Process IDs |
Time | Boot and monotonic clocks |
User | User and Group IDs |
UTS | Hostname and NIS domain name |
Docker automated all this and added a specific way to build images and handle process communication, but we can build our "containers" just with Linux.
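As a taste, a minimal sketch of a "container" built with nothing but Linux namespaces (assuming util-linux's unshare is available; the hostname is just an example):

# a shell in fresh UTS, PID and mount namespaces
sudo unshare --uts --pid --fork --mount-proc /bin/bash
# then, inside the new namespaces:
hostname my-container   # changes the hostname only in this UTS namespace
ps aux                  # the new PID namespace shows only our own processes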
Tools like Podman or Docker make running containers easy for everyone by abstracting the different Linux technologies used under the hood from the user.
Anyway, these tools do more than what we showed in the previous demo.
Linux Networking capabilities:
This time I will draw something with https://excalidraw.com/
For reference look at these videos:
So, is our v-net-0 Linux bridge the same thing as Docker's docker0?
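For context, this is roughly how such a bridge and a namespace can be wired together by hand (a sketch; the name red and the 192.168.15.0/24 subnet are arbitrary):

sudo ip netns add red                                        # a new network namespace
sudo ip link add v-net-0 type bridge                         # our "docker0-like" bridge
sudo ip link set v-net-0 up
sudo ip link add veth-red type veth peer name veth-red-br    # a virtual cable
sudo ip link set veth-red netns red                          # one end inside the namespace
sudo ip link set veth-red-br master v-net-0                  # the other end on the bridge
sudo ip link set veth-red-br up
sudo ip -n red addr add 192.168.15.1/24 dev veth-red
sudo ip -n red link set veth-red up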
Let us see another sketch on https://excalidraw.com/
For container-to-container communication, Docker can expose a service to the host, or to other containers, by mapping a port. But we have to be explicit and tell Docker to do so:
docker run --name nginx -p 8080:80 nginx
The container's internal port 80 is exposed as 8080 to the host, and to other containers.
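A quick check from the host (assuming nothing else is bound to 8080):

curl -s http://localhost:8080 | head -n 4   # should print the top of the nginx welcome page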
Docker has other ways to handle the host network.
In custom (user-defined) bridge mode, multiple containers share the same network and can reach each other directly by container name (Docker provides embedded DNS), so you can have a web API container and a database container on a custom bridge and expose just the API port to the host (security is enhanced).
docker network create mynetwork
docker run -it --rm --name=container-a --network=mynetwork busybox /bin/sh
docker run -it --rm --name=container-b --network=mynetwork busybox /bin/sh
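From one of the two shells, name resolution should just work, because user-defined networks come with Docker's embedded DNS:

# inside container-a
ping -c 1 container-b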
This is a special case of custom networking where one container joins the network of another container; here the containers really are in the same netns, so they can talk over localhost:port. This is similar to how a Pod works in Kubernetes.
docker run -it --rm --name=container-a busybox /bin/sh
docker run -it --rm --name=container-b --network=container:container-a busybox /bin/sh
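To verify they share the netns, compare the interfaces from both shells; the output is identical:

# inside container-a and inside container-b
ip a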
In "none" network mode the container gets only a loopback interface:
docker run --net=none --name busybox busybox ip a
Cross-host networking usually uses an overlay network, which builds a mesh between hosts and allocates a large block of IP addresses within that mesh. Docker's built-in overlay networking is provided by Swarm mode: when you join a host to a swarm, the Docker engine on each host handles communication and routing between the hosts.
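A hedged sketch of enabling it on a single host (the network name is arbitrary):

docker swarm init                                                # make this engine a (single-node) swarm manager
docker network create --driver overlay --attachable my-overlay   # --attachable lets plain `docker run` containers join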
OK, I said Docker donated runc to the Open Container Initiative, along with the runtime and image specifications. Then what happened?
The truth is that Kubernetes is made of multiple projects! Under the Linux Foundation, a specification effort started that produced:
Did I speak about pluggability at the beginning of this thing? Here is the result: we have two CRI reference implementations:
Both of them call runc (or some sort of fork of it) to launch containers.
But why the CNI? Because people realized that container networking as implemented in Linux kernel network namespaces, Docker, rkt, Mesos, Kubernetes, and so on always does the same things:
All the stuff needed for building the Kubernetes Network Model.
So they (the Linux Foundation and Kubernetes) created this application interface, and as I told you this enabled pluggability, so now we have tons of CNI-compliant network plugins, in some cases using different network protocols.
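For a flavor, a CNI network configuration is just a JSON file dropped in /etc/cni/net.d/; this sketch uses the reference bridge plugin (the name, bridge, and subnet are arbitrary):

{
  "cniVersion": "0.4.0",
  "name": "mynet",
  "type": "bridge",
  "bridge": "cni0",
  "isGateway": true,
  "ipMasq": true,
  "ipam": {
    "type": "host-local",
    "subnet": "10.22.0.0/16"
  }
}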
A logical network built on top of a physical one. In practice it is implemented by encapsulating an "inner" IP packet into an "outer" one. A trivial overlay is the IP-over-IP overlay. Real-life examples: VXLAN, IPsec, VPNs.
Bret Fisher is better than me; we sit on the shoulders of giants!
Since it's an aggregate of open source projects, it's not simple to install. Sometimes these projects break each other.
This is true for vanilla Kubernetes. It's very much the same as with Linux: you can choose a "Kubernetes distribution" from a provider that developed "installers" to simplify the installation, or look at managed versions from a cloud provider, like EKS, GKE, or AKS.
At the time of this writing.
https://github.com/keobox/kubernetes/tree/fix-containerd-issue
cd vagrant-provisioning
vagrant up
Note: I started browsing this Kubernetes book just after the first meeting and it seems nice, because it follows the same "white box" approach I'm following. The nice thing about the cluster built with this book, although it is minimal, is that it is an HA cluster:
Our cluster is not an HA cluster.
cd vagrant-provisioning
vagrant up
vagrant ssh kmaster
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
sudo systemctl status containerd
sudo systemctl status kubelet
kubectl get nodes
To use kubectl on the host machine, hence outside the cluster, you can copy the kubeconfig from the cluster.
mkdir -p ~/.kube
scp -P 2222 -i .vagrant/machines/kmaster/virtualbox/private_key vagrant@127.0.0.1:.kube/config ~/.kube/playground-config
export KUBECONFIG=~/.kube/playground-config
kubectl get nodes
kubectl get nodes -o wide
It is possible to add an INSECURE Kubernetes web console in just one line:
kubectl apply -f https://k8smastery.com/insecure-dashboard.yaml
This installs a NodePort service (more on this below), and it's possible to connect like this:
kubectl get svc dashboard
Note down the node port and connect with a browser to http://172.16.16.100:3XXXX, then press "skip".
kubectl run web --image=nginx
kubectl get pods -o wide
kubectl logs web
kubectl delete pod web
The Kubernetes engineers recognized that a cluster may have many use cases, so they created some abstractions to address these use cases: there are several kinds of workloads.
The ReplicaSet maintains the desired number of copies of a Pod running within the cluster. If a Pod or the host on which it’s running fails, Kubernetes launches a replacement.
A Deployment manages a ReplicaSet.
kubectl create deployment web --image=nginx --replicas=3
kubectl get all -n default
What happened? A walkthrough! The "Loop" in action!
kubectl get pods -o wide
kubectl delete pod web-xxx-yyy && kubectl get pods -w
kubectl scale deployment web --replicas=4 && kubectl get pods -w
kubectl scale deployment web --replicas=2 && kubectl get pods -w
kubectl get deployment web -o yaml > web.yaml
The ReplicaSet managed by the Deployment has 2 important parameters in the spec for a RollingUpdate: maxSurge, the maximum number (or percentage) of extra Pods that can be launched on the new version, and maxUnavailable, the maximum number (or percentage) of old-version Pods that can be unavailable during the update.
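In the manifest they look like this (a fragment of a Deployment spec; 25% is the Kubernetes default for both):

spec:
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 25%
      maxUnavailable: 25%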
Suppose we update to nginx version 1.23, but that tag does not exist, eh eh.
Check the image name in a pod
kubectl describe pod web-xxx-yyy
try the update
kubectl scale deployment web --replicas=10
kubectl set image deployments/web nginx=nginx:1.23 && kubectl get replicasets -w
kubectl rollout status deploy web
kubectl rollout undo deploy web
A DaemonSet runs one copy of the Pod on each node in the Kubernetes cluster. This workload model provides the flexibility to run daemon processes such as log management, monitoring, storage providers, or network providers that handle Pod networking for the cluster.
Calico, our CNI implementation, runs as a DaemonSet. This means we see one daemon per node, hence 3 Pods.
kubectl get daemonset -n kube-system calico-node
kubectl get pods -n kube-system -o wide | grep calico-node
A StatefulSet controller ensures that the Pods it manages have durable storage and a persistent identity. StatefulSets are appropriate for situations where Pods share a similar definition but need a unique identity, ordered deployment and scaling, and storage that persists across Pod rescheduling.
Since StatefulSets deal with storage, they depend on the presence of an implementation of the CSI (Container Storage Interface) installed in the cluster. The main resources for dealing with storage are PersistentVolumeClaim and PersistentVolume.
Differences with respect to normal (stateless) Deployments (a sketch follows):
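A minimal sketch of what such a StatefulSet could look like (not necessarily the sleep-set.yaml used later; the names and the longhorn storageClassName are assumptions): each Pod gets a stable name (sleep-0, sleep-1, ...) and its own PersistentVolumeClaim stamped out from volumeClaimTemplates.

apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: sleep
spec:
  serviceName: sleep            # headless Service providing the stable identity
  replicas: 2
  selector:
    matchLabels:
      app: sleep
  template:
    metadata:
      labels:
        app: sleep
    spec:
      containers:
      - name: sleep
        image: busybox
        command: ["sleep", "86400"]
        volumeMounts:
        - name: storagedir
          mountPath: /storagedir
  volumeClaimTemplates:         # one PVC per Pod, reattached on rescheduling
  - metadata:
      name: storagedir
    spec:
      accessModes: ["ReadWriteOnce"]
      storageClassName: longhorn
      resources:
        requests:
          storage: 100Mi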
Installation of the required packages on the nodes, plus the Longhorn installation itself.
Patch the servers:
sudo apt install -y nfs-common
sudo systemctl enable --now iscsid
Install Longhorn in the cluster:
curl -LO https://raw.githubusercontent.com/longhorn/longhorn/v1.2.4/deploy/longhorn.yaml
kubectl apply -f longhorn.yaml
vagrant ssh kmaster
cd examples/chapter-07/files
cat sleep-set.yaml
kubectl apply -f sleep-set.yaml
kubectl get statefulset
kubectl get service
kubectl exec sleep-0 -- /bin/sh -c 'hostname > /storagedir/myhost'
kubectl exec sleep-0 -- /bin/cat /storagedir/myhost
kubectl exec sleep-1 -- /bin/sh -c 'hostname > /storagedir/myhost'
kubectl exec sleep-1 -- /bin/cat /storagedir/myhost
Now if I delete a Pod, its replacement will get the same storage:
kubectl delete pod sleep-0
kubectl get pods -w
kubectl exec sleep-0 -- /bin/cat /storagedir/myhost
Clean up
vagrant ssh kmaster
kubectl delete -f examples/chapter-07/files/sleep-set.yaml
This type of Service (ClusterIP) is the default and lives on an IP that is only visible within the cluster. It enables cluster resources to reach one another via a known address while maintaining the security boundaries of the cluster itself.
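For example (assuming the web Deployment from earlier still exists), a ClusterIP Service is one command away:

kubectl expose deployment web --port=80   # creates a ClusterIP Service named "web"
kubectl get svc web                       # note the CLUSTER-IP: reachable only from inside the cluster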
Pingolo!
A "show me the code" moment, then...
cd ~/src/pingolo
kubectl apply -f kubernetes/deploy.yaml
kubectl get all -n pingolo
# try to connect to http://172.16.16.100
kubectl logs -n pingolo -l usecase=ping-from-pod
# try to ping something outside the cluster
kubectl apply -f shpod.yaml
kubectl get pods -n shpod
# try to ping the shpod
A way to expose applications outside the cluster.
The Ingress Controller is an application that runs in the cluster and configures an HTTP load balancer according to Ingress resources.
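An Ingress resource is a plain description of HTTP routing rules; a minimal hedged sketch (the host and the backend Service name are assumptions):

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: web
spec:
  ingressClassName: nginx        # handled by the NGINX Ingress Controller below
  rules:
  - host: web.example.com
    http:
      paths:
      - path: /
        pathType: Prefix
        backend:
          service:
            name: web            # route everything to the "web" Service, port 80
            port:
              number: 80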
From the Kubernetes book. NOT WORKING on my cluster, at least for ports 80 and 443, because I'd have to make tweaks to Calico that I'm not able to do, so that the load balancer can bind to ports 80 and 443.
curl -Lo ingress-controller.yaml https://raw.githubusercontent.com/kubernetes/ingress-nginx/controller-v1.1.1/deploy/static/provider/cloud/deploy.yaml
cp examples/setup/roles/k8s/templates/ingress-patch.yaml.j2 ingress-patch.yaml
kubectl apply -f ingress-controller.yaml
kubectl patch -n ingress-nginx service/ingress-nginx-controller --patch-file ingress-patch.yaml
There are many Kubernetes distributions out there. All of them have "opinions". Their basic function is to provide installers or applications that make the Kubernetes installation less painful than the vanilla one.