Certified Kubernetes Administrator - Domain 2: Troubleshooting (Cluster)

0h 16m video Published Sep 13, 2025 Transcribed Jun 9, 2026 Cyber Secure

Cyber Secure

Intermediate 8 min read For: CKA candidates and Kubernetes administrators looking to troubleshoot cluster components.

AI Trust Score 90/100

✅ Highly Legit

"Title accurately describes the content: a focused troubleshooting guide for CKA Domain 2."

AI Summary

This video provides a focused troubleshooting checklist for CKA Domain 2, covering cluster architecture, core components, and practical commands to diagnose issues. It walks through the control plane and worker node components, explaining their roles and how to check them when things go wrong.

Chapters

1 Cluster Architecture Overview 00:00 2 Kubelet and CRI 01:43 3 kube-proxy 05:56 4 API Server 06:56 5 Controller Manager 11:38 6 etcd and Scheduler 14:01 7 Wrap-Up and Summary 15:29

[01:02]

Cluster Architecture Overview

A Kubernetes cluster consists of a control plane (decision-making) and worker nodes (execution). Troubleshooting starts by identifying whether the problem is with the control plane or worker nodes.

[01:43]

Kubelet Role

Kubelet runs on every node, reports node status, manages containers via CRI, enforces pod specs, collects resource usage, and invokes network/storage plugins. Check its pods and endpoints when pods fail or nodes are unhealthy.

[02:46]

Kubelet Security Demo

Demonstrates disabling anonymous authentication and read-only port in kubelet config. After changes, direct pod access and metrics are blocked, enhancing security.

[04:58]

Container Runtime Interface (CRI)

CRI makes kubelet runtime-agnostic, supporting containerd, CRI-O, etc. Use crictl to list/inspect containers, check logs, and exec into containers for troubleshooting.

[06:10]

kube-proxy

Runs on each node as a DaemonSet, implements networking rules for services. Check kube-proxy logs and local forwarding rules if services are unreachable.

[06:56]

API Server

Front door of the cluster; authenticates, authorizes, validates requests, runs admission controllers, and persists state to etcd. Check logs and static manifests if kubectl fails.

[07:57]

API Server Troubleshooting Demo

Three misconfigurations found: semicolon in manifest, unknown flag, wrong etcd port (23000 vs 2379). Fixed by editing manifest and correcting flags/ports.

[11:38]

Controller Manager

Runs controllers to ensure actual state matches desired state. If resources aren't reconciling, check controller manager logs and watch cycles.

[12:32]

Controller Manager Troubleshooting Demo

Found unknown flag 'sidecar-insertion' in kube-controller-manager config. Removed the line to fix the issue.

[14:01]

etcd

Persistent store for all cluster objects. If unhealthy, control plane loses memory. Check cluster membership, health, and write success rates.

[14:44]

Scheduler

Selects best node for each pod. If pods are pending, check scheduler logs and conditions to see why filters/ranking failed.

Effective troubleshooting requires determining whether the issue is at the control plane or node level, then examining component-specific configuration files, logs, and ports. Understanding each component's role and interaction is key to resolving cluster problems.

Mentioned in this Video

crictl

tool

kubectl

tool

Tutorial Checklist

1 02:46 Check kubelet service and configuration file location.

2 03:16 Access kubelet config file and verify anonymous authentication and read-only port settings.

3 03:50 Test direct pod access via curl to confirm unauthenticated access.

4 04:12 Set anonymous authentication to false, authorization to webhook, and read-only port to 0.

5 04:31 Restart kubelet and verify that direct pod access and metrics are blocked.

6 07:57 When kubectl is unresponsive, check if API server container is running using crictl.

7 08:28 Check API server logs and manifest file for errors (e.g., semicolon, unknown flags, wrong port).

8 09:02 Fix manifest errors and restart API server container.

9 12:32 Check controller manager pod logs for unknown flags (e.g., sidecar-insertion).

10 13:32 Remove erroneous flag from controller manager config file and verify pod restarts.

Study Flashcards (11)

What are the two main parts of a Kubernetes cluster?

easy Click to reveal answer

Control plane (decision-making) and worker nodes (execution).