
NVIDIA AI Cluster Runtime: Validated GPU Kubernetes Recipes
NVIDIA released AI Cluster Runtime, an open-source project providing validated, version-locked Kubernetes configurations for GPU infrastructure.
12 posts

Kubernetes 1.36 decouples scheduling policy from runtime instances with Workload API v1alpha2, standalone PodGroups, and a dedicated group scheduling cycle.

CNCF launched v1.0 of the Kubernetes AI Conformance Program defining baseline capabilities for running AI workloads across conformant clusters.

llm-d was accepted as a CNCF Sandbox project, providing Kubernetes-native distributed inference with KV-cache-aware routing, prefill/decode disaggregation, and accelerator-agnostic serving.

Production deployment patterns for NVIDIA Dynamo 1.0 on EKS and GKE — disaggregated serving, KV-aware routing, and gotchas from real deployments.

DRA went GA in Kubernetes v1.34 and continues evolving — replacing Device Plugins with richer semantics including DeviceClass, ResourceClaim, CEL-based filtering, and topology awareness.
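As a rough illustration of the richer semantics DRA brings, a ResourceClaim under the GA `resource.k8s.io/v1` API can select devices with a CEL expression; the device class name and memory attribute below are placeholders, not a real driver:

```yaml
# Hedged sketch: claim one GPU whose advertised memory is at least 40Gi.
# "gpu.example.com" and its capacity key are illustrative names.
apiVersion: resource.k8s.io/v1
kind: ResourceClaim
metadata:
  name: single-large-gpu
spec:
  devices:
    requests:
    - name: gpu
      exactly:
        deviceClassName: gpu.example.com   # DeviceClass published by the driver
        count: 1
        selectors:
        - cel:
            # CEL-based filtering on device attributes/capacity
            expression: device.capacity["gpu.example.com"].memory.compareTo(quantity("40Gi")) >= 0
```

A Pod then references the claim by name in `spec.resourceClaims`, and the scheduler allocates a matching device rather than an opaque extended resource.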

How Karpenter's groupless, pod-driven provisioning model solves the scaling limitations that plagued Kubernetes Cluster Autoscaler for years.

How Karpenter automatically replaces drifted nodes, consolidates underutilized capacity, and respects disruption budgets to keep clusters lean and current.
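To make the two Karpenter posts above concrete, here is a hedged sketch of a `karpenter.sh/v1` NodePool combining pod-driven provisioning requirements with consolidation and a disruption budget (instance types and the node class name are illustrative):

```yaml
# Hedged sketch: groupless GPU pool with consolidation and a disruption budget.
apiVersion: karpenter.sh/v1
kind: NodePool
metadata:
  name: gpu-pool
spec:
  template:
    spec:
      requirements:
      - key: karpenter.sh/capacity-type
        operator: In
        values: ["on-demand"]
      - key: node.kubernetes.io/instance-type
        operator: In
        values: ["g5.xlarge", "g5.2xlarge"]   # placeholder instance types
      nodeClassRef:
        group: karpenter.k8s.aws
        kind: EC2NodeClass
        name: default                          # assumed pre-existing node class
  limits:
    cpu: "1000"
  disruption:
    consolidationPolicy: WhenEmptyOrUnderutilized
    consolidateAfter: 1m
    budgets:
    - nodes: "10%"   # disrupt at most 10% of this pool's nodes at a time
```

Karpenter launches nodes sized to pending pods rather than scaling a fixed node group, then drifts, consolidates, and replaces them within the stated budget.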

How KEDA extends Kubernetes HPA with 65+ scalers, scale-to-zero, and a two-phase architecture for event-driven pod autoscaling.
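A minimal sketch of that model: a ScaledObject handles the 0↔1 transition itself and delegates 1↔N to an HPA it manages under the hood. The Deployment name, Prometheus address, and query are placeholders:

```yaml
# Hedged sketch: event-driven scaling of a Deployment, including scale-to-zero.
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: consumer-scaler
spec:
  scaleTargetRef:
    name: consumer            # assumed Deployment in the same namespace
  minReplicaCount: 0          # scale-to-zero when the trigger is idle
  maxReplicaCount: 20
  triggers:
  - type: prometheus          # one of KEDA's many scalers
    metadata:
      serverAddress: http://prometheus.monitoring:9090   # placeholder
      query: sum(rate(http_requests_total[2m]))          # placeholder
      threshold: "100"
```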

What KubeDojo is and what you'll find here: deep dives into real code, honest explorations of the Kubernetes ecosystem, and structured learning paths to master every certification.

A practical map of the five CNCF Kubernetes certifications — what each one covers, how exams work, and which path fits your career.

How kagent, Agent Sandbox, KEDA, and OPA/Kyverno form the production stack for agentic AI on Kubernetes.