DevOps/SRE Interview Landscape

Your 90-Day DevOps/SRE Study Plan

4 min read

A structured 90-day plan to transform you from "preparing" to "interview-ready." Adjust based on your current level and target companies.

Assessment: Where Are You Starting?

Before diving in, honestly assess your current level:

Area Beginner Intermediate Advanced
Linux Basic commands Process/memory mgmt Kernel tuning
Networking TCP vs UDP DNS, load balancing BGP, network design
CI/CD Used Jenkins/GHA Built pipelines Designed for scale
Containers Docker basics K8s deployments Operators, custom controllers
Monitoring Used Grafana Built dashboards Designed observability
Coding Bash scripts Python/Go fluent Distributed systems

Phase 1: Foundations (Days 1-30)

Goal: Solidify fundamentals that appear in EVERY interview

Week 1-2: Linux Deep Dive

  • Process lifecycle (fork, exec, zombie processes)
  • Memory management (virtual memory, OOM killer)
  • File systems (inodes, permissions, types)
  • System calls (strace, ltrace)
  • Practice: Set up a Linux VM, break it, fix it

Week 3-4: Networking Essentials

  • OSI model (focus on layers 3, 4, 7)
  • TCP handshake, congestion control
  • DNS resolution, caching, TTL
  • Load balancing algorithms
  • Practice: tcpdump, Wireshark packet analysis

Daily routine (2-3 hours):

30 min: Read documentation/books
60 min: Hands-on lab practice
30 min: Review and note-taking
30 min: Mock troubleshooting scenarios

Phase 2: Tools and Systems (Days 31-60)

Goal: Master the tools you'll discuss in interviews

Week 5-6: CI/CD and IaC

  • Git advanced (rebase, cherry-pick, bisect)
  • GitHub Actions / GitLab CI deep dive
  • Terraform state, modules, workspaces
  • Ansible playbooks and roles
  • Practice: Build a complete CI/CD pipeline

Week 7-8: Containers and Orchestration

  • Docker internals (namespaces, cgroups)
  • Kubernetes architecture deep dive
  • Pod lifecycle, scheduling, affinity
  • Services, Ingress, NetworkPolicies
  • Practice: Deploy a multi-tier app on K8s

Daily routine (3 hours):

45 min: Tool documentation and tutorials
90 min: Build real projects
45 min: Coding practice (LeetCode Easy-Medium)

Phase 3: Production Skills (Days 61-80)

Goal: Develop SRE-specific expertise

Week 9-10: Monitoring and Observability

  • Metrics (Prometheus, Grafana)
  • Logging (ELK, Loki)
  • Tracing (Jaeger, OpenTelemetry)
  • SLOs, SLIs, Error Budgets
  • Practice: Instrument an application end-to-end

Week 11-12: Incident Response

  • On-call best practices
  • Incident management frameworks
  • Postmortem writing
  • Chaos engineering basics
  • Practice: Simulate and respond to incidents

Phase 4: Interview Sprint (Days 81-90)

Goal: Peak performance for interviews

Week 13: Mock Interviews

  • 2 coding mock interviews
  • 2 system design mock interviews
  • 2 behavioral mock interviews
  • Review and iterate on feedback

Week 14: Final Preparation

  • Company-specific research
  • Review all notes and weak areas
  • Rest and mental preparation
  • Logistics (test setup, quiet space)

Study Resources

Books (pick 1-2):

  • "Site Reliability Engineering" (Google SRE Book) - Free online
  • "The DevOps Handbook"
  • "Designing Data-Intensive Applications"

Courses:

  • Linux Foundation: CKA/CKAD prep
  • Cloud provider certifications (AWS SA, GCP)

Practice platforms:

  • Kubernetes: killer.sh, KodeKloud
  • Linux: OverTheWire, LinuxJourney
  • Coding: LeetCode (Easy-Medium), HackerRank

Weekly Progress Tracker

Week Focus Hours Checkpoint
1-2 Linux 20-25 Can explain process lifecycle
3-4 Networking 20-25 Can troubleshoot DNS issues
5-6 CI/CD, IaC 25-30 Built working pipeline
7-8 Kubernetes 25-30 Deployed multi-tier app
9-10 Monitoring 25-30 Full observability stack
11-12 Incident Mgmt 20-25 Wrote practice postmortems
13-14 Mock Interviews 15-20 Completed 6 mocks

Total time investment: ~200 hours over 90 days

Adjustments by Current Level

If you're a beginner (0-2 YoE):

  • Extend Phase 1 to 6 weeks
  • Focus more on fundamentals
  • Target Tier 3-4 companies first

If you're intermediate (3-5 YoE):

  • Follow the plan as-is
  • Add more system design practice
  • Target Tier 2-3 companies

If you're senior (5+ YoE):

  • Compress Phase 1 to 2 weeks
  • Heavy focus on design and behavioral
  • Target Tier 1-2 companies

Ready to begin? The next module covers Linux and Networking fundamentals—the foundation of every DevOps/SRE interview. :::

Quiz

Module 1: DevOps/SRE Interview Landscape

Take Quiz