Loading...

Senior Infrastructure & Monitoring Engineer

Enterprise Monitoring Expert
with 14+ Years of Experience

Enterprise Monitoring Experte
mit 14+ Jahren Erfahrung

Deep expertise across the full enterprise monitoring spectrum — Nagios, Icinga, SolarWinds, Check MK, and now Zabbix Certified Professional & Specialist. 30,000+ hosts, 200+ enterprise clients, large-scale migrations. Currently exploring Kubernetes, Rancher, and CI/CD through my homelab.

Monitoring Pro Homelab Explorer
DS
Dibesh Shrestha
Infrastructure & Monitoring Engineer
Wetzlar, Germany · Remote
200
+
Enterprise Clients
30,000
+
Monitored Hosts
46
+
Server Clusters
14
+
Years Experience
Scroll
dibesh@homelab:~

Running Systems

Real homelab infrastructure — click to explore

Homelab Architecture

Single Proxmox node powering two VMs — Docker services and Kubernetes cluster

Proxmox PVE 9.1 · Single Node 8 vCPUs · 24 GB RAM · ~960 GB Storage

homelab-docker

VM 400 · Ubuntu 24.04 · 10 GB RAM
30+ containers

Access & Security

Traefik · Cloudflare · Authentik · CrowdSec · Pihole

Observability

Graylog · Netdata · Uptime Kuma · PostgreSQL

Self-Hosted Apps

Immich · Jellyfin · Actual Budget · FinanzBlick

AI / Automation

Ollama · n8n · Telegram Bot

GitLab Runner

Executes CI/CD pipeline jobs

Management

Portainer · Homepage · Watchtower

cicd-automation

VM 500 · Debian 12 · k3s v1.34
Rancher 2.14

K8s Cluster (k3s)

Traefik ingress · cert-manager · Dashboard · metrics-server

zabbix-dev

Helm 7.0.12 · Terraform · GitLab CI

zabbix-staging

Helm 7.0.12 · Terraform · GitLab CI

zabbix-prod

Helm 7.0.12 · Terraform · GitLab CI

IaC Layer

Terraform v1.15 · Helm v3.20 · GitLab State

GitLab CI Pipelines

Plan (auto) → Apply (manual) · cicd-zabbix repo

Rancher System

Fleet · CAPI (rancher-turtles) · Webhook
Docker VM
Kubernetes VM
Data Flow

Technical Expertise

14+ years of deep enterprise monitoring expertise. Everything below the line is homelab exploration.

Zabbix / Enterprise Monitoring
14 yrs · 4 certs · 30K+ hosts
Docker & Containerization
~1 yr · 25+ services · Homelab
Kubernetes / Rancher
~1 yr · k3s v1.34 · Rancher 2.14 · CKA prep
Terraform / IaC
~6 mo · 33 pipelines · Homelab
CI/CD (GitLab CI)
~6 mo · 33 runs · Helm deploys
Python / FastAPI (AI-Assisted)
~6 mo · APIs & automation · AI
Detailed View

Migration & Implementation

Led enterprise-wide platform migrations with zero downtime. Cross-platform transitions serving 200+ enterprise clients. Nagios to Zabbix, legacy tool consolidation, and monitoring standardization.

Zero Downtime Cross-Platform 200+ Clients

Escalation & Incident Management

Trusted technical voice in high-pressure escalations. Coached Service Desk teams improving resolution times by 40%, handling 30+ daily critical incidents across enterprise environments.

Incident Response Team Coaching ITIL
Homelab Exploration
Homelab Exploration
~6–12 months · Homelab only · Learning by doing

Kubernetes & Rancher (k3s v1.34), CI/CD & GitLab, Terraform & IaC, Docker & containerisation — all built hands-on in the homelab. Not production-professional yet, actively building.

k3s v1.34 Rancher 2.14 GitLab CI Terraform Helm Docker Compose
View details
K8s · CI/CD · IaC
Docker · Roadmap
Homelab Projects 5 live projects

Learning by Building

Real infrastructure experiments — cicd-zabbix, monitoring-hub, Homelab Platform, AI Infrastructure Bot, FinanzBlick. Full docs with architecture, network, CI/CD pipeline and roadmap.

cicd-zabbix monitoring-hub Homelab Platform AI Infrastructure Bot FinanzBlick 30+ containers 360+ pipeline runs 19 subdomains
Explore
Architecture · Network
CI/CD · Roadmap

Professional Experience

14+ years of progressive experience in enterprise IT and monitoring infrastructure

CANCOM

Senior Monitoring Administrator · Full-time · Remote
Jan 2020 – Present
  • Key technical contributor in the migration of 16+ client environments to Zabbix in a managed service provider context
  • Managed monitoring infrastructure supporting 30,000+ hosts; technical owner of Nagios, Centreon, and Zabbix platforms
  • Direct escalation management for 200+ enterprise clients; coached Service Desk teams improving resolution times by 40%
  • Maintained platform stability during significant team reduction; took over and stabilized Centreon after team departure

Tröger IT Business Consulting GmbH

IT Consultant / Center of Excellence Monitoring · Full-time
Jul 2018 – Jan 2020
  • Consulting role focused on enterprise monitoring strategy and implementation across client environments
  • Center of Excellence: defined monitoring standards and best practices across the organization

VRG-Gruppe

IT System Engineer · Full-time · Oldenburg, Germany
Sep 2015 – Jun 2018
  • System monitoring and management of complex server farm infrastructure including maintenance
  • Networking and monitoring across heterogeneous infrastructure environments

Sector Nord AG

IT System Administrator · Greater Oldenburg Area
Sep 2012 – Aug 2015
  • Open source solutions for system and server monitoring with ticketing system integration
  • Foundation role where professional monitoring career began — Nagios, Linux administration, SNMP

Licenses & Certifications

Industry-recognized certifications validating expertise. Click to verify.

Zabbix Certified Specialist 7.0

Zabbix
Issued Dec 2024 CS-2412-151

Advanced Zabbix Security Administration

Zabbix
Issued Mar 2025 ZEX03-2503-004

Advanced Zabbix SNMP Monitoring 7.0

Zabbix
Issued Mar 2025 ZEX05-2503-018

ITIL Foundations

PeopleCert
Issued Oct 2018 GR750484175DS

AppDynamics Certified Implementer

AppDynamics
Issued Jun 2018 Foundation Workshop 4.4 & Kickstarter

Internet Cyber Security & Privacy Specialist

EC-Council
Issued 2019

Professional Reference

"

I had the pleasure of supervising Dibesh Shrestha for three years. He was responsible for our Nagios monitoring infrastructure, overseeing 32 customers and 9,000 hosts. His standout achievement was almost single-handedly leading the migration of this massive scope to Zabbix. He managed the entire process, taking full ownership of both the technical implementation and the communication with both internal stakeholders and directly with the customers, all while mastering Zabbix on the fly under significant time pressure.

When a colleague left, Dibesh stepped up to finalize the Centreon migration, successfully delivering it within a strict deadline despite having no prior background in that environment. This demonstrates his natural ability to adapt quickly and effectively to new challenges.

Beyond his technical skillset, Dibesh is a fantastic team player who builds a positive, friendly atmosphere. He is highly client-oriented and masters the balance of being assertive with stakeholders to get things done. He is a versatile and impact-driven employee, and any team would be lucky to have him.

Richard Germanus Former Direct Manager, CANCOM View full recommendation on LinkedIn

Get In Touch

Copied to clipboard!