Cagatay Ornek:


🧘‍♂️ Mindset

Core principles we share as a team and won’t change overtime. These principles will be applied all the time.

  • Openness: honesty, transparency, trust
  • Blameless culture
  • Continuous improvement
  • Proactivity

💙 Purpose

Reason behind your goals.

  • Increasing the reliability
  • Fixing any incidents
  • Creating WoW effect to the customers

🏈 Goals

The goals for the whole team.

  • Ensure customer’s services are running as expected
  • Spending less time on chore operation and more on development
  • Get the observability maturity to a level which customer needs to be
  • Prevent incidents before they even happen

💪 Strength & Assets

Things that will move the team forward.

  • Troubleshooting capability
  • AWS knowledge
  • Kubernetes experience
  • Infrastructure as a code experience, especially Terragrunt, Terraform
  • Ability to collaborate kloia-wide

🐾 Weaknesses & Risks

Things that will drag you back from your goals and purpose.

  • Monitoring components like databases, queues and administrating their configuration
  • Internal plannings(?)
  • Develop customer journey from onboarding, assessment, incident management, until offboarding
  • Onprem services: VMWare, Data center networking
  • Storage engine: EBS, EFS, Longhorn :)
  • Cloud financial management
  • Security awareness: DDoS Attack, Blue team defense
  • Missing point of contact kloia-wide, map of expertise and whom

📐 Rules&Activities

How are you going to communicate, make decisions, execute and give feedback.

  • Share postmortem in 1 work day after the incident
  • Reply customer back in the email in the response time SLA
  • Utilize OpsGenie for incident escalation
  • Eliminate toils by resolving the root cause or automating the response actions
  • Act pro-actively and continuously improve customer system
  • Write updates asynchronously on Notion Journal

🧃 Needs & Expectations

The needs and expectations from the team.

  • Clear definition between SRE tasks and project tasks
  • Fair oncall rotation policy
  • Oncall engineers checklist
  • As an L1 engineer, we should know our backups and L2 engineers

👥 People & Roles

  • Tunahan Dursun: SRE
  • Ariq Fadlan: SRE
  • Caner Türkaslan: SRE
  • Çağatay Çiftçi: SRE