Cagatay Ornek:
🧘♂️ Mindset
Core principles we share as a team and won’t change overtime. These principles will be applied all the time.
- Openness: honesty, transparency, trust
- Blameless culture
- Continuous improvement
- Proactivity
💙 Purpose
Reason behind your goals.
- Increasing the reliability
- Fixing any incidents
- Creating WoW effect to the customers
🏈 Goals
The goals for the whole team.
- Ensure customer’s services are running as expected
- Spending less time on chore operation and more on development
- Get the observability maturity to a level which customer needs to be
- Prevent incidents before they even happen
💪 Strength & Assets
Things that will move the team forward.
- Troubleshooting capability
- AWS knowledge
- Kubernetes experience
- Infrastructure as a code experience, especially Terragrunt, Terraform
- Ability to collaborate kloia-wide
🐾 Weaknesses & Risks
Things that will drag you back from your goals and purpose.
- Monitoring components like databases, queues and administrating their configuration
- Internal plannings(?)
- Develop customer journey from onboarding, assessment, incident management, until offboarding
- Onprem services: VMWare, Data center networking
- Storage engine: EBS, EFS, Longhorn :)
- Cloud financial management
- Security awareness: DDoS Attack, Blue team defense
- Missing point of contact kloia-wide, map of expertise and whom
📐 Rules&Activities
How are you going to communicate, make decisions, execute and give feedback.
- Share postmortem in 1 work day after the incident
- Reply customer back in the email in the response time SLA
- Utilize OpsGenie for incident escalation
- Eliminate toils by resolving the root cause or automating the response actions
- Act pro-actively and continuously improve customer system
- Write updates asynchronously on Notion Journal
🧃 Needs & Expectations
The needs and expectations from the team.
- Clear definition between SRE tasks and project tasks
- Fair oncall rotation policy
- Oncall engineers checklist
- As an L1 engineer, we should know our backups and L2 engineers
👥 People & Roles
- Tunahan Dursun: SRE
- Ariq Fadlan: SRE
- Caner Türkaslan: SRE
- Çağatay Çiftçi: SRE