Principal Site Reliability Engineer

Principal Site Reliability Engineer

Fourth

Sofia, Bulgaria

About you

You will exhibit both strategic and tactical sensibilities. You are adept at mapping the bigger picture into smaller, yet valuable and achievable chunks. You have excellent written and verbal communication skills, allowing you to work effectively with our worldwide development teams and the SRE community to select the right patterns and practices. You understand the importance of standardisation of technology and practices and have experience of implementing these in a consistent and low-maintenance fashion. You believe useful and relevant documentation is an essential output of your work.

The role

  • Design and implement scalable, highly available, and resilient infrastructure solutions;
  • Working with Product Owners and development teams to elicit and define the needs and requirements for products being migrated to cloud;
  • Innovate and enhance tooling for monitoring, alerting, incident management, and automation;
  • Develop and maintain comprehensive documentation for infrastructure and processes, ensuring compliance with industry standards;
  • Actively participate in incident management, driving rapid recovery, root cause analysis, and continuous improvement;
  • Identify and implement strategies to future-proof infrastructure for growth and increased demand;
  • Assist in the career development of colleagues, acting as a role model and encouraging best practice.

The ideal candidate

  • Bachelor’s degree in computer science, Engineering, or a related field;
  • 7+ years of experience in Site Reliability Engineering, Platform Engineering, DevOps, or a similar role, with experience mentoring others;
  • Proven track record of designing, implementing, and managing large-scale, highly available, and scalable infrastructure;
  • Relevant and recent experience with our main tech stack: Terraform, Configuration Management (Chef ideally, but will consider Ansible or Puppet), Kubernetes (cloud based), Docker (Kubernetes or AWS ECS Fargate);
  • Extensive cloud experience (ideally Azure and AWS);
  • Programming, scripting skills and OOP principles in general;
  • Strong analytical and problem-solving skills;
  • Outstanding communication skills, with the ability to convey complex technical concepts to non-technical stakeholders;
  • Ability to work in a fast-paced, dynamic environment, managing multiple priorities;
  • Experience of working in Agile or Kanban;
  • A natural and positive team player.

Apply Now

Don't forget to mention EuroTechJobs when applying.

Share this Job

More Job Searches

Bulgaria      Hardware and Telecoms      System Administrator and DevOps      Fourth     

© EuroJobsites 2024