Michael Page
Designing, implementing, and maintaining infrastructure automation using Python/Bash scripting and infrastructure-as-code tools Managing and optimizing Kubernetes clusters and containerized applications in Linux environments Creating and enhancing monitoring systems to ensure high availability and performance of critical services Developing automated solutions for incident response, capacity planning, and system recovery Collaborating with development teams to improve application reliability and scalability Participating in on-call rotations to provide L4 support, including potential weekend coverage when required Count with minimum 3 years of experience in the Role. Good to have OpenStack Knowledge Strong experience with Linux systems administration and troubleshooting Pr...
Apply Now
Site Reliability Engineer (SRE)
Description
Descripción
In this role you will play a key role in:
Perfil buscado (h/m)