TD SYNNEX
Ensure reliability, operability, and continuous improvement of TD SYNNEX enterprise platforms across hybrid cloud and on‑prem environments. Engineering‑driven operations focused on automation, Infrastructure‑as‑Code (IaC), observability, and toil reduction. Serve as the L3 escalation for complex incidents; continuously improve platform run posture and readiness for L1/L2 execution. Platform reliability (hybrid cloud + on‑prem): Own L3 reliability posture; define SLOs/KPIs; lead operability gates and production readiness; maintain runbooks/SOPs. Automation & IaC: Design/build operational automation (health checks, remediation workflows); develop Terraform/Ansible configurations; script with Python (preferred), PowerShell, and/or Bash; integrate with ITSM for auditable self‑service and controlled remediation. Incident/problem/RCA (L3): Lead diagnosis, stabilization, an...
Apply Now
Platform SRE
Description
Role purpose:
Core responsibilities: