Site Reliability Lead

2 weeks ago


Pasig, Philippines White Cloak Technologies, Inc. Full time

Join to apply for the Site Reliability Lead role at White Cloak Technologies, Inc.

Job Description
  • Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations.
  • Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity.
  • Conduct regular coaching, performance reviews, and skills development planning.
  • Oversee workload distribution, escalation protocols, and incident ownership across the team.
  • Champion a culture of documentation, knowledge sharing, and operational discipline.
Key Responsibilities
  • Own the architecture and lifecycle of monitoring, alerting, and logging systems.
  • Ensure early detection, triage, and escalation of service degradation based on SLAs.
  • Lead major incident response, root cause analysis (RCA), and postmortem documentation.
  • Review and approve SOPs, runbooks, and playbooks created by the team.
  • Analyze incident trends and drive systemic fixes to reduce recurrence and improve MTTR.
  • Work closely with DevOps, Infrastructure, QA, and Development teams to improve deployment readiness and system resilience.
  • Represent the SRE function in planning meetings, audits, and compliance reviews.
  • Collaborate with ITSM teams to align incident, problem, and change management processes.
Skills And Competencies
  • Proven leadership experience in managing technical operations or SRE teams.
  • Strong command of ITSM platforms (e.g., ServiceNow, Jira Service Management).
  • Deep understanding of monitoring tools (e.g., Prometheus, Grafana, ELK, Datadog).
  • Familiarity with ITIL principles and regulatory frameworks (e.g., BSP, PDIC, ISO27001).
  • Expertise in incident response, escalation protocols, and RCA methodologies.
  • Excellent communication and stakeholder management skills.
  • Ability to synthesize operational data into actionable insights and team strategies.
Qualifications And Experience
  • Bachelor's degree in Computer Science, Information Technology, Electronics Engineering, or equivalent.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.
  • 2+ years in a leadership or team management capacity.
  • Hands-on experience with cloud platforms (AWS, GCP, Azure).
  • Knowledgeable in scripting (Python, Bash) and Linux systems.
  • Experience in fintech, banking, or SaaS environments with high availability SLAs.
#J-18808-Ljbffr
  • Site Reliability Lead

    4 weeks ago


    Pasig, Philippines White Cloak Technologies Full time

    Job Description Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations. Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity. Conduct regular coaching, performance reviews, and skills development planning. Oversee workload distribution,...


  • Pasig, National Capital Region, Philippines White Cloak Technologies, Inc. Full time ₱200,000 - ₱1,200,000 per year

    Job DescriptionLead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations.Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity.Conduct regular coaching, performance reviews, and skills development planning.Oversee workload distribution, escalation...


  • Pasig, Philippines TaoCrowd Full time

    Overview TaoCrowd – Pasig, National Capital Region, Philippines Lead Site Reliability Engineer We are looking for a Lead Site Reliability Engineer to drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms. In this role, you’ll lead reliability efforts,...


  • Pasig, National Capital Region, Philippines TaoCrowd Full time ₱1,200,000 - ₱2,400,000 per year

    We are looking for a Lead Site Reliability Engineerto drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms such as Client websites. In this role, you'll lead reliability efforts, guide a growing SRE team, and help shape the systems that power our digital...


  • Pasig, Philippines Jollibee Group Full time

    Overview The Site Reliability Engineer will enable fast, safe change while keeping systems reliable and performant for customers. The role defines and manages service level indicators and objectives, treats reliability as a product feature, and uses software engineering to eliminate toil, improve incident and problem management, and strengthen change and...


  • Pasig, National Capital Region, Philippines August 99 Full time ₱1,200,000 - ₱2,400,000 per year

    OverviewSite Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...


  • Pasig, Philippines August 99 Full time

    Overview Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance. Key Responsibilities...


  • Pasig, National Capital Region, Philippines August 99, Inc. Full time ₱600,000 - ₱1,200,000 per year

    Overview:Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...


  • Pasig, National Capital Region, Philippines Seven Seven Global Services, Inc. Full time ₱1,200,000 - ₱2,400,000 per year

    Work Location: Ortigas, Pasig CityShift Schedule: Day ShiftWork Setup: Hybrid (3-4x a week onsite)Job Description:Handle service monitoring, incident response, and drive technical support efficiencyResponsible for managing and maintaining network monitoring tools, systems, and processes that ensure the availability, scalability, and performance of our...

  • Reliability Engineer

    3 weeks ago


    Pasig, Philippines Buscojobs Full time

    Sr Site Reliability Engineer (Project based) Location: 1226 Makati City, National Capital Region | iScale Solutions Posted 16 days ago Job Description This is a remote position. Core Expertise SRE Foundations & Practices Deep understanding of SRE principles (SLIs, SLOs, error budgets, toil reduction, reliability vs. velocity trade-offs). Proven experience...