
Site Reliability Lead
2 weeks ago
Join to apply for the Site Reliability Lead role at White Cloak Technologies, Inc.
Job Description- Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations.
- Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity.
- Conduct regular coaching, performance reviews, and skills development planning.
- Oversee workload distribution, escalation protocols, and incident ownership across the team.
- Champion a culture of documentation, knowledge sharing, and operational discipline.
- Own the architecture and lifecycle of monitoring, alerting, and logging systems.
- Ensure early detection, triage, and escalation of service degradation based on SLAs.
- Lead major incident response, root cause analysis (RCA), and postmortem documentation.
- Review and approve SOPs, runbooks, and playbooks created by the team.
- Analyze incident trends and drive systemic fixes to reduce recurrence and improve MTTR.
- Work closely with DevOps, Infrastructure, QA, and Development teams to improve deployment readiness and system resilience.
- Represent the SRE function in planning meetings, audits, and compliance reviews.
- Collaborate with ITSM teams to align incident, problem, and change management processes.
- Proven leadership experience in managing technical operations or SRE teams.
- Strong command of ITSM platforms (e.g., ServiceNow, Jira Service Management).
- Deep understanding of monitoring tools (e.g., Prometheus, Grafana, ELK, Datadog).
- Familiarity with ITIL principles and regulatory frameworks (e.g., BSP, PDIC, ISO27001).
- Expertise in incident response, escalation protocols, and RCA methodologies.
- Excellent communication and stakeholder management skills.
- Ability to synthesize operational data into actionable insights and team strategies.
- Bachelor's degree in Computer Science, Information Technology, Electronics Engineering, or equivalent.
- 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.
- 2+ years in a leadership or team management capacity.
- Hands-on experience with cloud platforms (AWS, GCP, Azure).
- Knowledgeable in scripting (Python, Bash) and Linux systems.
- Experience in fintech, banking, or SaaS environments with high availability SLAs.
-
Site Reliability Lead
4 weeks ago
Pasig, Philippines White Cloak Technologies Full timeJob Description Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations. Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity. Conduct regular coaching, performance reviews, and skills development planning. Oversee workload distribution,...
-
Site Reliability Lead
7 days ago
Pasig, National Capital Region, Philippines White Cloak Technologies, Inc. Full time ₱200,000 - ₱1,200,000 per yearJob DescriptionLead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations.Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity.Conduct regular coaching, performance reviews, and skills development planning.Oversee workload distribution, escalation...
-
Lead Site Reliability Engineer
6 days ago
Pasig, Philippines TaoCrowd Full timeOverview TaoCrowd – Pasig, National Capital Region, Philippines Lead Site Reliability Engineer We are looking for a Lead Site Reliability Engineer to drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms. In this role, you’ll lead reliability efforts,...
-
Lead Site Reliability Engineer
1 week ago
Pasig, National Capital Region, Philippines TaoCrowd Full time ₱1,200,000 - ₱2,400,000 per yearWe are looking for a Lead Site Reliability Engineerto drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms such as Client websites. In this role, you'll lead reliability efforts, guide a growing SRE team, and help shape the systems that power our digital...
-
Site Reliability Engineer
2 weeks ago
Pasig, Philippines Jollibee Group Full timeOverview The Site Reliability Engineer will enable fast, safe change while keeping systems reliable and performant for customers. The role defines and manages service level indicators and objectives, treats reliability as a product feature, and uses software engineering to eliminate toil, improve incident and problem management, and strengthen change and...
-
Site Reliability Engineer Night Shift
1 week ago
Pasig, National Capital Region, Philippines August 99 Full time ₱1,200,000 - ₱2,400,000 per yearOverviewSite Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...
-
Site Reliability Engineer Night Shift
4 days ago
Pasig, Philippines August 99 Full timeOverview Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance. Key Responsibilities...
-
Site Reliability Engineer Night Shift
1 week ago
Pasig, National Capital Region, Philippines August 99, Inc. Full time ₱600,000 - ₱1,200,000 per yearOverview:Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...
-
Site Reliability Engineer
1 week ago
Pasig, National Capital Region, Philippines Seven Seven Global Services, Inc. Full time ₱1,200,000 - ₱2,400,000 per yearWork Location: Ortigas, Pasig CityShift Schedule: Day ShiftWork Setup: Hybrid (3-4x a week onsite)Job Description:Handle service monitoring, incident response, and drive technical support efficiencyResponsible for managing and maintaining network monitoring tools, systems, and processes that ensure the availability, scalability, and performance of our...
-
Reliability Engineer
3 weeks ago
Pasig, Philippines Buscojobs Full timeSr Site Reliability Engineer (Project based) Location: 1226 Makati City, National Capital Region | iScale Solutions Posted 16 days ago Job Description This is a remote position. Core Expertise SRE Foundations & Practices Deep understanding of SRE principles (SLIs, SLOs, error budgets, toil reduction, reliability vs. velocity trade-offs). Proven experience...