
Site Reliability Engineer
2 weeks ago
Overview
The Site Reliability Engineer will enable fast, safe change while keeping systems reliable and performant for customers. The role defines and manages service level indicators and objectives, treats reliability as a product feature, and uses software engineering to eliminate toil, improve incident and problem management, and strengthen change and release practices.
ResponsibilitiesReliability and Service Level Objective (SLO) Management
- Define, track, and report Service Level Indicators (SLI), SLOs, and error tracking for critical services. Partner with product and platform owners to align reliability goals with customer impact.
- SLO attainment meets or exceeds targets. Errors are tracked and used to guide release pace. Weekly reliability status and trends are shared with stakeholders.
Incident Response and Problem Management
- Participate in on-call. Lead or support incident triage, mitigation, and blameless post-incident reviews. Drive corrective actions to prevent recurrence.
- Mean time to recovery (MTTR) and time to detect improve quarter over quarter. Postmortems are completed for priority incidents with corrective actions closed within agreed SLAs.
Observability, Performance, and Capacity
- Build actionable monitoring, alerting, and dashboards. Establish performance baselines, run load and chaos tests in preproduction, and plan capacity and disaster recovery (DR).
- High signal-to-noise alerting with low false positives. Performance and DR tests pass before releases. Capacity stays within targets with no resource-related outages.
Agile Collaboration and Communication
- Work in agile ceremonies. Share reliability insights, codify runbooks, and mentor peers to spread SRE practices across teams.
- Consistent participation in stand-ups and planning. Runbooks updated each sprint. Measurable uplift in team adoption of SRE practices and tooling.
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Relevant cloud and SRE certifications are a plus especially with AWS.
- Minimum of 5 years in SRE, DevOps, platform engineering, or production operations. Proven work with SLOs and SLIs, incident response, observability, and automation. Experience operating services in cloud and containerized environments, specifically Amazon Web Services (AWS).
- Must be willing to work in Ortigas, Pasig (Hybrid Work Setup).
- Mid-Senior level
- Full-time
- Information Technology
- Industries
- Food and Beverage Retail
-
Lead Site Reliability Engineer
6 days ago
Pasig, Philippines TaoCrowd Full timeOverview TaoCrowd – Pasig, National Capital Region, Philippines Lead Site Reliability Engineer We are looking for a Lead Site Reliability Engineer to drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms. In this role, you’ll lead reliability efforts,...
-
Site Reliability Lead
2 weeks ago
Pasig, Philippines White Cloak Technologies, Inc. Full timeJoin to apply for the Site Reliability Lead role at White Cloak Technologies, Inc. Job Description Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations. Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity. Conduct regular coaching,...
-
Lead Site Reliability Engineer
1 week ago
Pasig, National Capital Region, Philippines TaoCrowd Full time ₱1,200,000 - ₱2,400,000 per yearWe are looking for a Lead Site Reliability Engineerto drive the stability, performance, and security of our infrastructure—spanning internal developer tools like GitLab to high-traffic, client-facing platforms such as Client websites. In this role, you'll lead reliability efforts, guide a growing SRE team, and help shape the systems that power our digital...
-
Site Reliability Engineer
1 week ago
Pasig, National Capital Region, Philippines Seven Seven Global Services, Inc. Full time ₱1,200,000 - ₱2,400,000 per yearWork Location: Ortigas, Pasig CityShift Schedule: Day ShiftWork Setup: Hybrid (3-4x a week onsite)Job Description:Handle service monitoring, incident response, and drive technical support efficiencyResponsible for managing and maintaining network monitoring tools, systems, and processes that ensure the availability, scalability, and performance of our...
-
Site Reliability Lead
4 weeks ago
Pasig, Philippines White Cloak Technologies Full timeJob Description Lead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations. Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity. Conduct regular coaching, performance reviews, and skills development planning. Oversee workload distribution,...
-
Site Reliability Lead
7 days ago
Pasig, National Capital Region, Philippines White Cloak Technologies, Inc. Full time ₱200,000 - ₱1,200,000 per yearJob DescriptionLead, mentor, and manage a team of Site Reliability Engineers, ensuring coverage across shifts and on-call rotations.Define team goals, KPIs, and performance metrics aligned with service reliability and business continuity.Conduct regular coaching, performance reviews, and skills development planning.Oversee workload distribution, escalation...
-
Site Reliability Engineer Night Shift
1 week ago
Pasig, National Capital Region, Philippines August 99 Full time ₱1,200,000 - ₱2,400,000 per yearOverviewSite Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...
-
Site Reliability Engineer Night Shift
4 days ago
Pasig, Philippines August 99 Full timeOverview Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services have reliability and uptime appropriate to users' needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance. Key Responsibilities...
-
Reliability Engineer
3 weeks ago
Pasig, Philippines Buscojobs Full timeSr Site Reliability Engineer (Project based) Location: 1226 Makati City, National Capital Region | iScale Solutions Posted 16 days ago Job Description This is a remote position. Core Expertise SRE Foundations & Practices Deep understanding of SRE principles (SLIs, SLOs, error budgets, toil reduction, reliability vs. velocity trade-offs). Proven experience...
-
Site Reliability Engineer Night Shift
1 week ago
Pasig, National Capital Region, Philippines August 99, Inc. Full time ₱600,000 - ₱1,200,000 per yearOverview:Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run production systems. SRE ensures that our services—both our internally critical and our externally-visible systems, e.g. GitLab/developer tooling and hosted client sites for the company—have reliability and uptime...