Senior System Reliability Engineer

24 hours ago


Pasay, National Capital Region, Philippines beBeeReliability Full time $130,000 - $180,000

Job Overview

This role is responsible for ensuring system reliability, developing observability strategies, leading incident response efforts, and driving continuous improvement across our platform.

We're seeking an experienced engineer to take ownership of monitoring, alerting, and incident management for production systems. You'll design data visualizations, build alerting pipelines, and use automation to optimize system performance while ensuring uptime.

The ideal candidate will have strong hands-on experience with monitoring & logging tools, scripting for automation, and running and improving incident response processes.

Key Responsibilities

  • Design and implement data visualization dashboards to ensure system reliability
  • Build alerting pipelines to detect anomalies and prevent issues
  • Use automation to optimize system performance and reduce toil
  • Lead incident response efforts to minimize downtime
  • Perform root cause analysis and document learnings to drive improvement

Requirements

  • Bachelor's degree in Computer Science, IT, or equivalent work experience
  • 5+ years in SRE, DevOps, or IT Operations roles
  • Strong hands-on experience with monitoring & logging tools (Prometheus, Grafana, OpenTelemetry)
  • Skilled in scripting for automation (Python, TypeScript, or Bash)
  • Experience running and improving incident response processes

About Our Team

Our team values reliability and innovation. We're committed to building a culture of observability, incident readiness, and proactive system improvement. As a collaborative team, we're looking for talented engineers who share our vision and are passionate about keeping systems stable and efficient.



  • Pasay, National Capital Region, Philippines beBeeEngineer Full time ₱800,000 - ₱1,200,000

    Job Description:Staff4Me is seeking an experienced DevOps Engineer with expertise in Grafana monitoring tools to join a dynamic team. In this role, the successful candidate will be responsible for implementing and maintaining monitoring solutions using Grafana while collaborating closely with development and operations teams to enhance system performance and...


  • Pasay, National Capital Region, Philippines Royal Caribbean Group Full time $90,000 - $120,000 per year

    The Senior Site Reliability Engineer (Senior SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The Senior SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of...


  • Pasay, National Capital Region, Philippines Royal Caribbean Group Full time $70,000 - $120,000 per year

    Position SummaryThe Site Reliability Engineer (Senior SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of...


  • Pasay, National Capital Region, Philippines beBeeReliability Full time ₱900,000 - ₱1,200,000

    Site reliability engineers are essential to ensuring the seamless operation of websites and applications. As a site reliability engineer, you will be responsible for the incident management, application performance, configuration management, and operational readiness of products within your ownership.The ideal candidate will have a deep understanding of IT...


  • Pasay, National Capital Region, Philippines Vestas Full time $104,000 - $130,878 per year

    Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We are seeking a Senior Systems Engineer committed to automation, monitoring, and asset management—someone who takes charge of what happens next and promotes continuous improvement in our digital landscape.This is a...


  • Pasay, National Capital Region, Philippines beBeeEngineer Full time ₱150,000 - ₱200,000

    Job Description:The role of Senior System Engineer is a highly technical position that requires expertise in Linux systems, hardware architectures, and open-source communities. We are seeking an experienced software engineer to join the Ubuntu Foundations Engineering team to maintain and enhance the Ubuntu bootloader stack, providing fast, reliable, and...


  • Pasay, National Capital Region, Philippines Royal Caribbean Group Full time $70,000 - $120,000 per year

    Position SummaryThe Lead Site Reliability Engineer (Lead SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The Lead SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial...


  • Pasay, National Capital Region, Philippines beBeesiteengineer Full time $150,000 - $200,000

    Job OverviewMaintain the reliability, performance, and scalability of our systems as a proactive engineer who automates processes, identifies bottlenecks, and implements best practices to prevent downtime and ensure high availability.


  • Pasay, National Capital Region, Philippines Private Advertiser Full time $40,000 - $80,000 per year

    Key Responsibilities:Resolve production issues promptly, adhering to formal escalation processes.Collaborate with business users and infrastructure teams to ensure timely issue resolution.Actively monitor the health of systems and services and execute recommended recovery steps as needed.Implement strategies to enhance application stability and...


  • Pasay, National Capital Region, Philippines beBeeReliability Full time ₱1,459,000 - ₱2,191,000

    System Reliability ExpertWe are seeking a skilled System Reliability Engineer to join our team.Job Description:Design, build, and maintain robust infrastructure to support healthcare applications.Automate deployment processes to enhance system performance and reduce downtime.Monitor and troubleshoot system issues to ensure high availability and...