Site Reliability Engineer

1 day ago


Manila, National Capital Region, Philippines Nezda Global Full time

About the Role

As an
SRE SME
, you'll design, implement, and evangelize modern SRE and AIOps frameworks — ensuring systems are reliable, scalable, and intelligent. You'll collaborate across infrastructure, development, and leadership teams to embed observability and reliability at scale.

Key Responsibilities

  • Design and implement
    observability frameworks
    across logs, metrics, traces, and events
  • Architect and optimize monitoring platforms (Prometheus, Grafana, ELK, Splunk, Datadog, Dynatrace, etc.)
  • Lead
    performance benchmarking, load/stress testing
    , and capacity planning for enterprise systems
  • Drive
    incident response automation
    , resilience testing, and chaos engineering initiatives
  • Define and deploy
    AIOps strategies
    for predictive analytics and automated remediation
  • Partner with development and business teams to embed
    SRE best practices
    organization-wide
  • Mentor engineers and influence leadership on reliability and scalability strategies

Must-Have Qualifications

  • 10+ years of experience in
    IT Operations, Reliability, or Performance Engineering
  • Strong hands-on expertise in
    observability tools
    (Prometheus, Grafana, Splunk, Datadog, Dynatrace, ELK)
  • Proficiency in
    performance testing
    (JMeter, LoadRunner, Gatling, or k6)
  • Experience in
    cloud platforms
    (AWS, Azure, GCP) and
    containerized environments
    (Kubernetes, Docker, OpenShift)
  • Skilled in
    automation frameworks
    (Terraform, Ansible, Python, Go, Shell scripting)
  • Familiarity with
    AIOps platforms
    (Moogsoft, BigPanda, Dynatrace Davis AI, ServiceNow AIOps)
  • Deep understanding of
    distributed systems, CI/CD, and DevOps
    principles

Good-to-Have

  • Experience leading
    SRE or Observability transformations
    at enterprise scale
  • Knowledge of
    Chaos Engineering
    tools (Gremlin, Chaos Mesh, Litmus)
  • Certifications such as
    Google SRE, AWS DevOps Engineer, Azure SRE Expert, Dynatrace/Datadog
  • Exposure to
    ITSM/ITIL
    and modern incident management frameworks


  • Manila, National Capital Region, Philippines Hire Manila Full time

    Site Reliability Engineering (SRE) is an engineering discipline that blends software and systems engineering to build and operate reliable production systems. SRE ensures that our client's services—both internally critical and externally visible, such as developer tooling and hosted client sites—meet user expectations for reliability and uptime. It also...


  • Manila, National Capital Region, Philippines Flex Employee Services Full time

    Are you ready for the next step in your career?Flex Employee Services PH,is here to support you along your journey. We are actively seeking talented individuals to fill the following opportunity. If this sounds like you (or someone you know), we'd love to connectAbout the JobTitle: Senior Site Reliability EngineerTerm: Full-timeLocation: On-site in San...


  • Manila, National Capital Region, Philippines Maya Full time

    Base AppsOur MissionOur goal is for everyone to make bolder choices with their finances.To get there, we're creating an all-in-one ecosystem of financial services for today's generation of goal-getters. That feat takes extraordinary people-those with the guts to challenge the way things are and transform them into something better.To be part of Team Maya is...


  • Manila, National Capital Region, Philippines Broadridge Full time

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are seeking a Senior Site Reliability Engineer (SRE) to design, build, and operate highly reliable, scalable, and secure...


  • Manila, National Capital Region, Philippines Cambridge University Press & Assessment Full time

    Work setup: Hybrid (open to 2x a week in the office)Work schedule: 10AM to 6PM Manila timeEmployment type: PermanentLocation: Makati City, Metro ManilaPay range: Php 60,000 to Php 81,000We value transparency and encourage applicants comfortable with this range to apply.Discover a world of endless possibilities with Cambridge University Press & Assessment, a...


  • Manila, National Capital Region, Philippines Anime News Network,Inc Full time

    Anime News Network is hiring a full-time DevOps / SRE to help run, scale, and improve the infrastructure powering the internet's most trusted anime news source.What you'll work onMaintain and improve production systems reliability, performance, and securityManage deployments, automation, monitoring, alerts, and incident responseOptimize web stack performance...


  • Manila, National Capital Region, Philippines Nezda Global Full time

    About the RoleAs aSite Reliability Engineer (SRE), you'll own the stability and performance ofIBM MQ and/or Confluent Kafkaplatforms. You'll work hands-on with monitoring, automation, incident response, and system optimization — while partnering closely with application and DevOps teams.This role is ideal for someone who enjoys deep technical ownership and...


  • Manila, National Capital Region, Philippines Machinery Reliability Systems Full time

    Hydraulics Technician Job Description:About the RoleWe are seeking a Hydraulic Technician with expertise in hoses, fittings, and the repair of hydraulic cylinders and pumps. This individual will play a crucial role in diagnosing, repairing, and maintaining hydraulic systems for our clients.Key ResponsibilitiesInspection and Diagnosis:Inspect hydraulic...


  • Manila, National Capital Region, Philippines Infineon Technologies Full time

    We are looking for experienced professional for reliability and qualification assessment for our quality department.Your RoleKey responsibilities in your new roleDefine reliability assessment plan for new package technologies with a failure mechanism-based approach and determine the technical capability of a package technology to meet the specified...

  • Site Engineer

    1 day ago


    Manila, National Capital Region, Philippines Pancrudo Builders Corporation Full time

    About the role Key ResponsibilitiesProject Oversight: Oversee daily site operations, managing schedules, materials, and subcontractors to keep projects on track.Quality & Safety: Ensure work adheres to engineering designs, quality standards, and strict health & safety regulations, conducting inspections and risk assessments.Technical Support: Interpret...