Site Reliability Engineer

4 days ago


Ortigas Metro Manila, Philippines YONDU INC. Full time ₱1,200,000 - ₱2,400,000 per year

Job Description:


• Handle service monitoring and incident response, and drive technical support efficiency.


• Responsible for managing and maintaining network monitoring tools, systems, and processes that ensure the availability, scalability, and performance of our production environments.


• Responsible for incident handling, service monitoring, and technical support efficiency.


• Work closely with developers, DevOps, infrastructure teams, and other stakeholders to achieve proactive incident prevention, issue resolution, and incident documentation.

Key Responsibilities:


• Ensure that all tickets are updated and handled based on set KPIs and SLAs.


• Manage monitoring, alerting, and logging tools to ensure system health and service uptime.


• Ensure early detection, triage, and escalation of service degradation based on defined service level agreements.


• Trigger L2 ticket handling and on-call rotations for critical incidents.


• Execute triage, diagnosis, and resolution of incidents required for L3 escalations, with both internal and third-party support teams.


• Support major incident response, contribute to root cause analysis (RCA), and help document postmortems.


• Track, analyze, and act on incident trends and recurring technical issues.


• Use data from ticketing systems (Jira, ServiceNow, etc.) to improve team responsiveness and resolution quality.


• Update and maintain SOPs, runbooks, and knowledge base articles, including documentation of known issues, fixes, and playbooks, to improve mean time to resolution.


• Collaborate with development and QA teams to improve deployment readiness and reliability.


• Participate in technical competency mapping to ensure coverage and reduce unnecessary escalations.

Skills and Competencies:


• Hands-on experience with ITSM platforms (e.g., ServiceNow, Jira Service Management).


• Familiarity with ITIL principles and ITSM process areas (incident, problem, request, change, asset, and service catalog management).


• Basic knowledge of IT infrastructure components (networks, servers, applications) and how they support IT services.


• Experience in monitoring system performance and escalating outages or performance degradation.


• Ability to troubleshoot and document IT issues effectively for escalation and closure.


• Strong attention to detail in documentation, ticket updates, and asset records.


• Familiarity with regulatory and compliance frameworks (e.g., BSP, PDIC, ISO 27001, COBIT) is a plus.


• Clear written and verbal communication skills for ticket handling and team collaboration.


• Proactive, detail-oriented, and able to manage multiple tasks in a structured IT operations environment.

Qualifications and Experience:


• Bachelor's degree in Electronics Engineering, Information Technology, Computer Science, Management Information Systems, or equivalent.


• 2–5 years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.


• Hands-on experience with monitoring tools (e.g., Prometheus, Grafana, ELK, or Datadog).


• Familiarity with incident response and troubleshooting in production systems.


• Experience with at least one cloud platform (AWS, GCP, or Azure).


• Knowledgeable in scripting (e.g., Python, Bash) and Linux systems.


• Exposure to ITIL-based processes, especially Incident and Problem Management.


• Experience working in fintech, banking, or SaaS with high-availability SLAs.


• Familiarity with DevOps practices, CI/CD pipelines, and cloud-based monitoring tools.


• Experience with automation platforms.


• Knowledge of BSP regulatory frameworks, policies, and guidelines.


