Site Reliability Engineer

3 weeks ago


Metro Manila Philippines Buscojobs Full time

Overview

Work setup : Hybrid (open to x a week in the office)

Work schedule : AM to PM Manila time

Employment type : Permanent

Location : Makati City, Metro Manila

Pay range : Php , to Php ,

We value transparency and encourage applicants comfortable with this range to apply.

Discover a world of endless possibilities with Cambridge University Press & Assessment, a distinguished global academic publisher and assessment organization proudly affiliated with the prestigious University of Cambridge.

We are recruiting for a Site Reliability Engineer to be part of our Education Technology Team. As a Site Reliability Engineer (AI Operations), you\'ll be pioneering operational excellence for AI systems that are transforming how millions learn worldwide

Why Cambridge?

Cambridge University Press & Assessment is a world-renowned not-for-profit academic publisher and assessment organisation, proudly part of the prestigious University of Cambridge. With a legacy rooted in over years of educational excellence, we are dedicated to unlocking the potential of learners and educators across the globe.

Joining Cambridge\'s second largest global office in the Philippines —operating for over years with ,+ colleagues— means becoming a part of an extraordinary institution renowned worldwide. We are recognised as a Great Place to Work for three consecutive years, reflecting our inclusive culture, strong sense of purpose, and commitment to the professional growth and well-being of our people. At Cambridge, we don\'t just publish books or deliver tests—we empower progress, inspire curiosity, and champion the pursuit of knowledge.

What can you get from Cambridge?

At Cambridge, you\'ll become a part of a vibrant and forward-thinking community that transcends tradition, fostering a culture of continuous growth and personal development. Here, we provide the right environment for you to thrive, supporting your professional journey and empowering you to reach your highest potential, that is whyour pay philosophy is intricately tied to your skills and competencies, ensuring that your compensation aligns with the unique value you bring to the role you are applying for.

The organization offers a wide range of benefits and opportunities including :

  • Regular Employment on Day
  • HMO Coverage and Life Insurance on Day
  • Paid Annual Leaves (Vacation, Well-being, Flexible, Holiday, and Volunteering leaves)
  • Vesting / Retirement package
  • Opportunities for career growth and development
  • Access to well-being programs
  • Flexible schedule, hybrid work arrangement and work-life balance
  • Opportunity to collaborate with colleagues from diverse branches that will expand your horizons and enrich your understanding of different cultures
What will you do as a Site Reliability Engineer?

You'll be joining our Education Technology Platform Operations team at a pivotal moment as we embrace AI to enhance learning outcomes globally. Working alongside passionate technologists, you'll help us transform how we deploy and operate AI services - from large language models to intelligent automation platforms - ensuring they're reliable, cost-effective, and ethically sound.

In this role, you'll bridge the gap between cutting-edge AI innovation and production excellence. You'll establish the operational frameworks that allow us to deploy AI responsibly in education, always keeping learner safety and data protection at the forefront.

  • Drive innovation in AI operations by implementing observability solutions for LLM deployments, workflow automation platforms ( nn), and AI services across AWS Bedrock and Azure OpenAI
  • Make a real difference by establishing governance frameworks that ensure our AI services are ethical, compliant, and safe for educational use
  • Transform our approach to cost optimisation for AI workloads through intelligent caching, model selection, and resource allocation strategies
  • Collaborate with teams to operationalise AI features, sharing your expertise to help developers build production-ready, scalable AI solutions
  • Be continuously learning about emerging AI operational tools like Portkey and LiteLLM, bringing new approaches to improve reliability and efficiency
  • Strengthen our impact by implementing sustainable AI practices that consider the environmental footprint of compute-intensive workloads

Please review the attached job description for further details on the role.

What makes you the ideal candidate for this role?
  • Education & Experience : – years in Site Reliability Engineering or related roles, with proven application of operational excellence in emerging technologies. Degree or equivalent experience in Computer Science, Engineering, or related field.
  • Cloud & Infrastructure : Strong experience with cloud platforms, particularly AWS, including Infrastructure as Code (Terraform, CDK, CloudFormation) and cloud-native services.
  • Automation & Delivery : Skilled in delivering change through automation with strong scripting abilities (Python, Bash, etc.) and hands-on experience with CI / CD pipelines (GitHub Actions, Jenkins, Bitbucket Pipelines).
  • Monitoring & Reliability : Practical experience with monitoring and observability systems (Datadog, New Relic, Grafana, ELK / EFK stack) to ensure performance, availability, and incident response in distributed systems.
  • API & Distributed Systems : Knowledge of API management, rate limiting, scalability, and the complexities of distributed architectures, particularly for AI-related workloads.
  • AI & Emerging Tech : Familiarity with Large Language Models, cloud AI services, or workflow automation tools. Willingness to learn and apply new approaches to maximize impact in education technology.
  • Ways of Working : Enthusiastic about exploring possibilities with AI while maintaining operational rigor. Collaborative, curious, and aligned with the vision of using technology to unlock potential in learners worldwide.

This is more than a technical role - it's an opportunity to define how AI operates in educational technology, ensuring it's deployed responsibly and effectively. You'll be at the forefront of establishing best practices that could influence how the entire education sector approaches AI operations.

#J-18808-Ljbffr

  • , Metro Manila, Philippines Buscojobs Full time

    Site Reliability Engineer jobs in the Philippines 47 Site Reliability Engineer jobs in the Philippines Site Reliability Engineer Posted today Job Viewed Tap Again To Close Job Description Responsibilities: Develop, maintain, and optimize SAP landscapes on GCP for our clients, ensuring optimal performance, reliability, and efficiency. Utilize industry-leading...


  • , Oriental Mindoro, Philippines Buscojobs Full time

    Site Reliability Engineer jobs involve ensuring the reliability and performance of systems. Key responsibilities include monitoring system performance, identifying bottlenecks, and implementing optimization strategies. Requirements for Site Reliability Engineers typically include: Bachelor's degree in Computer Science, Engineering, or a related field. Proven...


  • , Metro Manila, Philippines Michael Page Full time

    Join a growing team. Enjoy market-aligned salaries & benefits. About Our Client The hiring company is a large organization in the healthcare industry, focused on delivering innovative solutions to improve patient care and operational efficiency. The company is committed to leveraging cutting-edge technology to support its services. Job Description Oversee...


  • Manila, National Capital Region, Philippines HGS Offshore Staffing Solutions Full time ₱2,000,000 - ₱2,500,000 per year

    SENIOR SITE RELIABILITY ENGINEERPOSITION OVERVIEWWe are seeking an experienced Senior AWS Site Reliability Engineer to join our cross-functionalcloud platform team. Working alongside a diverse group of DevOps and Site ReliabilityEngineers, you will combine deep technical expertise in AWS cloud infrastructure with strongleadership capabilities in incident...


  • , Metro Manila, Philippines Buscojobs Full time

    Lead Site Reliability Engineer Posted today Job Description What Makes Us, Us. Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say Hello to...


  • Manila, National Capital Region, Philippines Cambridge University Press & Assessment Full time ₱60,000 - ₱81,000 per year

    Salary:Php 60,000 to Php 81,000- Location:Manila- Country:Philippines- Business Unit:Technology- Vacancy Type:Permanent- Closing Date:9 October 2025Meet the recruiterImee SantosWork setup: Hybrid (open to 2x a week in the office)Work schedule: 10AM to 6PM Manila timeEmployment type: PermanentLocation: Makati City, Metro ManilaPay range: Php 60,000 to Php...


  • , , Philippines Penbrothers Full time

    Overview Senior Site Reliability Engineer role at Penbrothers. Penbrothers is an HR & remote talent management partner and one of the fastest-growing companies in the Philippines. This role supports reliability for Salesforce and web/mobile application environments, collaborating with engineers to apply world-class practices. Responsibilities Service...


  • Ortigas, Metro Manila, Philippines YONDU INC. Full time ₱1,200,000 - ₱2,400,000 per year

    Job Description:• Handle service monitoring, incident response, and drive technical support efficiency• Responsible for managing and maintaining network monitoring tools, systems, andprocesses that ensure the availability, scalability, and performance of our productionenvironments.• Responsible for incident handling, service monitoring, and technical...


  • Manila, National Capital Region, Philippines Braintrust Full time ₱30,000 - ₱150,000 per year

    Job Description*Compensation range varies off level of experience:*Jr SRE $12k-$18k/yr, Intermediate: $20k-$30k/yr, Senior: $35k - $50k/yrSome travel may be required.*Card payment domain knowledge/experience is key:*Our client, a global Business Process Outsourcing (BPO) businesses is looking for Site Reliability Engineers (SRE) to support their client, a...


  • Southern Manila District, Philippines Royal Caribbean International Full time

    Overview Position Summary: The Site Reliability Engineer (Senior SRE) reports to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE uses performance metrics from various sources and tools to support tasks such as initial triage of critical production...