Site Reliability Engineer

2 days ago


Makati City, National Capital Region, Philippines Cambridge Assessment Full time ₱62,000 - ₱84,000 per year
Work setup: We operate in a hybrid work environment, and we encourage applicants who are open to working in the office two days a week to apply.

Work schedule: 10AM to 6PM Manila time

Employment type: Permanent

Location: Makati City, Metro Manila

Pay range: Php 62,000 to Php 84,000

We value transparency and encourage applicants comfortable with this range to apply.

Discover a world of endless possibilities with Cambridge University Press & Assessment, a distinguished global academic publisher and assessment organization proudly affiliated with the prestigious University of Cambridge.

We are recruiting for a Site Reliability Engineer to be part of our Education Technology Team. As a Site Reliability Engineer (AI Operations), you'll be pioneering operational excellence for AI systems that are transforming how millions learn worldwide

Why Cambridge?

Cambridge University Press & Assessment is a world-renowned not-for-profit academic publisher and assessment organisation, proudly part of the prestigious University of Cambridge. With a legacy rooted in over 800 years of educational excellence, we are dedicated to unlocking the potential of learners and educators across the globe.

Joining Cambridge's second largest global office in the Philippines -operating for over 22 years with 1,300+ colleagues- means becoming a part of an extraordinary institution renowned worldwide. We are recognised as a Great Place to Work for three consecutive years, reflecting our inclusive culture, strong sense of purpose, and commitment to the professional growth and well-being of our people. At Cambridge, we don't just publish books or deliver tests-we empower progress, inspire curiosity, and champion the pursuit of knowledge.

What can you get from Cambridge?

At Cambridge, you'll become a part of a vibrant and forward-thinking community that transcends tradition, fostering a culture of continuous growth and personal development. Here, we provide the right environment for you to thrive, supporting your professional journey and empowering you to reach your highest potential, that is why our pay philosophy is intricately tied to your skills and competencies, ensuring that your compensation aligns with the unique value you bring to the role you are applying for.

The organization offers a wide range of benefits and opportunities including:
  • HMO Coverage and Life Insurance on Day 1
  • Paid Annual Leaves (Vacation, Well-being, Flexible, Holiday, and Volunteering leaves)
  • Vesting/Retirement package
  • Opportunities for career growth and development
  • Access to well-being programs
  • Flexible schedule, hybrid work arrangement and work-life balance
  • Opportunity to collaborate with colleagues from diverse branches that will expand your horizons and enrich your understanding of different cultures
What will you do as a Site Reliability Engineer?

You'll be joining our Education Technology Platform Operations team at a pivotal moment as we embrace AI to enhance learning outcomes globally. Working alongside passionate technologists, you'll help us transform how we deploy and operate AI services - from large language models to intelligent automation platforms - ensuring they're reliable, cost-effective, and ethically sound.
  • Drive innovation in AI operations by implementing observability solutions for LLM deployments, workflow automation platforms (e.g. n8n), and AI services across AWS Bedrock and Azure OpenAI
  • Make a real difference by establishing governance frameworks that ensure our AI services are ethical, compliant, and safe for educational use
  • Transform our approach to cost optimisation for AI workloads through intelligent caching, model selection, and resource allocation strategies
  • Collaborate with teams to operationalise AI features, sharing your expertise to help developers build production-ready, scalable AI solutions
  • Be continuously learning about emerging AI operational tools like Portkey and LiteLLM, bringing new approaches to improve reliability and efficiency
  • Strengthen our impact by implementing sustainable AI practices that consider the environmental footprint of compute-intensive workloads
Please review the attached job description for further details on the role.

What makes you the ideal candidate for this role?
  • 3-5 years in Site Reliability Engineering or related roles, with proven application of operational excellence in emerging technologies. Degree or equivalent experience in Computer Science, Engineering, or related field.
  • Cloud & Infrastructure: Strong experience with cloud platforms, particularly AWS, including Infrastructure as Code (Terraform, CDK, CloudFormation) and cloud-native services.
  • Automation & Delivery: Skilled in delivering change through automation with strong scripting abilities (Python, Bash, etc.) and hands-on experience with CI/CD pipelines (GitHub Actions, Jenkins, Bitbucket Pipelines).
  • Monitoring & Reliability: Practical experience with monitoring and observability systems (Datadog, New Relic, Grafana, ELK/EFK stack) to ensure performance, availability, and incident response in distributed systems.
  • API & Distributed Systems: Knowledge of API management, rate limiting, scalability, and the complexities of distributed architectures, particularly for AI-related workloads.
  • AI & Emerging Tech: Familiarity with Large Language Models, cloud AI services, or workflow automation tools. Willingness to learn and apply new approaches to maximize impact in education technology.
This is more than a technical role - it's an opportunity to define how AI operates in educational technology, ensuring it's deployed responsibly and effectively. You'll be at the forefront of establishing best practices that could influence how the entire education sector approaches AI operations.

  • Makati City, National Capital Region, Philippines Brixio Full time ₱80,000 - ₱150,000 per year

    #RemoteWork Opportunity: AZURE Cloud: Site Reliability Engineer (SRE)*MUST BE RESIDING IN THE PHILIPPINES*Position: Site Reliability Engineer (SRE)Location: Philippines (Remote)About the Project:Join us in supporting the groundbreaking Website Factory (WSF) project for a global cosmetics company. This project manages over 400 brand websites, providing a...


  • Makati City, National Capital Region, Philippines Cambridge University Press & Assessment | Manila Full time ₱62,000 - ₱84,000 per year

    NOTE: When you click the apply button, you will be re-directed to Cambridge University Press & Assessment's website where you will be required to create a profile and upload a copy of your CV to complete your application.Work setup: We operate in a hybrid work environment, and we encourage applicants who are open to working in the office two days a week to...


  • Makati City, National Capital Region, Philippines Descartes Systems Group Full time ₱30,000 - ₱60,000 per year

    Descartes Unites the People and Technology that Move the WorldThe need for efficient, secure, and agile supply chains and logistics operations has become ever more critical and complex. By combining innovative technology, powerful trade intelligence and the reach of our network, Descartes helps get goods, information, transportation assets, and people where...


  • Makati City, National Capital Region, Philippines The Descartes Systems Group, Inc. Full time ₱60,000 - ₱120,000 per year

    Descartes Unites the People and Technology that Move the WorldThe need for efficient, secure, and agile supply chains and logistics operations has become ever more critical and complex. By combining innovative technology, powerful trade intelligence and the reach of our network, Descartes helps get goods, information, transportation assets, and people where...


  • Makati City, National Capital Region, Philippines Broadridge Full time ₱1,800,000 - ₱2,500,000 per year

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are seeking a Site Reliability Engineer (Cloud) to lead the design, implementation, and operational support of our...


  • Quezon City, National Capital Region, Philippines Comrise Full time ₱900,000 - ₱1,200,000 per year

    We are seeking a Site Reliability Engineer (Cloud) to join our growing technology team. In this role, you will be responsible for maintaining and enhancing the reliability, performance, and scalability of our cloud infrastructure. You'll apply software engineering principles to operations tasks, helping ensure the continuous availability and resilience of...

  • Site Reliability

    7 days ago


    Makati City, National Capital Region, Philippines Canonical - Jobs Full time ₱2,500,000 - ₱6,000,000 per year

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • Makati City, National Capital Region, Philippines Canonical - Jobs Full time ₱80,000 - ₱120,000 per year

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and...


  • Makati City, National Capital Region, Philippines iScale Solutions, Inc. Full time ₱2,000,000 - ₱2,500,000 per year

    Preferred QualificationsHands-on experience migrating applications to SRE operating models in multi-team/multi-application settings.Certification(s): Google Cloud Professional DevOps Engineer, Kubernetes CKA/CKS, or equivalent.Core ExpertiseSRE Foundations & PracticesDeep understanding of SRE principles (SLIs, SLOs, error budgets, toil reduction,...


  • Makati City, National Capital Region, Philippines Broadridge Full time ₱80,000 - ₱120,000 per year

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are looking for a seasoned Site Reliability Engineer to design, implement, and maintain scalable, secure, and high-performing...