Site Reliability Engineer

2 days ago


Manila, National Capital Region, Philippines Cambridge University Press & Assessment Full time ₱62,000 - ₱84,000 per year

Work setup
: We operate in a hybrid work environment, and we encourage applicants who are open to working in the office
two days a week
to apply.

Work schedule
: 10AM to 6PM Manila time

Employment type
: Permanent

Location
: Makati City, Metro Manila

Pay range
: Php 62,000 to Php 84,000

We value transparency and encourage applicants comfortable with this range to apply.
Discover a world of endless possibilities with Cambridge University Press & Assessment, a distinguished global academic publisher and assessment organization proudly affiliated with the prestigious University of Cambridge.

We are recruiting for a Site Reliability Engineer to be part of our Education Technology Team. As a Site Reliability Engineer (AI Operations), you'll be pioneering operational excellence for AI systems that are transforming how millions learn worldwide
Why Cambridge?
Cambridge University Press & Assessment is a world-renowned not-for-profit academic publisher and assessment organisation, proudly part of the prestigious University of Cambridge. With a legacy rooted in over 800 years of educational excellence, we are dedicated to unlocking the potential of learners and educators across the globe.

Joining Cambridge's second largest global office in the Philippines —operating for over 22 years with 1,300+ colleagues— means becoming a part of an extraordinary institution renowned worldwide. We are recognised as a Great Place to Work for three consecutive years, reflecting our inclusive culture, strong sense of purpose, and commitment to the professional growth and well-being of our people. At Cambridge, we don't just publish books or deliver tests—we empower progress, inspire curiosity, and champion the pursuit of knowledge.

What can you get from Cambridge?
At Cambridge, you'll become a part of a vibrant and forward-thinking community that transcends tradition, fostering a culture of continuous growth and personal development. Here, we provide the right environment for you to thrive, supporting your professional journey and empowering you to reach your highest potential, that is why
our pay philosophy is intricately tied to your skills and competencies, ensuring that your compensation aligns with the unique value you bring to the role you are applying for.
The organization offers a wide range of benefits and opportunities including:

  • HMO Coverage and Life Insurance on Day 1
  • Paid Annual Leaves (Vacation, Well-being, Flexible, Holiday, and Volunteering leaves)
  • Vesting/Retirement package
  • Opportunities for career growth and development
  • Access to well-being programs
  • Flexible schedule, hybrid work arrangement and work-life balance
  • Opportunity to collaborate with colleagues from diverse branches that will expand your horizons and enrich your understanding of different cultures

What will you do as a Site Reliability Engineer?
You'll be joining our Education Technology Platform Operations team at a pivotal moment as we embrace AI to enhance learning outcomes globally. Working alongside passionate technologists, you'll help us transform how we deploy and operate AI services - from large language models to intelligent automation platforms - ensuring they're reliable, cost-effective, and ethically sound.

  • Drive innovation in AI operations by implementing observability solutions for LLM deployments, workflow automation platforms (e.g. n8n), and AI services across AWS Bedrock and Azure OpenAI
  • Make a real difference by establishing governance frameworks that ensure our AI services are ethical, compliant, and safe for educational use
  • Transform our approach to cost optimisation for AI workloads through intelligent caching, model selection, and resource allocation strategies
  • Collaborate with teams to operationalise AI features, sharing your expertise to help developers build production-ready, scalable AI solutions
  • Be continuously learning about emerging AI operational tools like Portkey and LiteLLM, bringing new approaches to improve reliability and efficiency
  • Strengthen our impact by implementing sustainable AI practices that consider the environmental footprint of compute-intensive workloads

Please review the attached job description for further details on the role.
What makes you the ideal candidate for this role?

  • 3–5 years in Site Reliability Engineering or related roles, with proven application of operational excellence in emerging technologies. Degree or equivalent experience in Computer Science, Engineering, or related field.
  • Cloud & Infrastructure: Strong experience with cloud platforms, particularly AWS, including Infrastructure as Code (Terraform, CDK, CloudFormation) and cloud-native services.
  • Automation & Delivery: Skilled in delivering change through automation with strong scripting abilities (Python, Bash, etc.) and hands-on experience with CI/CD pipelines (GitHub Actions, Jenkins, Bitbucket Pipelines).
  • Monitoring & Reliability: Practical experience with monitoring and observability systems (Datadog, New Relic, Grafana, ELK/EFK stack) to ensure performance, availability, and incident response in distributed systems.
  • API & Distributed Systems: Knowledge of API management, rate limiting, scalability, and the complexities of distributed architectures, particularly for AI-related workloads.
  • AI & Emerging Tech: Familiarity with Large Language Models, cloud AI services, or workflow automation tools. Willingness to learn and apply new approaches to maximize impact in education technology.

This is more than a technical role - it's an opportunity to define how AI operates in educational technology, ensuring it's deployed responsibly and effectively. You'll be at the forefront of establishing best practices that could influence how the entire education sector approaches AI operations.



  • Manila, National Capital Region, Philippines HGS Offshore Staffing Solutions Full time ₱2,000,000 - ₱2,500,000 per year

    SENIOR SITE RELIABILITY ENGINEERPOSITION OVERVIEWWe are seeking an experienced Senior AWS Site Reliability Engineer to join our cross-functionalcloud platform team. Working alongside a diverse group of DevOps and Site ReliabilityEngineers, you will combine deep technical expertise in AWS cloud infrastructure with strongleadership capabilities in incident...


  • Manila, National Capital Region, Philippines CDOps Tech Full time ₱120,000 - ₱180,000 per year

    About the OpportunityWe are seeking a seasoned and passionate Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading the adoption of SRE culture and practices within the client's...


  • Manila, National Capital Region, Philippines Russell Tobin Full time ₱120,000 - ₱180,000 per year

    We are seeking a highly skilledSite Reliability Engineering (SRE) Subject Matter Expert (SME)to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing modern SRE capabilities that improve system reliability, scalability, and efficiency across our...


  • Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per year

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewAtBroadridge Trading & Connectivity Solutions, we foster a culture of empowerment, innovation, and collaboration, where...


  • Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per year

    About the OpportunityWe are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading theadoption of SRE culture and practiceswithin the...


  • Manila, National Capital Region, Philippines Russell Tobin Full time $60,000 - $120,000 per year

    We are seeking a highly skilledSite Reliability Engineering (SRE) Subject Matter Expert (SME)to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing modern SRE capabilities that improve system reliability, scalability, and efficiency across our...


  • Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per year

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are seeking a Site Reliability Engineer (Cloud) to lead the design, implementation, and operational support of our...


  • Manila, National Capital Region, Philippines QualityKiosk Technologies Full time ₱1,500,000 - ₱2,500,000 per year

    Experience:6 to 10 yearsLocation:MakatiAbout QualityKiosk TechnologiesQualityKiosk Technologies is one of the world's largest independent Quality Engineering (QE) providers and digital transformation enablers, helping companies build and manage applications for optimal performance and user experience. Founded in 2000, the company specializes in providing...


  • Manila, National Capital Region, Philippines Tyler Technologies Full time $80,000 - $150,000 per year

    DescriptionResponsibilitiesImplement tooling to monitor AWS EKS-based systems focusing on performance, reliability, and scalability.Ensure that architecture and deployment models are sufficient to support SLA commitments and are well prepared for future problems of scale.Leverage cloud technology and platform capabilities to provide operationally sustainable...


  • Manila, National Capital Region, Philippines Broadridge Full time

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are looking for a seasoned Site Reliability Engineer to design, implement, and maintain scalable, secure, and high-performing...