
Site Reliability Engineer
10 hours ago
Overview
Site Reliability Engineer (AI Operations) - 6426
Work setup: Hybrid (open to 2x a week in the office)
Work schedule: 10AM to 6PM Manila time
Employment type: Permanent
Pay range: Php 60,000 to Php 81,000

We are recruiting for a Site Reliability Engineer to be part of our Education Technology Team. As a Site Reliability Engineer (AI Operations), you'll be pioneering operational excellence for AI systems that are transforming how millions learn worldwide.

Discover a world of endless possibilities with Cambridge University Press & Assessment, a distinguished global academic publisher and assessment organization proudly affiliated with the prestigious University of Cambridge.

Responsibilities
Drive innovation in AI operations by implementing observability solutions for LLM deployments, workflow automation platforms (e.g. n8n), and AI services across AWS Bedrock and Azure OpenAI.
Establish governance frameworks that ensure our AI services are ethical, compliant, and safe for educational use.
Transform our approach to cost optimisation for AI workloads through intelligent caching, model selection, and resource allocation strategies.
Collaborate with teams to operationalise AI features, sharing expertise to help developers build production-ready, scalable AI solutions.
Keep learning about emerging AI operational tools like Portkey and LiteLLM, bringing new approaches to improve reliability and efficiency.
Strengthen our impact by implementing sustainable AI practices that consider the environmental footprint of compute-intensive workloads.

Qualifications
Education & Experience: 3–5 years in Site Reliability Engineering or related roles, with proven application of operational excellence in emerging technologies. Degree or equivalent experience in Computer Science, Engineering, or related field.
Cloud & Infrastructure: Strong experience with cloud platforms, particularly AWS, including Infrastructure as Code (Terraform, CDK, CloudFormation) and cloud-native services.
Automation & Delivery: Skilled in delivering change through automation with strong scripting abilities (Python, Bash, etc.) and hands-on experience with CI/CD pipelines (GitHub Actions, Jenkins, Bitbucket Pipelines).
Monitoring & Reliability: Practical experience with monitoring and observability systems (Datadog, New Relic, Grafana, ELK/EFK stack) to ensure performance, availability, and incident response in distributed systems.
API & Distributed Systems: Knowledge of API management, rate limiting, scalability, and the complexities of distributed architectures, particularly for AI-related workloads.
AI & Emerging Tech: Familiarity with Large Language Models, cloud AI services, or workflow automation tools. Willingness to learn and apply new approaches to maximize impact in education technology.
Ways of Working: Enthusiastic about exploring possibilities with AI while maintaining operational rigor. Collaborative, curious, and aligned with the vision of using technology to unlock potential in learners worldwide.

Additional notes
This is more than a technical role - it's an opportunity to define how AI operates in educational technology, ensuring it's deployed responsibly and effectively. You'll be at the forefront of establishing best practices that could influence how the entire education sector approaches AI operations.
-
Site Reliability Engineer
1 week ago
Metro Manila, Philippines Buscojobs Full time
Responsibilities: Develop, maintain, and optimize SAP landscapes on GCP for our clients, ensuring optimal performance, reliability, and efficiency. Utilize industry-leading...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Cambridge University Press & Assessment Full time ₱60,000 - ₱81,000 per year
Salary: Php 60,000 to Php 81,000. Location: Manila. Country: Philippines. Business Unit: Technology. Vacancy Type: Permanent. Closing Date: 9 October 2025. Recruiter: Imee Santos. Work setup: Hybrid (open to 2x a week in the office). Work schedule: 10AM to 6PM Manila time. Employment type: Permanent. Location: Makati City, Metro Manila. Pay range: Php 60,000 to Php...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Braintrust Full time ₱30,000 - ₱150,000 per year
Compensation range varies by level of experience: Jr SRE: $12k-$18k/yr, Intermediate: $20k-$30k/yr, Senior: $35k-$50k/yr. Some travel may be required. Card payment domain knowledge/experience is key. Our client, a global Business Process Outsourcing (BPO) business, is looking for Site Reliability Engineers (SRE) to support their client, a...
-
Engineer, Site Reliability
2 weeks ago
Southern Manila District, Philippines Royal Caribbean International Full time
Position Summary: The Site Reliability Engineer (Senior SRE) reports to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE uses performance metrics from various sources and tools to support tasks such as initial triage of critical production...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Broadridge Full time $90,000 - $120,000 per year
At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team. Role Overview: At Broadridge Trading & Connectivity Solutions, we foster a culture of empowerment, innovation, and collaboration, where...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per year
At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team. Role Overview: At Broadridge Trading & Connectivity Solutions, we foster a culture of empowerment, innovation, and collaboration, where...
-
Senior Site Reliability Engineer
4 weeks ago
Manila, National Capital Region, Philippines Broadridge Financial Solutions Full time
Senior Site Reliability Engineer (Hybrid). Locations: Manila - 6805 Ayala Ave. Time type: Full time. Posted on: Posted Today. Job requisition id: JR1075784. At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your...
-
Senior Site Reliability Engineer
1 day ago
Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per year
About the Opportunity: We are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading the adoption of SRE culture and practices within the...
-
Cloud Site Reliability Engineer
3 weeks ago
Manila, Philippines Tyler Technologies Full time
Responsibilities: Implement tooling to monitor AWS EKS-based systems focusing on performance, reliability, and scalability. Ensure that architecture and deployment models are sufficient to support SLA commitments and are well prepared for future problems of scale....
-
Site Reliability Engineer
3 weeks ago
Metro Manila, Philippines ABC Worldwide (AKA BRIP Careers Worldwide) Full time
Overview: Our client, a global Business Process Outsourcing (BPO) business, is looking for Site Reliability Engineers (SRE) to support their global payment technology company that provides platforms to consumers, businesses and organizations to make electronic payments. The successful candidate will be responsible for ensuring site reliability & performance,...