Senior AWS Site Reliability Engineer
4 weeks ago
Direct message the job poster from Flexisource IT
We are seeking an experienced Senior AWS Site Reliability Engineer to join our cross-functional cloud platform team. Working alongside a diverse group of DevOps and Site Reliability Engineers, you will combine deep technical expertise in AWS cloud infrastructure with strong leadership capabilities in incident response and system reliability. In this role, you will be instrumental in leading incident response, maintaining, optimizing and scaling our cloud infrastructure while ensuring exceptional system reliability and performance.
Responsibilities- Lead incident response from initial detection, real-time mitigation, root cause analysis, post-mortem documentation (using Incident IO) and implementation of lessons learned, with a focus on continuous improvement.
- Develop and execute comprehensive incident response strategies to minimise downtime and business impact
- Participate in a 24/7 on-call rotation to ensure continuous system availability
- Implement and maintain comprehensive observability solutions using Cloudwatch, DataDog or similar monitoring platforms
- Maintain, improve, and optimise AWS infrastructure using Terraform while ensuring scalability, reliability, and cost efficiency.
- Continuously assess and enhance AWS infrastructure to optimise performance and cost-effectiveness
- Monitor and optimise serverless technologies including AWS Lambda and API Gateway for peak performance and cost efficiency
- Monitor and maintain ECS Fargate deployments for containerised applications, ensuring optimal resource utilization
- Collect and analyse metrics to identify resource consumption, abnormal behavior, and potential performance bottlenecks
- Configure and manage alerting, dashboards, and automated monitoring across distributed systems
- Foster improved collaboration between development and operations teams by implementing SRE practices.
- Previous experience in a DevOps or SRE role
- Exceptional written and verbal communication skills
- Proven experience in incident response and 24/7 on-call responsibilities
- Expert-level knowledge of Infrastructure as Code, primarily Terraform (demonstrated experience with other IaC tools will be highly regarded)
- Expert-level knowledge of AWS compute infrastructure
- Proficiency in automation tools and scripting languages
- Strong understanding of monitoring, metrics collection, and performance analysis
- Expert knowledge of observability and monitoring platforms such as DataDog, New Relic, Prometheus, or similar tools
- Experience with log aggregation, APM (Application Performance Monitoring), and distributed tracing
- Excellent collaboration abilities and capacity to work effectively in cross-functional teams
- Strong analytical and problem-solving skills
- Demonstrated ability to work autonomously and take ownership
- Experience with incident.io (highly desirable).
- Background in payments and PCI compliance environments (highly desirable).
- AWS certifications.
- Experience with container orchestration and microservices architecture.
- Knowledge of security best practices in cloud environments.
- Schedule: Monday- Friday, 6:00am- 3:00pm or 7:00am- 4:00pm (PH Time); depending on business needs
- Location: Makati | Work from Home Until Further Notice
-
Site Reliability Engineer
7 days ago
Manila, National Capital Region, Philippines HGS Offshore Staffing Solutions Full time ₱2,000,000 - ₱2,500,000 per yearSENIOR SITE RELIABILITY ENGINEERPOSITION OVERVIEWWe are seeking an experienced Senior AWS Site Reliability Engineer to join our cross-functionalcloud platform team. Working alongside a diverse group of DevOps and Site ReliabilityEngineers, you will combine deep technical expertise in AWS cloud infrastructure with strongleadership capabilities in incident...
-
Senior Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per yearAbout the OpportunityWe are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading theadoption of SRE culture and practiceswithin the...
-
Site Reliability Engineer
3 weeks ago
, Metro Manila, Philippines Buscojobs Full timeSite Reliability Engineer jobs in the Philippines 47 Site Reliability Engineer jobs in the Philippines Site Reliability Engineer Posted today Job Viewed Tap Again To Close Job Description Responsibilities: Develop, maintain, and optimize SAP landscapes on GCP for our clients, ensuring optimal performance, reliability, and efficiency. Utilize industry-leading...
-
Engineer, Site Reliability
4 weeks ago
Southern Manila District, Philippines Royal Caribbean International Full timeOverview Position Summary: The Site Reliability Engineer (Senior SRE) reports to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE uses performance metrics from various sources and tools to support tasks such as initial triage of critical production...
-
Site Reliability Engineer
5 days ago
Manila, National Capital Region, Philippines Braintrust Full time ₱30,000 - ₱150,000 per yearJob Description*Compensation range varies off level of experience:*Jr SRE $12k-$18k/yr, Intermediate: $20k-$30k/yr, Senior: $35k - $50k/yrSome travel may be required.*Card payment domain knowledge/experience is key:*Our client, a global Business Process Outsourcing (BPO) businesses is looking for Site Reliability Engineers (SRE) to support their client, a...
-
Cloud Site Reliability Engineer
5 days ago
Manila, National Capital Region, Philippines Tyler Technologies Full time $80,000 - $150,000 per yearDescriptionResponsibilitiesImplement tooling to monitor AWS EKS-based systems focusing on performance, reliability, and scalability.Ensure that architecture and deployment models are sufficient to support SLA commitments and are well prepared for future problems of scale.Leverage cloud technology and platform capabilities to provide operationally sustainable...
-
Site Reliability Engineer
3 days ago
Manila, National Capital Region, Philippines Cambridge University Press & Assessment Full time ₱60,000 - ₱81,000 per yearSalary:Php 60,000 to Php 81,000- Location:Manila- Country:Philippines- Business Unit:Technology- Vacancy Type:Permanent- Closing Date:9 October 2025Meet the recruiterImee SantosWork setup: Hybrid (open to 2x a week in the office)Work schedule: 10AM to 6PM Manila timeEmployment type: PermanentLocation: Makati City, Metro ManilaPay range: Php 60,000 to Php...
-
Senior Site Reliability Engineer
5 days ago
Manila, National Capital Region, Philippines Satori Full time ₱1,200,000 - ₱2,400,000 per year**Our client, a multinational leader in fleet performance management, is establishing its operations in the Philippines and is currently hiring members for the pioneer team.Job Summary:You will be part of an autonomous team, responsible for maintaining and developing the Client's global SaaS platforms. Your efforts will directly contribute to enabling the...
-
Site Reliability Engineer
5 days ago
Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per yearAt Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are seeking a Site Reliability Engineer (Cloud) to lead the design, implementation, and operational support of our...
-
Senior Site Reliability Engineer
5 days ago
Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per yearAt Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewWe are looking for a seasoned Site Reliability Engineer to design, implement, and maintain scalable, secure, and high-performing...