Site Reliability Engineer
3 weeks ago
Human Resources Executive at Tata Consultancy Services

Job Description: Site Reliability Engineering (SRE) SME

Position Overview
We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing modern SRE capabilities that improve system reliability, scalability, and efficiency across our IT ecosystem. This role requires deep technical expertise, hands‑on problem‑solving skills, and the ability to influence cross‑functional teams.

Key Responsibilities

Observability & Monitoring
Define and implement observability frameworks across logs, metrics, traces, and events.
Establish SLOs, SLIs, and error budgets in collaboration with product and engineering teams.
Drive proactive incident detection and root cause analysis.

Performance Engineering
Lead performance benchmarking, load/stress testing, and scalability assessments of applications and infrastructure.
Build performance models and capacity planning strategies for critical business systems.
Partner with development teams to identify performance bottlenecks and optimize application/infrastructure efficiency.

Reliability Engineering
Design and implement automation for incident response, disaster recovery, and self‑healing systems.
Lead Chaos Engineering and resilience testing initiatives.
Drive reliability reviews, postmortems, and blameless RCA culture.
Ensure best practices for fault tolerance, availability, and resilience are embedded in system design.

Define AIOps strategy and deploy ML/AI‑driven observability and incident response capabilities.
Leverage anomaly detection, event correlation, and predictive analytics for proactive IT operations.
Integrate AIOps platforms with ITSM tools for intelligent ticketing, alert suppression, and automated remediation.

Act as a thought leader in SRE practices, mentoring engineers and influencing leadership decisions.
Partner with development, infrastructure, and business teams to embed SRE principles across the enterprise.
Drive continuous improvement culture for availability, scalability, and operational excellence.

Required Qualifications
10+ years of experience in IT Operations, Reliability Engineering, or Performance Engineering.
Deep expertise in observability and monitoring platforms (Prometheus, Grafana, Splunk, Datadog, Dynatrace, ELK, AppDynamics, etc.).
Strong background in performance testing tools (JMeter, LoadRunner, Gatling, k6, etc.) and capacity planning.
Hands‑on experience in cloud platforms (AWS, Azure, GCP) and containerized environments (Kubernetes, Docker, OpenShift).
Experience with AIOps platforms (Moogsoft, BigPanda, Dynatrace Davis AI, ServiceNow AIOps, etc.) and ML‑driven IT operations.
Strong understanding of distributed systems, networking, CI/CD, and DevOps practices.

Preferred Qualifications
Prior experience leading enterprise‑wide SRE/Observability transformations.
Knowledge of Chaos Engineering platforms (Gremlin, Chaos Mesh, Litmus).
Exposure to ITSM/ITIL processes and modern incident management practices.
Strong communication skills with ability to influence CxO‑level stakeholders.
Certifications: Google SRE, AWS DevOps Engineer, Azure SRE Expert, Dynatrace/Datadog certifications (preferred).
Strategic and analytical thinker with problem‑solving mindset.
Strong leadership, mentorship, and stakeholder engagement skills.
Passionate about automation, scalability, and resilience engineering.
Ability to balance reliability with velocity in fast‑paced environments.

Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Information Technology
Industries: IT Services and IT Consulting
-
Site Reliability Engineer
4 weeks ago
Manila, Philippines Russell Tobin Full time
Senior Associate - Talent Acquisition - Corporate Strategy Hiring | Specialized in APAC
We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing...
-
Engineer, Site Reliability
4 weeks ago
Southern Manila District, Philippines Royal Caribbean International Full time
Overview
Position Summary: The Site Reliability Engineer (Senior SRE) reports to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE uses performance metrics from various sources and tools to support tasks such as initial triage of critical production...
-
Site Reliability Engineering Manager
3 weeks ago
Manila, Philippines Russell Tobin Full time
Senior Associate - Talent Acquisition - Corporate Strategy Hiring | Specialized in APAC
We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing...
-
Cloud Site Reliability Engineer
4 weeks ago
Manila, Philippines Tyler Technologies, Inc. Full time
Cloud Site Reliability Engineer
Location: Manila, Philippines
Responsibilities
Implement tooling to monitor AWS EKS-based systems focusing on performance, reliability, and scalability. Ensure that architecture and deployment models are sufficient to support SLA commitments and are well prepared for future problems of scale. Leverage cloud...
-
Senior Site Reliability Engineer
1 week ago
Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per year
About the Opportunity
We are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading the adoption of SRE culture and practices within the...
-
Site Reliability Engineer 14N25
2 weeks ago
Metro Manila, Philippines TALENTMATE Full time
Job Description
As a Site Reliability Engineer (SRE) 14N25, you will be integral in transforming and maintaining reliable systems while working across diverse engineering, operations, and support teams. Your primary focus will be ensuring the uptime, performance, and resilience of crucial online platforms and services. By employing both software engineering...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Aumtrend Full time ₱1,200,000 - ₱2,400,000 per year
Role: Site Reliability Engineer - IBM MQ
Company: One of the global client
Location: BGC - Manila
Work Setup: Hybrid - 2 days onsite/week
Schedule: Day shift
Permanent position & Direct Hiring by the client
Required Technical Skill Set:
Hiring for two levels: L3 (Senior) and L2.5 (Mid-senior)
Technical requirements:
Core focus: IBM MQ and Kafka administration —...
-
Site Reliability Engineer
4 weeks ago
Metro Manila, Philippines Michael Page Full time
Join a growing team. Enjoy market-aligned salaries & benefits.
About Our Client
The hiring company is a large organization in the healthcare industry, focused on delivering innovative solutions to improve patient care and operational efficiency. The company is committed to leveraging cutting-edge technology to support its services.
Job Description
Oversee...
-
Site Reliability Engineer
2 weeks ago
Metro Manila, Philippines QualityKiosk Technologies Full time
Uniting Talent with Opportunity | Talent Acquisition | Strategic Hiring | Global Recruitment | SAAS GTM & Tech Hiring | MarTech | FinTech
Experience: 6 to 10 years
Location: Makati
About QualityKiosk Technologies
QualityKiosk Technologies is one of the world’s largest independent Quality Engineering (QE) providers and digital transformation enablers,...
-
Site Reliability Engineer
1 week ago
Manila, National Capital Region, Philippines Acquire Intelligence Full time ₱1,500,000 - ₱2,500,000 per year
We're an award-winning global outsourcer providing contact center and back office services on behalf of our global clients. Come work at a place where innovation and teamwork come together to support the most exciting missions in the world.
Acquire Intelligence exists to help businesses unlock smarter ways of working. We believe that by combining the best of...