Reliability Operations Engineer
1 week ago
As employees, we take pride in contributing to the world's largest and only full-stack cloud communication platform. But it's not just what we do, it's how we do it: with curiosity, passion, and a whole lot of collaboration.
If you're looking for meaningful work and challenges that grow you in a culture where people show up with purpose, this is your opportunity.
Let's build what's next, together.
What this role is all about
As part of Reliability Operations, you'll be on the frontline of platform health: detecting, triaging, and coordinating responses to incidents that impact our customers. You'll improve observability, tune alerting, and collaborate across engineering to drive fast, effective mitigation. You'll document learnings, evolve runbooks, and automate wherever possible to prevent issues from becoming incidents.
What you'll do
- Act as first responder to platform alerts; triage issues, assess customer impact, and coordinate mitigation
- Monitor products and services, detect anomalies, and utilize runbooks to identify affected systems, locations, and teams
- Lead incident resolution as incident commander; clearly communicate status and escalate to responsible stakeholders
- Prevent incidents by fine-tuning alerting, improving observability, and collaborating on common mitigation tactics
- Create and maintain runbooks; capture post-incident learnings and drive corrective actions
- Automate detection/response workflows to reduce toil and improve response time
- Partner with product and engineering to improve reliability and operational readiness
What you'll bring
- 3+ years in reliability operations, network operations, production support, or similar (engineering/support background)
- Hands-on with monitoring/observability tools (e.g., Grafana, Prometheus, New Relic, Graylog, Kibana/Elasticsearch/OpenSearch)
- Strong systems thinking, troubleshooting, analytical and investigative skills; comfortable with large data sets
- Clear communicator in English; able to summarize incidents for technical and non-technical audiences
- Experience following and improving runbooks; coordinating incident response is a plus
- Familiarity with system administration tasks or scripting/automation is an advantage
- Curious, detail-oriented, collaborative; driven by continuous improvement and learning
What we offer
- Financial rewards & recognition - Fair compensation aligned with your experience, industry, and market standards; performance-driven bonuses; regular reviews to support your growth and recognize your contributions; and a culture that values your impact.
- Flexible work arrangements - We combine in-person collaboration with remote work and flexible working hours, because great ideas happen everywhere - and not always between 9 and 6.
- ESOP (Employee Stock Ownership Plan) - As an Infobip employee, you'll have the opportunity to share in our company's success through stock options.
- Work-life balance and well-being - We offer time off when you need it, special leave days for life's big moments, and a flexible hybrid work model tailored to local regulations.
- Career mobility - Your career is a journey. With internal mobility, upskilling, and mentorship, we help you shape your path.
- Professional development - Learning never stops. Onboarding, mentorship, and training programs help you grow, no matter where you start.
- International mobility - Ready to take your career global? Explore short- and long-term opportunities in our Hubs worldwide.
Diversity drives connection
Infobip is built on diverse backgrounds, perspectives, and talents. We're proud to be an equal-opportunity employer and are committed to fostering an inclusive workplace.
No matter your race, gender, age, background, or identity - if you have the passion and skills to thrive, there's a place for you here.
All qualified applicants will receive consideration for employment without regard to race, color, ancestry, religion, age, sex, sexual orientation, gender, gender identity, national origin, citizenship, disability, veteran status or any other part of one's identity.
-
Reliability Operations Engineer
22 hours ago
Manila, National Capital Region, Philippines
Infobip
Full time
₱1,200,000 - ₱2,400,000 per year
Working at Infobip means being part of something truly global. With 75+ offices across six continents, we're not just building technology; we're shaping how more than 80% of the world connects and communicates. As employees, we take pride in contributing to the world's largest and only full-stack cloud communication platform. But it's not just what we do,...