Senior Site Reliability Engineer
3 days ago
We are looking for a Senior Site Reliability Engineer with cloud platform experience. You will be part of a team responsible for operating and maintaining production clusters and developing our observability solutions. You will collaborate with team members to develop automation strategies, build monitoring and alerting, and ensure overall platform reliability. Your goal will be to become an integral part of the team, treating every platform challenge as your own and solving it accordingly.
Responsibilities
- Ensure platform reliability and availability across production and pre-production environments through proactive monitoring, alerting, and automation.
- Act as first responder for incidents and contribute to problem management and root cause analysis.
- Support the development team's reliability efforts and help build a solid reliability culture within the development lifecycle.
- Develop troubleshooting documentation for production support resources.
- Collaborate with engineering teams to develop effective runbooks and operational documentation, and to automate operational tasks.
- Collaborate with development and cloud engineering teams to embed reliability and performance into the software delivery lifecycle.
- Design, implement, and evolve observability solutions (metrics, logs, traces, dashboards) using tools such as Prometheus, Grafana, and ELK.
- Participate in on-call rotations and continuously improve alert quality and response processes.
- Champion a culture of reliability, performance, and continuous improvement across teams.
Requirements
- Bachelor's Degree or MS in Engineering or equivalent.
- Experience operating at least one container orchestration cluster (Kubernetes, Docker Swarm).
- Experience developing or maintaining software for production services at scale.
- Experience with ELK.
- Experience with AWS.
- Experience with Grafana/Prometheus stack.
- Strong scripting skills (Bash, Python or Go).
- Excellent communication skills.
- Thinking outside the box and anticipating challenges. It is imperative that we are not simply reactive; we must expect challenges and question the technologies, procedures and thinking already in place. You will be expected to constantly review and challenge at all levels.
- Versatility. We work with agile/lean methods. We'd much rather iterate and learn than assume we know all the answers.
- Being a team player. You don't (always) work in isolation and are excited by the thought of working with your team while involving product, experience design, engineering, and more in the process.
Considered a plus:
- Telephony knowledge (SIP, VoIP);
- Experience with Linux administration (RedHat, CentOS, AL);
- Working knowledge of configuration management tools (Terraform, Ansible);
- Experience with TCP/IP and general networking concepts;
- RDBMS knowledge (MySQL, Postgres);
- NoSQL knowledge (Redis).
Benefits
- Fixed compensation;
- Long-term employment with vacation counted in working days;
- Professional development opportunities (courses, training, etc.);
- Being part of successful cutting-edge technology products that are making a global impact in the service industry;
- Proficient and fun-to-work-with colleagues;
- Apple gear.
Omilia is proud to be an equal opportunity employer and is dedicated to fostering a diverse and inclusive workplace. We believe that embracing diversity in all its forms enriches our workplace and drives our collective success. We are committed to creating an environment where everyone feels welcomed, valued, and empowered to contribute their unique perspectives. All eligible candidates will be given consideration for employment without regard to factors such as race, color, religion, gender, gender identity or expression, sexual orientation, national origin, heredity, disability, age, or veteran status.