
Senior Data Platform Reliability Engineer
6 days ago
OpsWerks is a technical consulting company specializing in operational services for the high-tech industry. We help platform and infrastructure teams operate multi-cloud environments, execute complex migrations, and enable seamless app deployments.
Your Role
- Run managed services, not just systems. Operate multi-tenant data/AI platforms (Spark, Airflow, Flink, Jupyter) with clear SLAs/SLIs/SLOs, cost guardrails, and capacity plans across AWS/GCP + Kubernetes.
- Be the face of reliability. Lead incidents end-to-end, own customer comms and post-incident reviews (RCA with actions customers can see and feel).
- Design for Customer experience. Help Data scientists and customers reduce failed/slow jobs, improve time-to-data, and optimize costs—so customers notice faster pipelines and fewer surprises.
- Standardize & scale. Build service runbooks, golden paths, and automation that make onboarding and daily ops predictable across customers.
- Automate the toil away. Ship tooling (Bash/Python, GitOps, CI/CD) for backups, DR drills, upgrades, access, and environment bootstrapping.
- Make signals meaningful. Instrument platforms with metrics/logs/traces; tune alerting to cut noise and improve detection and response times
- Govern change. Plan and execute upgrades/migrations within change windows; champion safe deploys and rollback strategies.
- Partner & mentor. Guide junior engineers; collaborate with customer dev/data teams to unblock delivery and raise the reliability bar.
- Participate in on-call. Join a 24x7 rotation with crisp handoffs and playbooks.
Your Qualifications
- Background: Bachelor's in IT/Engineering (or equivalent practical experience).
- Data operations: Hands-on support for ETL/ELT, SQL, and production pipelines/workflows.
- Platform depth: Strong experience in at least one of Spark, Airflow, Flink, or Jupyter (plus the ecosystem around it).
- Scripting/Programming: Solid working knowledge in at least one (1) language - Python, Java or Scala (Automations, Data Manipulations & Orchestrations)
- Cloud & containers: Real-world AWS or GCP and production environment usage as a User or Administrator
- Kubernetes (or Docker) for scheduling/scale.
- Ops craft: Incident management, post-incident reviews, change management, and service reporting.
- Communication: Clear customer-facing comms (status updates, RCAs, runbooks).
- Tenure: 5+ years across the domains above, with depth in at least 1–2 tools per domain.
Plus Points
- Certifications: CKA/CKAD, AWS (Associate/Professional), or equivalent.
- IaC & DevOps: Terraform, Helm, Argo CD/GitOps, CI/CD for data platforms.
- Observability & ITSM: Prometheus/Grafana/Datadog; Jira Service Management/ServiceNow, StatusPage.
- Security & compliance basics (least-privilege access, audit trails)
Ready to start your awesome journey and be part of OpsWerks?
-
Senior Fullstack Engineer
2 weeks ago
Manila, National Capital Region, Philippines Motor Platform Full time ₱1,200,000 - ₱2,400,000 per yearMotorPlatform is Australia's fastest‑growing digital wholesale software business, delivering massive impact on every meaningful automotive business metric—from opportunity conversions to profit per unit to days in stock. Our driven team is laser‑focused on reshaping the automotive industry. MotorMarket, our flagship marketplace, lets dealers buy and...
-
Senior Site Reliability Engineer
4 weeks ago
Manila, National Capital Region, Philippines Broadridge Financial Solutions Full timeSenior Site Reliability Engineer (Hybrid) page is loaded## Senior Site Reliability Engineer (Hybrid)locations: Manila - 6805 Ayala Avetime type: Full timeposted on: Posted Todayjob requisition id: JR1075784At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Broadridge Full time $90,000 - $120,000 per yearAt Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewAt Broadridge Trading & Connectivity Solutions, we foster a culture of empowerment, innovation, and collaboration, where...
-
Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Broadridge Full time ₱1,200,000 - ₱2,400,000 per yearAt Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewAtBroadridge Trading & Connectivity Solutions, we foster a culture of empowerment, innovation, and collaboration, where...
-
Senior Site Reliability Engineer
6 hours ago
Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per yearAbout the OpportunityWe are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading theadoption of SRE culture and practiceswithin the...
-
Senior Site Reliability Engineer
4 weeks ago
Manila, National Capital Region, Philippines Broadridge Full timeOverviewSenior Site Reliability Engineer (Hybrid-Flexible Options) – BroadridgeWe are looking for a seasoned Site Reliability Engineer to design, implement, and maintain scalable, secure, and high-performing infrastructure solutions across a full-stack environment. This role requires deep collaboration with cross-functional teams to drive automation,...
-
Senior Site Reliability Engineer
4 weeks ago
Manila, National Capital Region, Philippines Canonical Full timeOverviewJoin to apply for the Senior Site Reliability Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our...
-
Senior Site Reliability Engineer
2 weeks ago
Manila, National Capital Region, Philippines Satori Full time ₱1,200,000 - ₱2,400,000 per year**Our client, a multinational leader in fleet performance management, is establishing its operations in the Philippines and is currently hiring members for the pioneer team.Job Summary:You will be part of an autonomous team, responsible for maintaining and developing the Client's global SaaS platforms. Your efforts will directly contribute to enabling the...
-
Senior Cloud Data Engineering Lead
6 hours ago
Manila, National Capital Region, Philippines Complete Development (CoDev) Full time ₱2,500,000 - ₱5,000,000 per yearOverviewReporting to the Chief Technology Officer - NA, the Senior Cloud Data Engineer is responsible for leading initiatives in a cross functional team, playing a critical role architecting and delivering custom cloud solutions with our clients. You will work collaboratively with our experienced Google Cloud and Google Marketing Platform experts to deliver...
-
Data Platform SRE
2 weeks ago
Manila, National Capital Region, Philippines Ciena Full time ₱900,000 - ₱1,200,000 per yearAs the global leader in high-speed connectivity, Ciena is committed to a people-first approach. Our teams enjoy a culture focused on prioritizing a flexible work environment that empowers individual growth, well-being, and belonging. We're a technology company that leads with our humanity—driving our business priorities alongside meaningful social,...