Senior Data Platform Reliability Engineer

2 weeks ago


Manila, Philippines OpsWerks Full time

OpsWerks is a technical consulting company specializing in operational services for the high-tech industry. We help platform and infrastructure teams operate multi-cloud environments, execute complex migrations, and enable seamless app deployments.

Your Role

Run managed services, not just systems. Operate multi-tenant data/AI platforms (Spark, Airflow, Flink, Jupyter) with clear SLAs/SLIs/SLOs, cost guardrails, and capacity plans across AWS/GCP + Kubernetes.
Be the face of reliability. Lead incidents end-to-end; own customer comms and post-incident reviews (RCA with actions customers can see and feel).
Design for customer experience. Help data scientists and customers reduce failed/slow jobs, improve time-to-data, and optimize costs, so customers notice faster pipelines and fewer surprises.
Standardize & scale. Build service runbooks, golden paths, and automation that make onboarding and daily ops predictable across customers.
Automate the toil away. Ship tooling (Bash/Python, GitOps, CI/CD) for backups, DR drills, upgrades, access, and environment bootstrapping.
Make signals meaningful. Instrument platforms with metrics/logs/traces; tune alerting to cut noise and improve detection and response times.
Govern change. Plan and execute upgrades/migrations within change windows; champion safe deploys and rollback strategies.
Partner & mentor. Guide junior engineers; collaborate with customer dev/data teams to unblock delivery and raise the reliability bar.
Participate in on-call. Join a 24x7 rotation with crisp handoffs and playbooks.

Your Qualifications

Background: Bachelor’s in IT/Engineering (or equivalent practical experience).
Data operations: Hands-on support for ETL/ELT, SQL, and production pipelines/workflows.
Platform depth: Strong experience in at least one of Spark, Airflow, Flink, or Jupyter (plus the ecosystem around it).
Scripting/Programming: Solid working knowledge of at least one language (Python, Java, or Scala) for automation, data manipulation, and orchestration.
Cloud & containers: Real-world production use of AWS or GCP as a user or administrator; Kubernetes (or Docker) for scheduling/scale.
Ops craft: Incident management, post-incident reviews, change management, and service reporting.
Communication: Clear customer-facing comms (status updates, RCAs, runbooks).
Tenure: 5+ years across the domains above, with depth in at least 1–2 tools per domain.

Plus Points

Certifications: CKA/CKAD, AWS (Associate/Professional), or equivalent.
IaC & DevOps: Terraform, Helm, Argo CD/GitOps, CI/CD for data platforms.
Observability & ITSM: Prometheus/Grafana/Datadog; Jira Service Management/ServiceNow, StatusPage.
Security & compliance basics (least-privilege access, audit trails).

Ready to start your awesome journey and be part of OpsWerks?

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: IT Services and IT Consulting



  • Manila, National Capital Region, Philippines OpsWerks Full time ₱1,200,000 - ₱2,400,000 per year

    OpsWerks is a technical consulting company specializing in operational services for the high-tech industry. We help platform and infrastructure teams operate multi-cloud environments, execute complex migrations, and enable seamless app deployments. Your Role: Run managed services, not just systems. Operate multi-tenant data/AI platforms (Spark, Airflow, Flink,...


  • Manila, Philippines Tata Consultancy Services Full time

    Human Resources Executive at Tata Consultancy Services Job Description: Site Reliability Engineering (SRE) SME Position Overview We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for...


  • Southern Manila District, Philippines HRTX Full time

    Overview: We are looking for a Senior Platform Engineer to join our Infrastructure team in Manila. The candidate will write code used in operations, work on infrastructure management, and maintain production health 24/7/365. The engineer is also expected to work as part of a global team. Responsibilities...


  • Manila, Philippines Russell Tobin Full time

    Senior Associate - Talent Acquisition - Corporate Strategy Hiring | Specialized in APAC We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing...


  • Manila, National Capital Region, Philippines CDOps Tech Full time ₱2,000,000 - ₱2,500,000 per year

    About the Opportunity: We are seeking a seasoned and passionate Senior Site Reliability Engineer for a high-impact contract engagement with one of our key clients, a leader in the marketing-tech sector. This is not just a typical SRE role; you will be the foundational expert responsible for spearheading the adoption of SRE culture and practices within the...


  • Metro Manila, Philippines QualityKiosk Technologies Full time

    Uniting Talent with Opportunity | Talent Acquisition | Strategic Hiring | Global Recruitment | SAAS GTM & Tech Hiring | MarTech | FinTech Experience: 6 to 10 years Location: Makati About QualityKiosk Technologies QualityKiosk Technologies is one of the world’s largest independent Quality Engineering (QE) providers and digital transformation enablers,...

  • Data Engineer

    2 weeks ago


    Metro Manila, Philippines InvestEd Full time

    InvestEd National Capital Region, Philippines We are looking for a Data Engineer to develop, optimize, and maintain our data infrastructure. In this role, you will be responsible for building and optimizing data pipelines, ensuring cost and compute efficiency, and enabling high-quality, curated datasets for analytics and data science. This role requires...


  • Manila, Philippines Kroll Full time

    Kroll Business Services (KBS) is seeking a Senior Client Platform Engineer, Team Lead to design, build, and maintain the enterprise forms and integrations platform that powers our client data collection websites. This is a hands-on technical role with light team leadership. You’ll guide engineers, manage coverage across the APAC region, and personally...


  • Manila, Philippines CentiForce Full time

    A leading provider of reliability testing solutions is seeking a Senior Sales Engineer in Manila. This full-time role involves providing technical expertise, delivering presentations, and supporting sales strategies. Candidates should have a Bachelor's degree in Engineering and proven sales engineering skills. Experience in reliability testing is preferred....