
Senior Site Reliability Engineer
2 weeks ago
About Penbrothers
Penbrothers is an HR & remote talent management partner and one of the fastest-growing companies in the Philippines. We provide talented Filipinos with global opportunities in high-growth startups and dynamic companies, from the comfort of their own homes.
About the Client
The client is a pioneer in medical recruitment, is seeking an experienced Tech Lead to drive their mission to enhance doctors' well-being. This is an opportunity to contribute your unique skills and expertise to create technology that truly matters, impacting lives on a daily basis
About the Role
We are looking for a Senior SRE/DevOps Specialist to play a vital role in ensuring the reliability of our Salesforce and web/mobile application environments. You will work closely with our engineers to continually improve and enhance our platform leaning towards world class best practices.
Service reliability and observability
- Analysing resource utilization and forecasting capacity needs to ensure the system can handle expected traffic and workloads without performance issues.
- Writing code and scripts to automate repetitive operational tasks, configuration management, and deployment processes to reduce human error and increase efficiency.
- Managing changes to production systems and services, ensuring that new releases and configuration changes are rolled out with minimal disruption and risk.
- Identifying and addressing performance bottlenecks, optimizing software and infrastructure to improve response times and reduce resource consumption.
- Maintaining thorough documentation of systems, configurations, and incident response procedures to facilitate knowledge sharing and onboarding of new team members.
- Defining and maintaining service level objectives that specify the acceptable level of service quality, such as uptime and latency, for a particular system or service.
- Defining the key performance metrics and indicators that will be used to measure the system's performance and reliability, such as error rates and response times.
- Designing and implementing monitoring systems to track the SLIs and using alerting mechanisms to notify the team when the system deviates from its defined SLOs.
Incident management & Disaster recovery planning
- Responding to and mitigating incidents that impact service availability or performance,
- following an incident management process, and conducting post-incident reviews to learn and improve.
- Planning and implementing and executing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.
Security
- Ensure systems and infrastructure are securely configured and hardened by default
- Manage secrets, credentials, and access controls across environments
- Monitor for security-related events and support incident response efforts
- Maintain secure CI/CD pipelines and enforce safe deployment practices
- Planning and implementing disaster recovery and backup strategies to ensure data and service availability in case of failures or disasters.
Continuous Improvement
- Continuously evaluating and improving system reliability, efficiency, cost optimization and automation to meet our evolving business needs and customer expectations.
- Rationalizing, evaluating and integrating 3rd party developer tooling and services.
- Troubleshooting platform issues with development teams
- Providing tooling support and access management for development teams
- Stay ahead of the tech curve, bringing new tools and frameworks to the table
What You Bring
- A degree in IT, Computer Science, or relevant experience
- 7+ years in software engineering roles, with extensive experience as an SRE/DevOps
- Engineer with exposure to the Salesforce environment.
- Expertise in implementing and managing monitoring and alerting solutions (CloudWatch, Sentry, Sonar Cloud, Jira Service Management) to ensure system health.
- Experience in incident response and troubleshooting complex issues in production environments.
- Proficiency in programming and scripting languages, such as Python, Java, or Typescript.
- Experience with container orchestration tools like Kubernetes and containerization technologies like Docker.
- Familiarity with cloud platforms (AWS, Azure) and infrastructure-as-code tools (Terraform, Ansible).
- Strong automation skills to streamline repetitive tasks and improve operational efficiency.
- Understanding of reliability engineering principles and practices, including designing for failure and implementing fault-tolerant systems.
- Proven experience in designing and implementing scalable and high-performance systems.
- Knowledge of load balancing, caching strategies, and other techniques for optimizing system performance.
- Strong interpersonal and communication skills are essential for collaborating with cross functional teams.
- Awareness of security best practices and the ability to contribute to the security aspects of the system.
Hiring Process
We utilize AI tools to enhance our hiring efficiency and ensure a fair evaluation of all candidates. As a result, candidates who passed our initial evaluations should expect an AI Interviewer as a component of our recruitment process. This is supervised by Human Talent Acquisition Experts who will also engage with you throughout your application journey.
What You'll Get
At Penbrothers, we are obsessed with creating positive employee experiences. Here you'll find an environment that nurtures learning and provides opportunities for growth. You'll have the opportunity to make an impact on fast-growing startups and dynamic companies.
- Meaningful work & Growth: We take every opportunity to stretch ourselves and deliver an excellent client experience.
- Employee as our biggest asset: We are genuinely invested in our people's career and welfare.
- Global reach & local impact: Get to work with high-growth startups and dynamic companies from the comfort of your own home.
- Powering global startups: We've created 1,400 Filipino jobs that empower global start-ups to focus on growth.
-
Site Reliability Engineer
2 weeks ago
Mandaluyong City, National Capital Region, Philippines Maya Bank Full time $80,000 - $100,000 per yearMaya Mandaluyong, National Capital Region, PhilippinesSite Reliability Engineer (IAU)Maya Mandaluyong, National Capital Region, Philippines3 days ago Be among the first 25 applicants Work on an environment driven by automation. Build and simplify infrastructure resource deployment by creating reusable templates. Advanced knowledge in AWS with...
-
Senior Site Reliability Engineer
2 weeks ago
Mandaluyong City, National Capital Region, Philippines The Dairy Farm Company, Limited- ROHQ Full time $90,000 - $120,000 per yearAs a Site Reliability Engineer (SRE) at DFI Retail Group, you will be the bridge between development and operations, ensuring our systems are designed, implemented, and maintained for maximum reliability, scalability, and performance. You will leverage your software engineering expertise to automate operations, optimize system performance, and develop...
-
Site Reliability Engineer
2 weeks ago
Mandaluyong City, National Capital Region, Philippines DFI Retail Group Full time $90,000 - $120,000 per yearDFI Team BriefAs a Site Reliability Engineer (SRE) at DFI Retail Group, you will be the bridge between development and operations, ensuring our systems are designed, implemented, and maintained for maximum reliability, scalability, and performance. You will leverage your software engineering expertise to automate operations, optimize system performance, and...
-
Site Reliability Engineer
2 weeks ago
Makati City, National Capital Region, Philippines Royal Caribbean International Full time $80,000 - $100,000 per yearGet AI-powered advice on this job and more exclusive features. Site Reliability Engineer (SRE) will assist the SRE team in support of the Royal Caribbean website using application and user performance data to guide informed decision making. The SRE will use site performance metrics collected by various sources and tools to support the following tasks: the...
-
Site Reliability Engineer
2 weeks ago
Mandaluyong City, National Capital Region, Philippines Maya Bank Full time $80,000 - $100,000 per yearMaya Mandaluyong, National Capital Region, PhilippinesSite Reliability Engineer (Banking)Maya Mandaluyong, National Capital Region, Philippines1 month ago Be among the first 25 applicants This role will heavily contribute in the setup, maintenance, and configuration of Maya's cloud infrastructure with significant focus on: security, network, performance,...
-
Senior Site Reliability Engineer
2 weeks ago
Makati City, National Capital Region, Philippines Royal Caribbean International Full time $90,000 - $120,000 per yearSenior Site Reliability Engineer (Sr. SRE) will support the Royal Caribbean website by analyzing application and user performance data to inform decision-making. The Sr. SRE will utilize site performance metrics from various sources and tools to:Assist in triaging critical production incidents Analyze bugs and implement best practices in site reliability...
-
Senior Site Reliability Engineer
2 weeks ago
Makati City, National Capital Region, Philippines eTap Inc. Full time ₱900,000 - ₱1,200,000 per yeare Tap Inc. Makati, National Capital Region, PhilippinesSenior Site Reliability EngineereTap Inc. Makati, National Capital Region, Philippines1 day ago Be among the first 25 applicants Direct message the job poster from e Tap Inc.Human Resources Manager at Electronic Transfer and Advance Processing Inc.About Electronic Transfer and Advance Processing Inc (e...
-
Site Reliability Engineer
2 weeks ago
Makati City, National Capital Region, Philippines Globant Full time $80,000 - $100,000 per yearGlobant Makati, National Capital Region, PhilippinesSite Reliability EngineerGlobant Makati, National Capital Region, PhilippinesWe are seeking a motivated and experienced Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will have a strong background in application performance monitoring, logging and tracing, and web performance...
-
Site Reliability Engineer
2 weeks ago
Mandaluyong City, National Capital Region, Philippines The Penbrothers International, Inc. Full time $90,000 - $120,000 per yearAbout PenbrothersPenbrothers is an HR & remote talent management partner and one of the fastest growing companies in the Philippines. We provide talented Filipinos with global opportunities in high-growth startups and dynamic companies. About the ClientOur client is a purpose-driven organization and company headquartered in Sweden, operating with a globally...
-
Site Reliability Engineering Specialist
2 weeks ago
Makati City, National Capital Region, Philippines Electronic Transfer and Advance Processing Inc. Full time $90,000 - $120,000 per yearJob DescriptionWe are seeking a Senior Site Reliability Engineer (SRE) to lead the design, deployment, and management of highly available and scalable AWS cloud infrastructure. This role will focus on building automation solutions, optimizing system performance, and strengthening the reliability and security of cloud services. As a senior member of the team,...