Site Reliability Engineer
7 days ago
About the Role:
We are seeking a skilled and motivated Site Reliability Engineer (SRE) with expertise in supporting and managing MQ and Kafka systems. The ideal candidate will have a strong background in Unix systems administration, experience with Kubernetes (preferred), and a passion for maintaining high availability, performance, and reliability in distributed systems.
Key Responsibilities:
· Provide technical guidance and assist application teams with adoption of MQ Encryption in Transit.
· Manage and Support MQ & Kafka Systems: Monitor, maintain, and troubleshoot IBM MQ and Kafka clusters to ensure optimal performance and reliability. Handle incident management, including root cause analysis and post-mortem reviews.
· Automation and Scripting: Develop and maintain automation scripts to streamline operational processes, deployment pipelines, and monitoring solutions using tools such as Ansible, Python, and/or shell scripts.
· Monitoring and Alerting: Implement and manage monitoring tools (e.g., Prometheus, Grafana) to track the health and performance of MQ, Kafka, and related systems. Create and manage alerting mechanisms to proactively identify and resolve issues.
· Performance Tuning and Optimization: Continuously monitor system performance, identifying and resolving bottlenecks. Implement best practices for scaling Kafka and MQ clusters.
· Collaboration and Support: Work closely with application development, DevOps, and other engineering teams to support new and existing applications.
· Documentation: Maintain clear and comprehensive documentation for system configurations, procedures, and troubleshooting guides.
Qualifications
Experience: 5+ years of experience as a Site Reliability Engineer or in a similar role, with hands-on experience in supporting MQ (e.g., IBM MQ) and Confluent Kafka.
Technical Skills: Experience managing and supporting a large IBM MQ and/or Kafka estate. At least one of the two administration areas below (IBM MQ or Confluent Kafka) is mandatory.
o IBM MQ Administration:
· Installation & Configuration: Proficiency in installing and configuring IBM MQ.
· Queue Management: Creating, configuring, and managing queues, channels, listeners, and other MQ objects.
· Security Configuration: Implementing SSL/TLS, access control lists (ACLs), and MQ object security.
· Troubleshooting: Diagnosing and resolving MQ performance-related issues.
o Confluent Kafka Administration:
· Installation & Configuration: Proficiency in installing and configuring Kafka brokers, ZooKeeper, Kafka Connect, and Schema Registry on various platforms.
· Cluster Management: Skills in managing Kafka clusters, including adding/removing brokers, partitioning, and replication strategies.
· Kafka Broker Configuration: Deep understanding of broker configurations such as log retention, segment sizes, and in-sync replica (ISR) management.
· Experience managing Kafka on Kubernetes is preferred.
o Familiarity with CI/CD pipelines and related tools (e.g., Jenkins, Git).
o Experience with configuration management tools (e.g., Ansible).
o Proficiency in Unix/Linux administration.
o Familiarity with networking concepts and utilities.
o Proficiency with the Python programming language.
o High Availability (HA) & Disaster Recovery (DR): Setting up and maintaining HA clusters and disaster recovery solutions, including replication and failover mechanisms.
Soft Skills
· Strong problem-solving skills and attention to detail.
· Excellent communication and collaboration skills.
· Ability to work in a fast-paced, dynamic environment.
Job Types: Full-time, Permanent
Pay: Php120,000.00 - Php180,000.00 per month
Work Location: In person