
System Reliability Specialist
2 days ago
Job Description:
Staff4Me is seeking an experienced DevOps Engineer with expertise in Grafana monitoring tools to join a dynamic team. In this role, the successful candidate will be responsible for implementing and maintaining monitoring solutions using Grafana while collaborating closely with development and operations teams to enhance system performance and reliability.
The ideal candidate will have a strong understanding of monitoring solutions and metrics gathering, as well as experience with cloud platforms like AWS, Azure, or Google Cloud. Docker and Kubernetes experience is also highly desirable.
Key responsibilities include designing, implementing, and maintaining Grafana dashboards to visualize and monitor system performance metrics, collaborating with the DevOps team to manage infrastructure using tools like Terraform and Ansible, analyzing system performance and availability to identify areas for improvement and optimization, participating in incident response by monitoring alerts and providing timely resolutions to outages and performance degradation, and working closely with development teams to integrate monitoring solutions into CI/CD pipelines.
Required Skills and Qualifications:- Proven experience with Grafana and promoting its use in production environments.
- Strong understanding of monitoring solutions and metrics gathering.
- Familiarity with cloud platforms like AWS, Azure, or Google Cloud.
- Experience with Docker and Kubernetes.
- Basic knowledge of scripting languages like Bash, Python, or Go.
- Excellent written and verbal communication skills.
- Experience in managing monitoring tools and solutions (Grafana, Prometheus, ELK stack).
- Proficient in infrastructure as code (Terraform, CloudFormation).
- Hands-on experience with CI/CD tools (Jenkins, GitLab CI/CD, etc.).
- Strong analytical and problem-solving skills with a focus on system reliability.
- Familiarity with Linux/Unix operating systems and networking concepts.
- Competitive salary and performance-based bonuses.
- Health, dental, and vision insurance.
- Flexible working hours and remote work options.
- Opportunities for professional development and training.
- Collaborative and inclusive work environment.
- Mid-Senior level
- Full-time
- Engineering and Information Technology
- Technology, Information and Internet
-
Senior System Reliability Engineer
1 day ago
Pasay, National Capital Region, Philippines beBeeReliability Full time $130,000 - $180,000Job OverviewThis role is responsible for ensuring system reliability, developing observability strategies, leading incident response efforts, and driving continuous improvement across our platform.We're seeking an experienced engineer to take ownership of monitoring, alerting, and incident management for production systems. You'll design data visualizations,...
-
Reliable Infrastructure Specialist
2 days ago
Pasay, National Capital Region, Philippines beBeeReliability Full time ₱1,459,000 - ₱2,191,000System Reliability ExpertWe are seeking a skilled System Reliability Engineer to join our team.Job Description:Design, build, and maintain robust infrastructure to support healthcare applications.Automate deployment processes to enhance system performance and reduce downtime.Monitor and troubleshoot system issues to ensure high availability and...
-
System Reliability Associate
1 week ago
Pasay, National Capital Region, Philippines Private Advertiser Full time $40,000 - $80,000 per yearKey Responsibilities:Resolve production issues promptly, adhering to formal escalation processes.Collaborate with business users and infrastructure teams to ensure timely issue resolution.Actively monitor the health of systems and services and execute recommended recovery steps as needed.Implement strategies to enhance application stability and...
-
Reliable Infrastructure Specialist
1 day ago
Pasay, National Capital Region, Philippines beBeeCardPayment Full time $12,000 - $50,000System Engineer PositionAs a System Engineer, you will play a key role in ensuring the reliability and efficiency of our systems. Your primary responsibility will be to bridge the gap between development and operations, applying software engineering principles to service management.Key Responsibilities:Manage pipeline build and maintenance according to...
-
Reliability Solutions Expert
1 day ago
Pasay, National Capital Region, Philippines beBeesiteengineer Full time $150,000 - $200,000Job OverviewMaintain the reliability, performance, and scalability of our systems as a proactive engineer who automates processes, identifies bottlenecks, and implements best practices to prevent downtime and ensure high availability.
-
Specialist in Reliable Crypto Applications
2 days ago
Pasay, National Capital Region, Philippines beBeeCrypto Full time $100,000 - $150,000Reliable Crypto Applications SpecialistWe are seeking a dedicated and detail-oriented Crypto QA Engineer who is deeply immersed in the crypto ecosystem and passionate about ensuring the reliability and security of our SaaS products. Our ideal candidate has a strong focus on web applications and APIs, with experience in automated testing and API testing.The...
-
Cloud Infrastructure Reliability Expert
2 days ago
Pasay, National Capital Region, Philippines beBeeExpertise Full time $90,000 - $150,000About the RoleWe are seeking a Cloud Infrastructure Reliability Expert to join our team. As a key member of our infrastructure group, you will be responsible for ensuring the reliability and performance of our cloud-based systems.
-
Site Reliability Engineer
2 weeks ago
Pasay, National Capital Region, Philippines beBeeReliability Full time ₱900,000 - ₱1,200,000Site reliability engineers are essential to ensuring the seamless operation of websites and applications. As a site reliability engineer, you will be responsible for the incident management, application performance, configuration management, and operational readiness of products within your ownership.The ideal candidate will have a deep understanding of IT...
-
Engineer, Site Reliability
5 days ago
Pasay, National Capital Region, Philippines Royal Caribbean Group Full time $70,000 - $120,000 per yearPosition SummaryThe Site Reliability Engineer (Senior SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of...
-
Digital Systems Engineer
1 day ago
Pasay, National Capital Region, Philippines beBeeIntegration Full time ₱600,000 - ₱800,000System Integration SpecialistWe are seeking a skilled System Integration Specialist to develop custom front-end interfaces for Manufacturing Execution Systems (MES) and integrate new and existing equipment into broader manufacturing infrastructures.Key Responsibilities:Develop and integrate PLC programs with broader manufacturing systems.Design and develop...