
Engineer, Site Reliability
2 weeks ago
Join to apply for the Engineer, Site Reliability role at Royal Caribbean Group
The Site Reliability Engineer (SRE) will report to the SRE Manager and support the Royal Caribbean website, using application and user performance data to guide informed decision-making. The SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of critical production incidents, bug analysis, implementation of site reliability engineering best practices, infrastructure optimization, and seamless collaboration between internal teams and external service providers, among other operational initiatives.
The ideal candidate will have a deep understanding of, and a proven track record in, IT support. The ideal candidate will also keep an eye on the rapidly evolving technology landscape and implement proactive, preventative measures that avoid technical incidents.
S/he must be able to work with multiple product and project teams simultaneously, thrive in a fast-paced and dynamic environment, and connect unexpected threads across disparate teams.
Essential Duties And Responsibilities
At a high level, responsibilities for this role will include:
- Product Health. Responsible for the incident management, application performance, configuration management, and operational readiness of the products within her/his ownership. Partners and collaborates closely with stakeholders from the various teams within IT to ensure that performance, configuration, and monitoring tools meet the needs of her/his products.
- Incident Management. Responsible for the initial response, triage, and communication of key customer-impacting production incidents on the site, with the goal of restoring systems and applications to normal service operation as quickly as possible and minimizing the impact on guest and crew experience or business operations, so that the best possible service levels and availability are maintained. Analyzes the impact of incidents on the site and determines the root cause by reviewing performance data, including end-user experience, application metrics, and infrastructure metrics. Supports product team initiatives and releases. Synthesizes and communicates incident details to the production team and stakeholders, including executive-level stakeholders. Documents incidents, performs postmortems, and creates next steps as needed.
- Application Performance Management (APM). Ensures the proactive monitoring and management of performance and availability of the software applications within the products s/he is responsible for. Strives to detect and diagnose complex application performance problems to maintain an expected level of service. Provides insight into application performance metrics (errors, exceptions, baseline violations, etc.) to identify the technical impact of bugs and enhancements. Understands key performance metrics (traffic volumes, booking volumes, response times, etc.) to identify the business value of bug fixes and enhancements.
- Configuration Management. Understands the high-level view of website operations to identify performance trends across business processes. Performs daily governance of application monitoring software.
- Change Control Governance. Ensures that all production changes required by the product teams are carried out in a planned and authorized manner, within established change control policies and procedures, and that all changes are thoroughly tested and validated from the monitoring perspective.
- Production Operations Readiness. Ensures all product implementations go through an operational readiness review. Establishes and maintains clear communication channels (e.g., Slack, Teams) with the scrum and marketing teams. Ensures all team members are informed about relevant updates and changes that may affect the website.
Qualifications
- 3-6 years of experience in Site Reliability Engineering (SRE), DevOps, QA, or a related IT operations role.
- Bachelor’s degree in Computer Science, Information Technology, Computer Engineering, or other relevant advanced degree preferred.
- Technical Expertise:
  - Proficiency in cloud platforms such as AWS, including AWS Elastic Beanstalk.
  - Understanding of API design principles: REST, SOAP, GraphQL.
  - Advanced knowledge of monitoring and logging tools (AppDynamics, Datadog, Splunk, New Relic, etc.).
  - Familiarity with Adobe AEM Cloud is preferred, as it helps enhance system performance and reliability.
- Problem-Solving Skills:
  - Strong analytical and troubleshooting skills to diagnose and resolve complex production issues swiftly.
  - Ability to develop and implement effective incident response plans.
- Communication and Collaboration:
  - Excellent written and verbal communication skills for effective interaction with cross-functional teams and documentation.
  - Ability to collaborate with Development, QA, IT, and external managed service providers to ensure seamless operations.
- The SRE may be required to participate in an on-call rotation to handle urgent incidents and ensure 24x7 system reliability.
- On-call duties may include evenings, weekends, and holidays as needed.
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Engineering and Information Technology
- Industries: Travel Arrangements
Site Reliability Engineer
2 weeks ago
Pasay, Philippines Vestas Full time
Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We...
-
Engineer, Sr. Site Reliability
4 days ago
Pasay, National Capital Region, Philippines Royal Caribbean Group Full time $90,000 - $120,000 per year
The Senior Site Reliability Engineer (Senior SRE) will report to the SRE Manager in support of the Royal Caribbean website by utilizing application and user performance data to guide informed decision-making. The Senior SRE will use application and user performance metrics collected from various sources and tools to support tasks such as initial triage of...
-
Site Reliability Engineer
4 days ago
Pasay, National Capital Region, Philippines Vestas Full time ₱1,200,000 - ₱1,500,000 per year
Are you ready to guide the development of innovative infrastructure solutions for a technology-focused entity in the renewable energy sector? We are seeking a Senior Systems Engineer committed to automation, monitoring, and asset management, someone who takes charge of what happens next and promotes continuous improvement in our digital landscape. This is a...
-
Site Reliability Engineer
2 weeks ago
Pasay, Philippines - Full time
Overview: Do you enjoy enabling colleagues to do their job even better, faster, and with less friction? Do you enjoy working with infrastructure and making provisioning and operations as smooth as possible? If yes, you may be the right person to join our Database Management and Platform Product Team! You will be part of the team responsible for the database...
-
Site Engineer
4 days ago
Pasay, National Capital Region, Philippines Juster Property Management Full time ₱900,000 - ₱1,200,000 per year
We are looking for a skilled and motivated Site Engineer / Architect to handle property repairs, maintenance, and site improvements. If you have the drive to lead on-site works, coordinate teams effectively, and ensure quality execution, we'd love to work with you. Responsibilities: Plan and oversee project requirements, timelines, and deliverables. Implement...
-
Site Engineer
1 week ago
Pasay, Philippines Juster Property Management Full time
Overview: We are looking for a skilled and motivated Site Engineer / Architect to handle property repairs, maintenance, and site improvements. If you have the drive to lead on-site works, coordinate teams effectively, and ensure quality execution, we’d love to work with you. Responsibilities: Plan and oversee project requirements, timelines, and...
-
Telco Site Engineer
2 days ago
Pasay, National Capital Region, Philippines SINIAN INT'L CORPORATION Full time ₱480,000 - ₱1,440,000 per year
Job Title: Site Engineer for Telecommunication Projects
Job Description: The Site Engineer for Telecommunication (Telco) Projects will oversee and manage on-site construction activities for telecommunication towers and related infrastructure. This role is responsible for ensuring the successful execution of projects, maintaining quality and safety standards,...
-
Civil Site Engineer
2 weeks ago
Pasay, National Capital Region, Philippines COMMSEC INC. Full time ₱600,000 - ₱1,200,000 per year
Job Responsibilities: Assist the project manager in managing construction site activities, reasonably arrange the construction schedule based on project planning, and ensure timely delivery of the project. Actively participate in the civil engineering design phase, providing professional suggestions based on on-site conditions to ensure the feasibility of...
-
Pasay, National Capital Region, Philippines SMDC Full time ₱800,000 - ₱1,200,000 per year
Job Summary: The On-Site Design Officer is responsible for overseeing design-related aspects of construction projects directly at the site. This role ensures that the design intent is accurately implemented, coordinates with contractors and consultants, and resolves design issues during construction. The architect/engineer acts as a bridge between design and...
-
Mechanical Engineer
2 weeks ago
Pasay, Philippines Top Source Maintenance And Contracting Services, Inc. Full time
Overview: Design, develop, and test mechanical devices and systems. Collaborate with cross-functional teams to enhance product performance. Utilize CAD software for modeling and simulations. Conduct failure analysis and reliability assessments. Ensure compliance with industry standards and regulations. Provide technical support and troubleshooting...