Site Reliability Operations Engineer III: Architect of High Availability

hace 2 días


San José, San José, Costa Rica Zuora A tiempo completo
About Zuora

We're revolutionizing the way businesses operate by empowering them to adopt new, sustainable models. Our team is dedicated to delivering exceptional services that prioritize security, resilience, and performance.

Role Overview

This role offers an exciting opportunity for a skilled Service Reliability Operations (SRO) professional to join our team. As an SRO, you will be responsible for ensuring our services are designed and delivered with security, resiliency, scale, and performance in mind.

Key Responsibilities:
  • Incident Management: Drive Command Center Incident Bridges to resolution for customer issues.
  • Observability and Escalation: Respond to Observability Alerts/Alarms and escalated issues from Customer Support.
  • Runbook Development: Write and automate runbooks to reduce alerts/incidents and service requests.
  • Collaboration: Partner with service owners to make services rock-solid and efficient.

You will serve as a key escalation point for issues documented as Standard Operating Procedures (SOPs) or requiring in-depth troubleshooting and analysis. Your tasks will include maintaining up-to-date documentation on deployments, processes, and SOP runbooks.

Requirements:
  • Technical Expertise: Strong understanding of service topology and dependencies for troubleshooting and mitigation.
  • Programming Skills: Competence in shell scripting and high-level programming languages such as Bash, Ansible, Python, Terraform.
  • Soft Skills: Proactive, self-motivated, customer-focused, organized, and excellent communication skills.
  • Experience: Over 4 years of experience in a related field.
Salary Information

The estimated salary for this role is $145,000 - $200,000 per year, depending on location and experience.



  • San José, San José, Costa Rica Zuora A tiempo completo

    **About Zuora**Zuora is a leading company in the Subscription Economy, helping businesses transform their approach to customer relationships and growth. Our mission is to empower companies to build recurring relationships with their customers, focusing on sustainable growth and customer satisfaction.**The Role**We are seeking a highly skilled Site...

  • Site Reliability Engineer

    hace 3 semanas


    San José, San José, Costa Rica Netskope A tiempo completo

    About NetskopeThe cloud security landscape is rapidly evolving, and Netskope is at the forefront of this shift. As a Site Reliability Engineer, you will play a crucial role in ensuring the Netskope platform is highly available, performant, and secure.Key Responsibilities:Partner closely with development teams and product managers to architect and build...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Netskope A tiempo completo

    About NetskopeNetskope is a cloud security company that has been leading the market since 2012. We have a strong culture of openness, honesty, and transparency, which is reflected in our open desk layouts and large meeting spaces. Our team is passionate about building a secure and reliable cloud platform, and we're looking for a skilled Site Reliability...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Equifax A tiempo completo

    At Equifax, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our internal and external services. This involves taking an engineering approach to building and running large-scale, distributed, fault-tolerant systems.**Key...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Equifax A tiempo completo

    At Equifax, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our large-scale, distributed systems.**Key Responsibilities:**• Engage in and improve the software development lifecycle, from inception and design through development,...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Fullstack Labs A tiempo completo

    We're looking for a talented Site Reliability Engineer to join our team at FullStack Labs. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our clients' software systems.Key Responsibilities:Design and implement scalable, high-quality software systemsCollaborate with cross-functional...


  • San José, San José, Costa Rica Vs-Staffing A tiempo completo

    Job Title: Senior Site Reliability EngineerJob Overview:At Vs-Staffing, we are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for ensuring the reliability, scalability, and security of our systems.Key Responsibilities:Leadership and Mentorship: Direct and mentor junior...


  • San José, San José, Costa Rica Netskope A tiempo completo

    About NetskopeNetskope is a leading cloud security company that has been redefining cloud, network, and data security since 2012. Our mission is to protect data wherever it goes, and we've built a market-leading cloud security platform to achieve this goal.Job SummaryWe're seeking a highly skilled Senior Cloud Reliability Engineer to join our team. As a key...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Datasite A tiempo completo

    Datasite is a global leader in M&A and high-value transactions, providing data rooms and SaaS technology to deliver projects worldwide. We're looking for a skilled Site Reliability Engineer to join our team and help us carry our success into the future.**Job Description:**The successful candidate will assist in maturing our organization's operational...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Vs-Staffing A tiempo completo

    Job Description - Cybersecurity Incident Response Specialist**Job Title:**Cybersecurity Incident Response Specialist**Overview:**At Vs-Staffing, we are seeking a highly skilled Cybersecurity Incident Response Specialist to join our team. As a key member of our cybersecurity team, you will be responsible for leading the response to security incidents through...


  • San José, San José, Costa Rica Encora A tiempo completo

    **Job Summary**As a Senior Cloud Reliability Engineer at Encora, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers' business results by innovating cutting-edge digital products.Your responsibilities will include...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our product organization. The ideal candidate will have extensive experience in designing and implementing Continuous Integration/Continuous Deployment (CI/CD) tooling using GitHub Actions / Azure DevOps, as well as related technologies like...

  • Cloud Security Engineer

    hace 3 semanas


    San José, San José, Costa Rica Netskope A tiempo completo

    About NetskopeNetskope is a leading provider of cloud security solutions, helping organizations protect their data and applications in the cloud. As a Cloud Security Engineer, you will play a key role in designing and implementing our security solutions to ensure the confidentiality, integrity, and availability of our customers' data.About the RoleThe...


  • San José, San José, Costa Rica Nucleus Health A tiempo completo

    Senior Site Reliability Engineer Job DescriptionNucleus Health, a U.S.-based company on a mission to develop the world's largest online marketplace and media platform, is seeking a Senior DevOps/SRE Engineer. This role involves collaborating with cross-functional teams to enhance system performance, reliability, and effectiveness.About Nucleus Health:Nucleus...


  • San José, San José, Costa Rica Encora A tiempo completo

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Encora. As a key member of our digital engineering team, you will be responsible for designing, implementing, and maintaining cutting-edge cloud infrastructure solutions on AWS.ResponsibilitiesLead and participate in the design, development, and delivery of...


  • San José, San José, Costa Rica Netskope A tiempo completo

    Netskope is a leading cloud security company that requires a skilled Cloud Security Engineer to join our team. As a Site Reliability Engineer at Netskope, you will be responsible for designing, developing, and implementing strategies to improve the reliability of our production services.The ideal candidate will have experience with working in private or...


  • San José, San José, Costa Rica Vs-Staffing A tiempo completo

    Vs-Staffing is seeking a Site Reliability Engineer to join our team. As a key member of our cybersecurity department, you will be responsible for leading incident response efforts and developing strategies to mitigate threats.The ideal candidate will have a comprehensive understanding of cyber threats and attack methodologies, as well as expertise in Splunk...


  • San José, San José, Costa Rica Splunk Inc A tiempo completo

    About the RoleWe are seeking a highly skilled TechOps SRE to join our Splunk Cloud TechOps team. As a key member of our team, you will be responsible for maintaining, contributing to, and improving the next generation of our large-scale Cloud offering.Key ResponsibilitiesOperate and maintain large-scale cloud infrastructure, ensuring high availability and...

  • Solutions Architect

    hace 3 semanas


    San José, San José, Costa Rica Netskope A tiempo completo

    About NetskopeNetskope is a leading cloud security company that requires a team of experts to design, build, and manage complex computing and data systems. We are seeking a highly skilled Solutions Architect - Data Services to join our team.Responsibilities:Design, develop, and implement strategies for Netskope's production data services.Collaborate with...


  • San José, San José, Costa Rica Rejuve A tiempo completo

    **Position: Cloud Systems Architect****Location**:100% Remote**About Rejuve.AI**Rejuve.AI is an emerging spin-off project of SingularityNET, focused on extending the healthy human lifespan by creating a decentralized self-sustained research community powered by blockchain, AI, and the valuable contributions of data and AI models. Rejuve.AI's core mission is...