Sr. Site Reliability Engineer

hace 2 meses


San José, Costa Rica VS-Staffing A tiempo completo

Job Description - Sr. Site Reliability Engineer

**Title**:
Sr. Site Reliability Engineer

**Location**:
Remote, based in Costa Rica

**Job Overview**:
**Key responsibilities include**:

- Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team.
- Strategy Development: Lead the creation and execution of sophisticated strategies for system optimization, ensuring scalability, reliability, and security at all levels.
- System Architecture Contribution: Engage in the design and review of system architecture, advocating for security and reliability best practices.
- Advanced Incident Management: Manage complex security incidents with expertise, guiding the team during crisis situations and ensuring swift, effective resolutions.
- Cross-Functional Collaboration: Serve as a primary liaison between the SRE team, IT department, and other technical and business units, driving cohesive efforts towards shared organizational goals.
- Innovation and Research: Champion innovation by researching, advocating for, and implementing cutting-edge technologies and methodologies to enhance system reliability and security.
- Procedure Development: Formulate and maintain up-to-date incident response procedures and playbooks, ensuring their effectiveness and compliance with industry standards.
- Post-Incident Analysis: Conduct thorough post-incident reviews, deriving insights and recommendations to prevent recurrence and improve system security and reliability.
- Collaboration and Detection: Work closely with the vSOC to enhance detection and reporting mechanisms for timely incident response.
- Threat and Vulnerability Assessment: Provide expertise in threat analysis, conduct vulnerability assessments, and perform penetration testing using leading-edge tools and techniques.
- Security Measures Implementation: Partner with the IT team to deploy security controls and measures that safeguard against future incidents while ensuring system compliance and reliability.
- Stakeholder Engagement: Develop and maintain relationships with key external stakeholders, staying abreast of the latest security trends and practices.
- Technology Proficiency: Utilize and manage advanced incident response and reliability tools, including Splunk, Crowdstrike Falcon Complete, and MS Defender, among others.

**Preferred Qualifications and Experience**:

- Educational Background: Bachelor's degree in Computer Science, Information Technology, or equivalent experience. Advanced degrees or specialized certifications in site reliability engineering or cybersecurity are preferred.
- Professional Experience: A minimum of 5-7 years in cybersecurity, with extensive experience in site reliability engineering, including leadership roles or substantial project management experience.
- Technical Expertise: Deep understanding of cyber threats, attack methodologies, incident response techniques, and a solid grasp of NIST and ISO 27001 frameworks, with the ability to lead in architecture design, advanced troubleshooting, and performance optimization.
- Leadership Skills: Demonstrated leadership capabilities, with experience in guiding projects, mentoring team members, and leading by example in a high-stakes environment.
- Strategic Planning: Proven track record in strategic planning and execution, aligning technical projects with broader business objectives.
- Tools Proficiency: Expertise in using incident response tools and technologies such as SIEM, XDR, and threat intelligence platforms, with advanced knowledge in Splunk administration and other critical technologies.
- Analytical Skills: Exceptional analytical and problem-solving abilities, capable of sifting through large data sets to identify and address security incidents effectively.
- Communication: Strong communication skills, with the capacity to articulate complex technical information clearly to both technical and non-technical stakeholders.
- Adaptability: Ability to thrive in a fast-paced, ever-changing environment, showing flexibility and a commitment to continuous learning and improvement.
- Desirable Skills: Familiarity with Qualys, Contrast Security, KnowBe4 PhishER, PCI, and SOX compliance, along with experience in using Pager Duty, Jira, and Confluence, is advantageous.

**Desirable Skills**:

- Advanced Technical Skills: Experience with leading-edge technologies or methodologies, such as cloud-native technologies, Kubernetes, or advanced automation and orchestration platforms.
- Industry Leadership: Contributions to the field through speaking engagements, publications, or active participation in relevant professional communities are highly valued.



  • San José, Costa Rica OfficeSpace A tiempo completo

    OfficeSpace Software is the workplace management platform enabling the future of work, with software that helps teams plan, connect, and perform in the hybrid workplace. 1,000 of the world’s top organizations use OfficeSpace to get the most out of their space and connect the people in it, with intuitive space planning, desk and room booking, employee...


  • San José, San José, Costa Rica OfficeSpace A tiempo completo

    OfficeSpace Software is the workplace management platform enabling the future of work, with software that helps teams plan, connect, and perform in the hybrid workplace. 1,000 of the world's top organizations use OfficeSpace to get the most out of their space and connect the people in it, with intuitive space planning, desk and room booking, employee...


  • San José, San José, Costa Rica Datasite A tiempo completo

    Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What's yours? Invest...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-230001K1Applicants are required to read, write, and speak the following languages: EnglishPreferred QualificationsSolve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle...


  • San José, Costa Rica Encora A tiempo completo

    **Important Information** Experience: + 5 years Job Mode: Full-time Work Mode: Work from home **Job Summary** As a **_Senior Site Reliability Engineer (6632)_**, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers'...


  • San José, Costa Rica Encora A tiempo completo

    **Important Information** Experience: + 5 years Job Mode: Full-time Work Mode: Work from home **Job Summary** As a **_Senior Site Reliability Engineer (6632)_**, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers'...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica CRG Solutions A tiempo completo

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical andprocess guidance specific to a business unit. Key areas of impact this role provides are in depth knowledgeof the engineering environments within the specific business unit and providing automated, stable, andAutomation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    hace 3 semanas


    San Pedro, Costa Rica CRG Solutions A tiempo completo

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...


  • San José, Costa Rica VS-Staffing A tiempo completo

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...

  • Site Reliability Engineer

    hace 2 semanas


    San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

    **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain,...

  • Site Reliability Engineer

    hace 2 semanas


    San José, Costa Rica Hitachi Solutions A tiempo completo

    Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    hace 4 semanas


    Ubicación San José, San José, Costa Rica Udersol A tiempo completo

    Requisitos: Technical Requirements: - Bachelor-s degree in computer science, IT or other highly technical, scientific discipline. - 3+ Years experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

    Company DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Hitachi Solutions A tiempo completo

    Company DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    hace 3 semanas


    San José, Costa Rica Sysdig A tiempo completo

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Equifax A tiempo completo

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ SREs in our team...

  • Site Reliability Engineer

    hace 2 semanas


    Ubicación San José, San José, Costa Rica Udersol A tiempo completo

    Requisitos:Technical Requirements: Bachelors degree in computer science, IT or other highly technical, scientific discipline. 3+ Years experience in a Site Reliability role. Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and Terraform...

  • Site Reliability Engineer

    hace 4 semanas


    San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, Costa Rica Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • San José, San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _What you'll do: You will influence and design the infrastructure, architecture, standards, and methods for largescale systems. Will support...