Senior Site Reliability Engineer

hace 6 días


San José, Costa Rica Nucleus Health A tiempo completo

A U.S.based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and advisers through its customized online and telephonic technology solutions. The company has managed to secure more than $288mn in funding so far.

**Responsibilities**:

- Architect, automate, and manage engineers' and corporate users' platforms
- Contribute to the creation of plans for moving current environments and services to the cloud
- Create and maintain CI/CD pipelines for a range of on-premises and cloud apps

**Job Requirements**:

- Bachelor’s/Master’s degree in Engineering, Computer Science (or equivalent experience)
- At least 8+ years of relevant experience as a DevOps, System Administration, or SRE Engineer
- At least 5+ years of experience working with Azure
- 5+ years of experience working with virtualization platforms including VMware, Docker, and Kubernetes
- 3+ years of experience with configuration management tools like SaltStack, Ansible, or Puppet
- 3+ years of experience with Azure DevOps for CI/CD to multi-cloud and on-prem
- Thorough understanding of the operation and data flow of n-tier web apps
- Strong knowledge of a variety of web hosting technologies, including IIS, Nginx, and Apache
- Prior knowledge in maintaining e-commerce websites around-the-clock
- Thorough awareness of DR, HA, and fault best practices in a cloud deployment based on K8s
- Solid comprehension of the OSI model, DNS, NTP, and vLANs, among other networking essentials
- Extensive knowledge of KPI monitoring and alerting
- Prolific Windows, Linux, and cloud experience
- In-depth knowledge and experience in Linux administration including Ubuntu, Debian, and Centos
- Nice to have some experience with security standards like CIS, NIST, or SANS
- Prior experience using scripts and APIs as part of monitoring tools like Zabbix is desirable
- Some familiarity with SAML, particularly utilizing Okta for SSO is preferred
- Nice to have some knowledge of using Azure DevOps to automate tedious activities and create modern CI/CD pipelines
- Excellent spoken and written English communication skills

**Job Type**: Contract


  • Site Reliability Engineer

    hace 2 semanas


    San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...

  • Site Reliability Engineer

    hace 3 semanas


    San Pedro, Costa Rica CRG Solutions A tiempo completo

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    hace 4 semanas


    San José, Costa Rica VS-Staffing A tiempo completo

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...


  • San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?****Are you a self-starting professional who thrives in a dynamic environment?****Join the Akamai SRE Infrastructure team**As Site Reliability Engineer Senior II, youll be responsible for the operational stability and performance of critical systems and services.Part of a...


  • San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join the Akamai SRE Infrastructure team** As Site Reliability Engineer Senior II, youll be responsible for the operational stability and performance of critical systems and services. Part of a...


  • San José, San José, Costa Rica Nucleus Health A tiempo completo

    A U.S.based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer.The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness.The company is developing a knowledge-commerce platform that connects clients and...


  • San José, Costa Rica Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...

  • Site Reliability Engineer

    hace 4 semanas


    Ubicación San José, San José, Costa Rica Udersol A tiempo completo

    Requisitos: Technical Requirements: - Bachelor-s degree in computer science, IT or other highly technical, scientific discipline. - 3+ Years experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Equifax A tiempo completo

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles._- SREs in our team take...


  • San José, Costa Rica VS-Staffing A tiempo completo

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...

  • Site Reliability Engineer

    hace 3 semanas


    San José, Costa Rica Sysdig A tiempo completo

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...


  • San José, Costa Rica Scalable Systems A tiempo completo

    Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...


  • San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join our Site Reliability team** **Help us shape the future of the Internet** As a Site Reliability Engineer, you will be responsible for: - Deploying, managing, and operating scalable,...

  • Site Reliability Engineer

    hace 4 semanas


    San José, San José, Costa Rica Scalable Systems A tiempo completo

    Scalable Systems?is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions.By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge.**Openings**:**Title**: Site Reliability...

  • Site Reliability Engineer

    hace 3 semanas


    San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?****Are you a self-starting professional who thrives in a dynamic environment?****Join the Akamai SRE Infrastructure team**As Site Reliability Engineer II youll be responsible for the operational stability and performance of critical systems and services.Part of a Global team...


  • San José, San José, Costa Rica Wikimedia Foundation A tiempo completo

    **Staff Site Reliability Engineer (Traffic)****Summary**We are looking for a Staff Site Reliability Engineer to support and develop the platform serving the world's favorite encyclopedia to millions of people around the globe.Wikimedia's Site Reliability Engineering (SRE) team is principally responsible for ensuring our global top-15 website, our...


  • San José, San José, Costa Rica Vs-Staffing A tiempo completo

    About the JobThis role is an exciting opportunity to join our team as a Site Reliability Engineer.You will be working closely with our IT team to deploy security controls and measures that safeguard against future incidents while ensuring system compliance and reliability.The successful candidate will have experience in site reliability engineering and be...


  • San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join the Akamai SRE Infrastructure team** As Site Reliability Engineer II youll be responsible for the operational stability and performance of critical systems and services. Part of a Global...