Site Reliability Engineer

4 days ago


San José, Costa Rica Splunk Inc Full-time

Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we’re committed to our work, customers, having fun and most importantly to each other’s success. Learn more about Splunk careers and how you can become a part of our journey.

The Cloud organization at Splunk focuses on building and maintaining robust and resilient platform solutions for SaaS hosting of Splunk’s enterprise software. Our main technologies are Cloud Infrastructure based, focusing on Puppet and Terraform. The Splunk Cloud TechOps team is globally distributed with teams based in SF, Plano, Sydney, London, Costa Rica and India. It is the responsibility of the TechOps engagement team to monitor and resolve issues that affect the availability and performance of Splunk for our cloud customers 24/7. As the authority on our customers’ experience, the engagement team is the frontline of defense in making sure each of our customers has an exceptional experience. The TechOps engagement team provides a backstop for all staff on any questions or issues that arise during their shift related to their technical area of expertise. The TechOps engineers lead their respective queue and ensure all requests coming into that queue are addressed in a timely manner.

Responsibilities:

We are looking for a TechOps Engineer to help maintain, contribute to, and improve the next generation of our large scale Cloud offering. You will be working with providers and supporting the infrastructure that powers Splunk’s cloud offering.

  • Provide technical support for the Splunk Cloud fleet.
  • Perform impact assessments and problem solving according to established procedures.
  • Document issues and remediation steps, and help with follow-up problem management.
  • Assist other TechOps engineers on your shift with complex tasks.
  • Use the internal tools to restore normal service operations as quickly as possible to minimize the impact to business operations during escalated incidents.
  • Lead by example and drive the core values of the company.
  • Always ensure a quality customer experience.
  • You love large complex systems. You have experience working on distributed systems or a passion for finding edge cases that appear at scale. You're interested in how to take a small one-off task and implement it across several thousand machines at once.
  • Data drives your decisions and excites you: you make decisions based on numbers rather than assumptions. If an issue arises, you strive to be alerted before our customers notice.

What We Provide:

  • Opportunities to develop and grow as an engineer. We are always expanding into new areas, working with cross-collaborative teams, and exploring new technologies.
  • A team of awesome, capable and dedicated peers, all the way from engineering to product management and customer support.
  • Breadth and depth. You are interested in working in an area that dynamically scales to meet the needs. You want to go deep into optimizing how we automate every manual process and tedious task we encounter.
  • Growth and mentorship. We believe in growing engineers through ownership and leadership opportunities. We also believe that mentors help both sides of the equation.
  • A stable, collaborative, and supportive work environment. Honesty and collaboration are values we see as a core of our team identity. We understand the value in open communication: working together to get things done, and adapting to the changing needs of the team and individuals. This is reflected in both our internal communications and also in how we interact with our customers.
  • Balance. We're not expecting people to work 12-hour days. We want you to be successful outside of work too and welcome work-from-home days. We trust our colleagues to be responsible with their time and commitment, and believe that balance helps cultivate a positive environment.

Requirements:

Requires a minimum of 2 years of related experience with a technical Bachelor’s degree; or equivalent practical experience.
  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Oracle Full-time

    Site Reliability Engineer-230001K1 Applicants are required to read, write, and speak the following languages: English. Preferred Qualifications: Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle...


  • San José, Costa Rica Encora Full-time

    **Important Information** Experience: + 5 years Job Mode: Full-time Work Mode: Work from home **Job Summary** As a **_Senior Site Reliability Engineer (6632)_**, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers'...

  • Site Reliability Engineer

    4 weeks ago


    San Pedro, Costa Rica CRG Solutions Full-time

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica CRG Solutions Full-time

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...


  • San José, Costa Rica VS-Staffing Full-time

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...

  • Site Reliability Engineer

    3 weeks ago


    San José, Costa Rica Hitachi Solutions Ltd Full-time

    **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain,...

  • Site Reliability Engineer

    2 weeks ago


    San José, Costa Rica Hitachi Solutions Full-time

    Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    4 weeks ago


    Location San José, San José, Costa Rica Udersol Full-time

    Requirements: Technical Requirements: - Bachelor's degree in computer science, IT or other highly technical, scientific discipline. - 3+ years of experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and JavaScript. Experience with automation and scripting languages, including CloudFormation and...


  • San José, Costa Rica VS-Staffing Full-time

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Hitachi Solutions Full-time

    Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Hitachi Solutions Ltd Full-time

    Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...


  • San José, Costa Rica OfficeSpace Full-time

    OfficeSpace Software is the workplace management platform enabling the future of work, with software that helps teams plan, connect, and perform in the hybrid workplace. 1,000 of the world’s top organizations use OfficeSpace to get the most out of their space and connect the people in it, with intuitive space planning, desk and room booking, employee...

  • Site Reliability Engineer

    3 weeks ago


    San José, Costa Rica Sysdig Full-time

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Equifax Full-time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SREs in our team...


  • San José, San José, Costa Rica OfficeSpace Full-time

    OfficeSpace Software is the workplace management platform enabling the future of work, with software that helps teams plan, connect, and perform in the hybrid workplace. 1,000 of the world's top organizations use OfficeSpace to get the most out of their space and connect the people in it, with intuitive space planning, desk and room booking, employee...

  • Site Reliability Engineer

    3 weeks ago


    Location San José, San José, Costa Rica Udersol Full-time

    Requirements: Technical Requirements: Bachelor's degree in computer science, IT or other highly technical, scientific discipline. 3+ years of experience in a Site Reliability role. Ability to program with one or more high level languages, such as Python, Ruby, and JavaScript. Experience with automation and scripting languages, including CloudFormation and Terraform...

  • Site Reliability Engineer

    4 weeks ago


    San José, Costa Rica Equifax Full-time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, Costa Rica Canonical - Jobs Full-time

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • San José, San José, Costa Rica Equifax Full-time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. What you'll do: You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. Will support...

  • Site Reliability Engineer

    2 weeks ago


    San José, Costa Rica FullStack Labs Full-time

    FullStack is the fastest-growing software consultancy in the Americas. We help organizations like Uber, GoDaddy, MGM, Siemens, Stanford University, and the State of California, build distributed software development teams, and deliver transformational digital solutions. As an employee-first company, we focus on hiring the most talented software designers and...