Site Reliability Engineer

4 weeks ago


San José, Costa Rica Cohesity Full-time

Cohesity is on a mission to radically simplify how organizations secure and manage their data, while unlocking limitless value. As a leader in data security and management, we make it easy to secure, protect, manage and derive value from data—across the data center, edge, and cloud. At Cohesity, we're a group of builders and go-getters who are committed to doing the right thing. We encourage you to come as you are, as our differences make us stronger.

We've been named a Leader by multiple analyst firms and are prominently featured in the Forbes Cloud 100 and CRN's Coolest Cloud companies.

Join us and we'll lead the way together.

Our **Site Reliability Engineer** (SRE) works within Cohesity's Customer Support organization and will focus on supporting and resolving data protection issues on traditional platforms and databases.

We are looking for passionate technical support engineers who possess deep technical expertise, excellent troubleshooting experience, and outstanding customer service and communication skills.

In this remote-first opportunity, you will work with other dedicated engineers on our San Jose, CA HQ support team to ensure our customers receive the highest level of support possible.

**English Fluency Required**: Effective spoken and written business communication skills needed for typical business communications such as support calls, technical customer conversations, technical documentation, correspondence, presentations, negotiations, socializing and meetings.

Our SRE opportunity requires working 9 am to 6 pm local time (CST) on staggered days that cover the weekends (Tuesday to Saturday or Sunday to Thursday).

**Primary Job Responsibilities**:

- Responsible for deployment, maintenance and operations on Cohesity clusters deployed worldwide.
- Develop scripts and frameworks for automating repetitive tasks.
- Debug issues with backups on Microsoft Hyper-V, VMware vSphere and Linux KVM hypervisors.
- Provide support on clusters deployed in cloud platforms like Amazon Web Services, Microsoft Azure and Google Cloud Platform.
- Write scripts using bash or Python for log capture and analysis on large clusters.
- Analyze and tune performance, provide recommendations on issues, and suggest cluster expansion if required.
- Define changes to the Cohesity Data Platform and drive them with product engineering teams, based on feedback from deployments.
- Provide support for issues pertaining to PowerShell cmdlets and RESTful APIs, and provide scripts for automating tasks on the Cohesity Data Platform.
- Work on debugging and resolving technical issues with the CentOS Platform.

**Required Skills**:

- Bachelor's degree with 2+ years of experience or Master's degree specializing in storage, networking or virtualization.
- Experience with Python, bash or PowerShell.
- Deep understanding of the Linux platform and ability to identify and resolve issues with the Linux kernel associated with storage, networking, compute and memory.
- Ability to analyze system diagnostics and clearly articulate issues.
- Good understanding of Linux debugging utilities, with an emphasis on systemtap, tcpdump, ftrace, strace, wireshark, gdb, and crash.
- Experience with remote file access protocols, including NFS, SMB (CIFS).
- Solid experience with storage-related concepts, including virtualization and data protection (e.g. VMware, CommVault, Symantec, EMC, NetApp).

**Advantageous**:

- Experience working with a distributed file system.
- Performance analysis experience.
- CentOS Linux Platform Architecture experience.
- Experience debugging complex, serialized, multi-threaded code.

For information on personal data processing, please see our **Privacy Policy**.

**Equal Employment Opportunity Employer (EEOE)**

Cohesity is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status or any other category protected by law.
