Site Reliability Engineer

hace 2 semanas


San José, Costa Rica Cohesity A tiempo completo

Cohesity is on a mission to radically simplify how organizations secure and manage their data, while unlocking limitless value. As a leader in data security and management, we make it easy to secure, protect, manage and derive value from data—across the data center, edge, and cloud. At Cohesity, we're a group of builders and go-getters who are committed to doing the right thing. We encourage you to come as you are, as our differences make us stronger.

We've been named a Leader by multiple analyst firms and are prominently featured in the Forbes Cloud 100 and CRN's Coolest Cloud companies.

Join us and we'll lead the way together.

Our **Site Reliability Engineer** (SRE) works within Cohesity's Customer Support organization and will focus on supporting and resolving data protection issues on traditional platforms and databases.

We are looking for passionate technical support engineers who possess deep technical expertise, excellent troubleshooting experience, outstanding customer service and communications skills.

In this remote first opportunity you will work with other dedicated engineers on our San Jose, CA HQ support team to ensure our customers receive the highest level of support possible

**English Fluency Required**: Effective spoken and written business communication skills needed for typical business communications such as support calls, technical customer conversations, technical documentation, correspondence, presentations, negotiations, socializing and meetings.

Our SRE opportunity requires 9 am to 6 pm local time (CST) and will require working staggered days covering the weekends (Tuesday to Saturday OR Sunday to Thursday).

**Primary Job Responsibilities**:

- Responsible for deployment, maintenance and operations on Cohesity clusters deployed worldwide.
- Develop scripts and framework for automating repetitive tasks.
- Debug issues with backups on Microsoft Hyper-V, VMware vSphere and Linux KVM hypervisors.
- Provide support on clusters deployed in cloud platforms like Amazon Web Services, Microsoft Azure and Google Cloud Platform.
- Write scripts using bash or python for log capture and analysis on large clusters.
- Perform performance analysis, tuning and provide recommendations on issues and suggest cluster expansion if required.
- Define and lead changes to the Cohesity Data platform to product engineering teams based on feedback from deployments.
- Provide support for issues pertaining to Powershell cmdlets and RESTful APIs and provide scripts for automating tasks on Cohesity Data Platform.
- Work on debugging and resolving technical issues with the CentOS Platform.

**Required Skills**:

- Bachelor's degree with 2+ years of experience or Master's degree specializing in storage, networking or virtualization.
- Experience with Python, bash or Powershell.
- Deep understanding of Linux platform and ability to identify and resolve issues with the Linux kernel associated with storage, networking, compute and memory.
- Ability to analyze system diagnostics and clearly articulate issues.
- Good understanding of Linux debugging utilities, with an emphasis on systemtap, tcpdump ftrace, strace, wireshark, gdb, and crash.
- Experience with remote file access protocols, including NFS, SMB (CIFS).
- Solid experience with storage-related concepts, including virtualization and data protection (e.g. VMware, CommVault, Symantec, EMC, NetApp).

**Advantageous**:

- Experience working with a distributed file system.
- Performance analysis experience.
- CentOS Linux Platform Architecture experience.
- Experience debugging complex, serialized, multi-threaded code.

For information on personal data processing, please see our **Privacy Policy**.**

**Equal Employment Opportunity Employer (EEOE)**

Cohesity is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status or any other category protected by law.

**COVID-19


  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Vs-Staffing A tiempo completo

    Job Description - Site Reliability Engineer - Remote Costa Rica**Title**:Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact.- Procedure...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Canonical - Jobs A tiempo completo

    About the Role: We are seeking an experienced Site Reliability Engineer to join our team at Canonical. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud infrastructure.">Automate software operations for reusability and consistency across private and...


  • San José, San José, Costa Rica Vs-Staffing A tiempo completo

    Job Description - Sr. Site Reliability Engineer**Title**:Sr. Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team.- Strategy Development: Lead the...


  • San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages***: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and...


  • San José, San José, Costa Rica beBee Careers A tiempo completo

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the scalability, fault-tolerance, and high availability of our platform.Key ResponsibilitiesLeverage a suite of SaaS-based observability tools to monitor and improve the performance of our platform.Take...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-230001K1**Applicants are required to read, write, and speak the following languages***: English**Preferred Qualifications**Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and efficiency of...


  • San José, San José, Costa Rica beBee Careers A tiempo completo

    **Job Summary:** We are seeking an experienced Site Reliability Engineer to lead our team in delivering high-quality, scalable, and secure systems.**Key Responsibilities:*Develop and execute strategies for system optimization, ensuring scalability, reliability, and security at all levels.Lead the creation and implementation of sophisticated incident response...


  • San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Crg Solutions A tiempo completo

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical andprocess guidance specific to a business unit.Key areas of impact this role provides are in depth knowledgeof the engineering environments within the specific business unit and providing automated, stable, andAutomation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    hace 2 semanas


    San Pedro, Costa Rica CRG Solutions A tiempo completo

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    hace 3 semanas


    San José, San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-2200087E**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...


  • San José, San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-2200087I**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...


  • San José, Costa Rica Hitachi Solutions A tiempo completo

    Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...


  • San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

    **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain,...


  • San José, San José, Costa Rica Bairesdev A tiempo completo

    About the RoleWe're looking for a skilled Site Reliability Engineer to join our Development team on a Home-based modality. As a key member of our multicultural teams, you'll contribute to delivering top-notch solutions to our clients while fostering a dynamic work environment. Your expertise will play a crucial role in ensuring the smooth operation of our...

  • Site Reliability Engineer

    hace 2 semanas


    San Francisco, Heredia, Costa Rica Sysco Costa Rica A tiempo completo

    **Requirements**:- Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team.- Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite.- Partner with Level 1 support teams,...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Modus Create A tiempo completo

    Job DescriptionWe are looking for an experienced DevOps/SRE Engineer to join our team. As a key member of our technical staff, you will be responsible for designing and implementing efficient systems, automating processes, and ensuring the reliability of our infrastructure. You will collaborate with cross-functional teams to deliver high-quality solutions...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

    **Company Description**Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...


  • San José, San José, Costa Rica Hitachi Solutions A tiempo completo

    Company DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible.If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you._**What you'll do**:- You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems.- Will support...