Site Reliability Engineer

3 weeks ago


San José, Costa Rica Cohesity Full-time

Cohesity is on a mission to radically simplify how organizations secure and manage their data, while unlocking limitless value. As a leader in data security and management, we make it easy to secure, protect, manage and derive value from data—across the data center, edge, and cloud. At Cohesity, we're a group of builders and go-getters who are committed to doing the right thing. We encourage you to come as you are, as our differences make us stronger.

We've been named a Leader by multiple analyst firms and are prominently featured in the Forbes Cloud 100 and CRN's Coolest Cloud companies.

Join us and we'll lead the way together.

Our **Site Reliability Engineer** (SRE) works within Cohesity's Customer Support organization and will focus on supporting and resolving data protection issues on traditional platforms and databases.

We are looking for passionate technical support engineers with deep technical expertise, strong troubleshooting experience, and outstanding customer service and communication skills.

In this remote-first opportunity, you will work with other dedicated engineers on our San Jose, CA HQ support team to ensure our customers receive the highest level of support possible.

**English Fluency Required**: Effective spoken and written business communication skills needed for typical business communications such as support calls, technical customer conversations, technical documentation, correspondence, presentations, negotiations, socializing and meetings.

This SRE role requires working 9 am to 6 pm local time (CST) on a staggered schedule that covers weekends (Tuesday to Saturday or Sunday to Thursday).

**Primary Job Responsibilities**:

- Deploy, maintain and operate Cohesity clusters deployed worldwide.
- Develop scripts and frameworks for automating repetitive tasks.
- Debug issues with backups on Microsoft Hyper-V, VMware vSphere and Linux KVM hypervisors.
- Provide support on clusters deployed in cloud platforms like Amazon Web Services, Microsoft Azure and Google Cloud Platform.
- Write scripts using Bash or Python for log capture and analysis on large clusters.
- Analyze and tune performance, provide recommendations on issues, and suggest cluster expansion where required.
- Define changes to the Cohesity Data Platform based on feedback from deployments, and lead them with product engineering teams.
- Provide support for issues pertaining to PowerShell cmdlets and RESTful APIs, and provide scripts for automating tasks on the Cohesity Data Platform.
- Debug and resolve technical issues with the CentOS platform.

**Required Skills**:

- Bachelor's degree with 2+ years of experience or Master's degree specializing in storage, networking or virtualization.
- Experience with Python, Bash or PowerShell.
- Deep understanding of the Linux platform and ability to identify and resolve issues with the Linux kernel associated with storage, networking, compute and memory.
- Ability to analyze system diagnostics and clearly articulate issues.
- Good understanding of Linux debugging utilities, with an emphasis on SystemTap, tcpdump, ftrace, strace, Wireshark, gdb, and crash.
- Experience with remote file access protocols, including NFS and SMB (CIFS).
- Solid experience with storage-related concepts, including virtualization and data protection (e.g. VMware, CommVault, Symantec, EMC, NetApp).

**Advantageous**:

- Experience working with a distributed file system.
- Performance analysis experience.
- CentOS Linux Platform Architecture experience.
- Experience debugging complex, serialized, multi-threaded code.

For information on personal data processing, please see our **Privacy Policy**.

**Equal Employment Opportunity Employer (EEOE)**

Cohesity is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status or any other category protected by law.



  • Site Reliability Engineer

    2 weeks ago


    San José, Costa Rica Oracle Full-time

    Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...

  • Site Reliability Engineer

    3 weeks ago


    San Pedro, Costa Rica CRG Solutions Full-time

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    4 weeks ago


    San José, Costa Rica VS-Staffing Full-time

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...

  • Site Reliability Engineer

    3 weeks ago


    Location: San José, San José, Costa Rica Udersol Full-time

    Requirements: Technical Requirements: - Bachelor's degree in computer science, IT or other highly technical, scientific discipline. - 3+ years of experience in a Site Reliability role. - Ability to program with one or more high-level languages, such as Python, Ruby, and JavaScript. Experience with automation and scripting languages, including CloudFormation and...

  • Site Reliability Engineer

    4 weeks ago


    San José, San José, Costa Rica Equifax Full-time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. - SREs in our team take...


  • San José, Costa Rica VS-Staffing Full-time

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...

  • Site Reliability Engineer

    2 weeks ago


    San José, Costa Rica Sysdig Full-time

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...

  • Site Reliability Engineer

    4 weeks ago


    San José, San José, Costa Rica Scalable Systems Full-time

    Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...

  • Site Reliability Engineer

    3 weeks ago


    San José, Costa Rica Equifax Full-time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, San José, Costa Rica Akamai Full-time

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join the Akamai SRE Infrastructure team** As Site Reliability Engineer II you'll be responsible for the operational stability and performance of critical systems and services. Part of a Global team...


  • San José, San José, Costa Rica Wikimedia Foundation Full-time

    **Staff Site Reliability Engineer (Traffic)** **Summary** We are looking for a Staff Site Reliability Engineer to support and develop the platform serving the world's favorite encyclopedia to millions of people around the globe. Wikimedia's Site Reliability Engineering (SRE) team is principally responsible for ensuring our global top-15 website, our...


  • San José, San José, Costa Rica Vs-Staffing Full-time

    About the Job: This role is an exciting opportunity to join our team as a Site Reliability Engineer. You will be working closely with our IT team to deploy security controls and measures that safeguard against future incidents while ensuring system compliance and reliability. The successful candidate will have experience in site reliability engineering and be...


  • San José, Costa Rica Akamai Full-time

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join the Akamai SRE Infrastructure team** As Site Reliability Engineer II you'll be responsible for the operational stability and performance of critical systems and services. Part of a Global...


  • San José, Costa Rica Equifax Full-time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ - SREs in our team...


  • San José, Costa Rica Equifax Full-time

    **Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate)** is a discipline that combines software and systems engineering for building...


  • San José, Costa Rica Akamai Full-time

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join our Site Reliability team** **Help us shape the future of the Internet** As a Site Reliability Engineer, you will be responsible for: - Deploying, managing, and operating scalable,...


  • San José, San José, Costa Rica BairesDev Full-time

    BairesDev is the leading software development company in the Americas. With more than 3500 employees working on projects around the world, a sustained average annual growth of over 50%, and recognized by Inc. in the Top 10 Silicon Valley fastest-growing private companies, BairesDev is guiding the digital transformation of some of the top companies...


  • San José, San José, Costa Rica Equifax Full-time

    **Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate)** is a discipline that combines software and systems engineering for building and...


  • San José, San José, Costa Rica Akamai Full-time

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join the Akamai SRE Infrastructure team** As Site Reliability Engineer Senior II, you'll be responsible for the operational stability and performance of critical systems and services. Part of a...