Site Reliability Engineer
7 months ago
Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run.
We value diversity and open dialog to spur ideas, working closely together to achieve goals. We're an international company that understands how to cultivate a strong culture across a remote team. And we're a great place to work too: we've been named a **_Bay Area Best Place to Work_** by the **_San Francisco Business Times and the Silicon Valley Business Journal_** for three years now. We were recognized by Deloitte as one of the 500 fastest growing organizations in 2020 and 2021. We are looking for team members who have a passion for container and cloud security and are willing to dig deeper to help our customers. Does this sound like the right place for you?
As a **Site Reliability Engineer**, you will build solutions to enhance the availability, security, and resilience of the Sysdig services, including backends and data stores. You will collaborate with the Infrastructure, Engineering, and Customer Success teams to provide the best experience for our high-profile customers.
**What you will do**
- Deploy, upgrade and migrate large-scale Sysdig services on Kubernetes
- Enable customers and Sysdig customer-facing teams to solve common issues in production
- Enhance the observability and reliability of Sysdig services to meet SLAs/SLOs
- Automate manual and repetitive tasks to reduce toil
- Work with the Engineering team on security hardening in highly regulated environments
**What you will bring with you**
- Working experience in deploying and running workloads on Kubernetes in production is a must
- Working experience in monitoring production environments using Prometheus is a must
- Working experience with one of the following data stores is highly preferred: Postgres, Redis, Cassandra, Elasticsearch, Kafka/Zookeeper
- Ability to write and maintain technical documentation is a must
- Strong coding skills in a high-level programming language (Python, Golang, etc.)
- Working experience with Terraform or Helm
- Experience with a well-known CI/CD tool
- Familiarity with common Linux commands
- Knowledge and experience in public cloud are preferred
- On call every 6 weeks
**Why work at Sysdig?**
- We're a well-funded startup that already has a large enterprise customer base
- We have a pragmatic, transparent culture, from the CEO down
- We have an organizational focus on delivering value to customers
**When you join Sysdig, you can expect**:
- Competitive compensation including equity opportunities
- Flexible hours and additional recharge days
- Mental wellbeing support through Modern Health for you and your family
- Career growth
**_Some of our hiring managers are based internationally, so an up-to-date CV in English would be appreciated._**
-
Site Reliability Engineer
2 months ago
San Francisco, Heredia, Costa Rica IBM Full-time Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure. Key Responsibilities: Lead the problem resolution process for our clients, from analysis and troubleshooting to deploying workarounds...
-
Site Reliability Engineer
2 months ago
San Francisco, Heredia, Costa Rica IBM Full-time Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. Key Responsibilities: Identify and investigate issues with our cloud infrastructure. Develop and implement solutions to improve the...
-
Site Reliability Engineer
1 week ago
San José, Costa Rica Oracle Full-time Site Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and...
-
Site Reliability Engineer
1 week ago
San José, Costa Rica Oracle Full-time Site Reliability Engineer-2200087I **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer Position
2 months ago
San Francisco, Heredia, Costa Rica Sysco Costa Rica Full-time **Job Requirements**: We are seeking a highly skilled Site Reliability Engineer to join our team at Sysco Costa Rica. This position will be responsible for developing and refining strategies and processes for support issue tracking from intake through resolution. **Key Responsibilities**: Contribute to and lead strategic discussions to evolve the product...
-
Senior Site Reliability Engineer
7 months ago
San José, Costa Rica Encora Full-time **Important Information** Experience: 5+ years Job Mode: Full-time Work Mode: Work from home **Job Summary** As a **_Senior Site Reliability Engineer (6632)_**, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers'...
-
Site Reliability Engineer
7 months ago
San José, Costa Rica Hitachi Solutions Ltd Full-time **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain,...
-
Site Reliability Engineer
7 months ago
San José, Costa Rica Hitachi Solutions Full-time Company Description: Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...
-
Site Reliability Engineer
1 day ago
San José, Costa Rica Scalable Systems Full-time Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...
-
Senior Site Reliability Engineer
5 days ago
San José, Costa Rica Equifax Full-time Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Senior Site Reliability Engineer, Americas
2 weeks ago
San José, Costa Rica Canonical - Jobs Full-time **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...
-
Sr Site Reliability Engineer
2 weeks ago
San José, Costa Rica Datasite Full-time Datasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest...
-
Senior Site Reliability Engineer
2 weeks ago
San José, Costa Rica Equifax Full-time Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. - As a Site Reliability Engineer (SRE) you will combine software and systems engineering for building and running large-scale, distributed,...
-
Site Reliability Engineer
7 months ago
San José, Costa Rica Equifax Full-time Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. - SREs in our team...
-
Site Reliability Engineer
2 months ago
San Francisco, Heredia, Costa Rica IBM Full-time Introduction: We are seeking a highly skilled Site Reliability Engineer to join our global team managing one of IBM's leading security solutions. As a member of our team, you will be working in a fast-paced and rewarding environment. Your Role and Responsibilities: You will have access to the latest education, tools, and technology, and a limitless career path...
-
Cyber Security Site Reliability Engineer
6 days ago
San José, Costa Rica Equifax Full-time **Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate)** is a discipline that combines software and systems engineering for building...
-
Reliability Engineering Specialist
14 hours ago
San Francisco, Heredia, Costa Rica IBM Full-time Overview: Welcome to IBM, where innovation meets reliability. As a Site Reliability Engineer, you will be at the forefront of building and maintaining systems that power our client business.
-
Site Reliability Engineer
2 months ago
San José, San José, Costa Rica Vs-Staffing Full-time Vs-Staffing is seeking a Site Reliability Engineer to join our team. As a key member of our cybersecurity department, you will be responsible for leading incident response efforts and developing strategies to mitigate threats. The ideal candidate will have a comprehensive understanding of cyber threats and attack methodologies, as well as expertise in Splunk...
-
Senior Site Reliability Engineer
7 months ago
San José, Costa Rica Nucleus Health Full-time A U.S.-based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and...