Site Reliability Engineer
hace 1 día
Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge.
**Openings**:
**Title**: Site Reliability Engineer
**Responsibilities**:
- Leverage a suite of SaaS-based observability tools to ensure our platform is scalable, fault-tolerant, and highly available.
- Take ownership of customer issues reported and see problems through to resolution.
- Attend in-person meetings with clients to analyze, troubleshoot and diagnose technical and data-related issues.
- Coordinate resolutions via Jira tickets with the Technology Development team.
- Test resolutions in conjunction with the Quality Assurance team.
- Communicate resolution of technical issues to Client Success team members and internal stakeholders.
- Partner with Level 1 support to ensure timely resolution of support issues.
- Collaborate with the Product Development team to fix areas with a high issue volume.
- Improve operations by conducting systems analysis and recommending changes in policies and procedures.
- Update job knowledge by studying best practices in technical support.
- Work with teams across the organization to build and maintain monitor-able, performant, reliable, and highly scalable software systems.
- Participate in timely post-mortems of production incidents.
**Requirements**:
- 5 Years of experience developing and monitoring mission-critical systems.
- Experience with SaaS-based observability tools, such as CloudWatch, Sentry, New Relic, Data Dog, and Uptime.
- Working knowledge of and passion for automating software delivery processes.
- Proven track record for designing and building top-tier monitoring and alerting infrastructure.
- Experience administering CI/CD pipelines.
- Thorough understanding of security & compliance best practices.
- Strong written and verbal communication skills with both internal team members and external customers with varying levels of technical knowledge.
- Strong initiative to find ways to improve solutions, operations, and processes.
- Internally motivated, with the ability to work proficiently both independently and in a team environment.
- A roll-up-your-sleeves, GSD approach to the day-to-day.
**Scalable Systems** is an Equal Opportunity-Affirmative Action Employer - Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.
**Salary**: ₡1.00 - ₡10.00 per month
-
Site Reliability Engineer
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure.Key ResponsibilitiesLead the problem resolution process for our clients, from analysis and troubleshooting to deploying workarounds...
-
Site Reliability Engineer
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key ResponsibilitiesIdentify and investigate issues with our cloud infrastructureDevelop and implement solutions to improve the...
-
Site Reliability Engineer
hace 1 semana
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages***: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and...
-
Site Reliability Engineer
hace 1 semana
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-2200087I **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer Position
hace 2 meses
San Francisco, Heredia, Costa Rica Sysco Costa Rica A tiempo completo**Job Requirements**:We are seeking a highly skilled Site Reliability Engineer to join our team at Sysco Costa Rica. This position will be responsible for developing and refining strategies and processes for support issue tracking from intake through resolution.**Key Responsibilities**:Contribute to and lead strategic discussions to evolve the product...
-
Senior Site Reliability Engineer
hace 7 meses
San José, Costa Rica Encora A tiempo completo**Important Information** Experience: + 5 years Job Mode: Full-time Work Mode: Work from home **Job Summary** As a **_Senior Site Reliability Engineer (6632)_**, you will be part of a highly skilled technology and agile team by supporting and developing cutting-edge solutions to meet our business requirements. You will help us accelerate our customers'...
-
Site Reliability Engineer
hace 7 meses
San José, Costa Rica Hitachi Solutions Ltd A tiempo completo**Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain,...
-
Site Reliability Engineer
hace 7 meses
San José, Costa Rica Hitachi Solutions A tiempo completoCompany Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...
-
Site Reliability Engineer
hace 7 meses
San José, Costa Rica Sysdig A tiempo completoSysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...
-
Senior Site Reliability Engineer
hace 5 días
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Senior Site Reliability Engineer, Americas
hace 2 semanas
San José, Costa Rica Canonical - Jobs A tiempo completo**Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...
-
Sr Site Reliability Engineer
hace 2 semanas
San José, Costa Rica Datasite A tiempo completoDatasite is where deals are made. We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries. Carrying that success into the future is all about you. Your useful skills, your unusual experience, your unique ideas. Everyone here brings something unexpected. What’s yours? Invest...
-
Senior Site Reliability Engineer
hace 2 semanas
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ - As a Site Reliability Engineer (SRE) you will combine software and systems engineering for building and running large-scale, distributed,...
-
Site Reliability Engineer
hace 7 meses
San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ - SREs in our team...
-
Site Reliability Engineer
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoIntroductionWe are seeking a highly skilled Site Reliability Engineer to join our global team managing one of IBM's leading security solutions. As a member of our team, you will be working in a fast-paced and rewarding environment.Your Role and ResponsibilitiesYou will have access to the latest education, tools, and technology, and a limitless career path...
-
Site Reliability Engineer
hace 7 meses
San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ - SREs in our team...
-
Cyber Security Site Reliability Engineer
hace 6 días
San José, Costa Rica Equifax A tiempo completo**Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate) **is a discipline that combines software and systems engineering for building...
-
Reliability Engineering Specialist
hace 14 horas
San Francisco, Heredia, Costa Rica Ibm A tiempo completoOverviewWelcome to IBM, where innovation meets reliability. As a Site Reliability Engineer, you will be at the forefront of building and maintaining systems that power our client business.
-
Site Reliability Engineer
hace 2 meses
San José, San José, Costa Rica Vs-Staffing A tiempo completoVs-Staffing is seeking a Site Reliability Engineer to join our team. As a key member of our cybersecurity department, you will be responsible for leading incident response efforts and developing strategies to mitigate threats.The ideal candidate will have a comprehensive understanding of cyber threats and attack methodologies, as well as expertise in Splunk...
-
Senior Site Reliability Engineer
hace 7 meses
San José, Costa Rica Nucleus Health A tiempo completoA U.S.based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and...