Site Reliability Engineer
2 weeks ago
Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge.
**Openings**:
**Title**: Site Reliability Engineer
**Responsibilities**:
- Leverage a suite of SaaS-based observability tools to ensure our platform is scalable, fault-tolerant, and highly available.
- Take ownership of reported customer issues and see problems through to resolution.
- Attend in-person meetings with clients to analyze, troubleshoot and diagnose technical and data-related issues.
- Coordinate resolutions via Jira tickets with the Technology Development team.
- Test resolutions in conjunction with the Quality Assurance team.
- Communicate resolution of technical issues to Client Success team members and internal stakeholders.
- Partner with Level 1 support to ensure timely resolution of support issues.
- Collaborate with the Product Development team to fix areas with a high issue volume.
- Improve operations by conducting systems analysis and recommending changes in policies and procedures.
- Update job knowledge by studying best practices in technical support.
- Work with teams across the organization to build and maintain monitorable, performant, reliable, and highly scalable software systems.
- Participate in timely post-mortems of production incidents.
**Requirements**:
- 5 years of experience developing and monitoring mission-critical systems.
- Experience with SaaS-based observability tools, such as CloudWatch, Sentry, New Relic, Datadog, and Uptime.
- Working knowledge of and passion for automating software delivery processes.
- Proven track record of designing and building top-tier monitoring and alerting infrastructure.
- Experience administering CI/CD pipelines.
- Thorough understanding of security & compliance best practices.
- Strong written and verbal communication skills with both internal team members and external customers with varying levels of technical knowledge.
- Strong initiative to find ways to improve solutions, operations, and processes.
- Internally motivated, with the ability to work proficiently both independently and in a team environment.
- A roll-up-your-sleeves, GSD approach to the day-to-day.
**Scalable Systems** is an Equal Opportunity-Affirmative Action Employer - Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.
**Salary**: ₡1.00 - ₡10.00 per month
-
Site Reliability Engineer
1 week ago
San José, San José, Costa Rica Vs-Staffing Full-time Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. - Procedure...
-
Site Reliability Engineer
4 days ago
San José, San José, Costa Rica Canonical - Jobs Full-time About the Role: We are seeking an experienced Site Reliability Engineer to join our team at Canonical. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining the reliability and scalability of our cloud infrastructure. Automate software operations for reusability and consistency across private and...
-
Sr. Site Reliability Engineer
1 week ago
San José, San José, Costa Rica Vs-Staffing Full-time Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development: Lead the...
-
Site Reliability Engineer
4 days ago
San José, San José, Costa Rica Oracle Full-time Site Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of...
-
Site Reliability Engineer
4 weeks ago
San José, Costa Rica Oracle Full-time Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
12 hours ago
San José, Costa Rica Oracle Full-time Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
4 days ago
San José, San José, Costa Rica Crg Solutions Full-time Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in-depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...
-
Site Reliability Engineer
2 weeks ago
San José, San José, Costa Rica Oracle Full-time Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
3 days ago
San José, San José, Costa Rica Oracle Full-time Site Reliability Engineer-2200087I **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
6 days ago
San Pedro, Costa Rica CRG Solutions Full-time Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in-depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...
-
Site Reliability Engineer
1 week ago
San José, Costa Rica VS-Staffing Full-time Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...
-
Site Reliability Engineer
1 week ago
Location: San José, San José, Costa Rica Udersol Full-time Requirements: Technical Requirements: - Bachelor's degree in computer science, IT or other highly technical, scientific discipline. - 3+ years of experience in a Site Reliability role. - Ability to program with one or more high-level languages, such as Python, Ruby, and JavaScript. Experience with automation and scripting languages, including CloudFormation and...
-
Site Reliability Specialist
2 days ago
San José, San José, Costa Rica Bairesdev Full-time About the Role We're looking for a skilled Site Reliability Engineer to join our Development team on a Home-based modality. As a key member of our multicultural teams, you'll contribute to delivering top-notch solutions to our clients while fostering a dynamic work environment. Your expertise will play a crucial role in ensuring the smooth operation of our...
-
Site Reliability Engineer
5 days ago
San Francisco, Heredia, Costa Rica Sysco Costa Rica Full-time **Requirements**: - Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team. - Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite. - Partner with Level 1 support teams,...
-
Site Reliability Engineer
7 days ago
San José, San José, Costa Rica Modus Create Full-time Job Description We are looking for an experienced DevOps/SRE Engineer to join our team. As a key member of our technical staff, you will be responsible for designing and implementing efficient systems, automating processes, and ensuring the reliability of our infrastructure. You will collaborate with cross-functional teams to deliver high-quality solutions...
-
Site Reliability Engineer
7 days ago
San José, San José, Costa Rica Hitachi Solutions Ltd Full-time **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...
-
Site Reliability Engineer
3 days ago
San José, San José, Costa Rica Hitachi Solutions Full-time Company Description Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...
-
Site Reliability Engineer
6 days ago
San José, San José, Costa Rica Equifax Full-time Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. **What you'll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will support...
-
Site Reliability Engineering Lead
3 days ago
San José, San José, Costa Rica Oracle Full-time About the Role As a Site Reliability Engineering Lead at Oracle, you will be responsible for ensuring the reliability and efficiency of our cloud services. This involves working closely with development teams to design and implement solutions that meet the needs of our customers. Key Responsibilities: Collaborate with development teams to define and implement...
-
Sr. Site Reliability Engineer
1 week ago
San José, Costa Rica VS-Staffing Full-time Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...