Site Reliability Engineer
hace 22 horas
Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and
process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge
of the engineering environments within the specific business unit and providing automated, stable, and
Automation Solutions Engineering, CI/CD Solutions Engineering, and Cloud Engineering on the DevOps
background in several types of engineering including: Software, Systems, Network, Security, Cloud
(Public/Private/Community/Hybrid), Automation, and Development Operations.
**ESSENTIAL DUTIES AND RESPONSIBILITIES**:
Our employees are tasked with delivering excellent business results through the efforts of their teams.
These results are achieved by:
- Monitoring and attending to site specific engineering IaaS and PaaS systems
- Working closely with site specific engineering teams to provide quality release management services
- Consumes automation from teams listed above to help provide an automated, stable, and consistent development environment.
- Accountable for working with engineering teams to help provide a minimum of 99.9% infrastructure uptime in accordance with current service level agreements.
- Helps to maintain standards for documentation, standard operating procedures, and work instructions.
- Address and coordinate efforts on escalated incidents.
- Maintain and enforce ITIL standards and procedures around service strategy, design, transition, operations, and continuous improvement.
- Assist in managing vendor relationships
- All other duties as assigned
Reasonable accommodations may be made to enable individuals with disabilities to perform the essential
functions of this position.
**MINIMUM KNOWLEDGE, SKILLS AND ABILITIES**:
The requirements listed below are representative of the experience, education, knowledge, skill and/or
abilities required.
- 2+ years in cloud operations, designing solutions leveraging public cloud IaaS and PaaS including
Virtual Machine, Virtual Networking, Load Balancing and other HA technologies, headless
architecture and containerization.
- 5+ years working in technical software or infrastructure engineering teams.
- Configuration management experience using popular automation products such as Azure
Automation, Powershell DSC, Chef, Ansible, Salt Stack, Puppet, or others.
- IaC (Infrastrucutre as Code) Automation provisioning experience using popular automation products
such as Azure Resource Manager Templates, Terraform, UCS Director, VMware vRealize
Automation, or others.
- Experience in developing or participating in cloud migration strategies
- Designing and reporting metrics and dashboards to drive SLA compliance.
- Knowledge and experience in ITIL framework-based operations and reporting.
- Advanced large-scale server and network management technology
- Procedure and policy design, documentation and dissemination
- Automation and analysis of performance and reliability data
- Understanding of global network design and inter-networking
- Security engineering, server and network hardening, auditing and reporting
- Participate in vendor and contract negotiation
- Strong technical, analytical, problem solving and verbal and written communication skills.
**PREFERRED QUALIFICATIONS**:
- Prior healthcare experience would be a plus.
- BA in a computer related field preferred but not required
- Industry certifications such as Cisco CCNA, VMware VCP, AWS Professional or Azure
MCSA/MCSE, Jenkins Certified Engineer, AWS Certified DevOps Engineer or Microsoft Azure
DevOps Solutions Certification
**PHYSICAL/MENTAL DEMANDS AND WORKING ENVIRONMENT**:
The physical and mental requirements along with the work environment characteristics described here are representative of those an individual encounters while performing the essential functions of this position.
-
Site Reliability Engineer Leader
hace 3 días
San José, San José, Costa Rica Pfizer A tiempo completo**Job Summary:**We are seeking a highly skilled Site Reliability Engineer Leader to join our Digital Command Operations team. In this role, you will be responsible for ensuring the robustness, reliability, and performance of Pfizer's critical digital solutions.**Key Responsibilities:**Act as focal point for day-to-day operation of Cloud services at...
-
Site Reliability Engineer
hace 2 semanas
San José, San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-230001K1**Applicants are required to read, write, and speak the following languages***: English**Preferred Qualifications**Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and efficiency of...
-
Site Reliability Engineer
hace 3 semanas
San José, Costa Rica Crg Solutions A tiempo completoReporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical andprocess guidance specific to a business unit.Key areas of impact this role provides are in depth knowledgeof the engineering environments within the specific business unit and providing automated, stable, andAutomation Solutions Engineering, CI/CD...
-
Site Reliability Engineer
hace 3 semanas
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-2200087E**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
hace 2 semanas
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-2200087I**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
hace 2 semanas
San José, San José, Costa Rica Fullstack Labs A tiempo completoFullStack is the fastest-growing software consultancy in the Americas.We help organizations like Uber, GoDaddy, MGM, Siemens, Stanford University, and the State of California, build distributed software development teams, and deliver transformational digital solutions.As an employee-first company, we focus on hiring the most talented software designers and...
-
Site Reliability Engineer
hace 6 días
San José, Costa Rica VS-Staffing A tiempo completoJob Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...
-
Site Reliability Engineering Specialist
hace 2 días
San José, San José, Costa Rica Fullstack Labs A tiempo completoAbout UsFullStack Labs is a software consultancy with a strong presence in the Americas. Our team helps organizations build distributed software development teams and deliver transformational digital solutions. We focus on creating a positive, respectful, and supportive work environment where our employees can thrive.We're proud of:Offering life-changing...
-
Site Reliability Engineering Leader
hace 11 horas
San José, San José, Costa Rica Datasite A tiempo completo**Job Summary**We are seeking an experienced Sr Site Reliability Engineer to mature our operational observability practices, prevent issues, and resolve enterprise incidents in our customer-facing platform. As a key member of our Engineering team, you will deliver mission-critical expertise and support to ensure high availability and performance.
-
Site Reliability Engineer
hace 3 semanas
San José, Costa Rica Vs-Staffing A tiempo completoJob Description - Site Reliability Engineer - Remote Costa Rica**Title**:Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact.- Procedure...
-
Site Reliability Engineer
hace 2 semanas
San Francisco, Heredia, Costa Rica Sysco Costa Rica A tiempo completo**Requirements**:- Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team.- Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite.- Partner with Level 1 support teams,...
-
Site Reliability Engineer
hace 4 días
Ubicación San José, San José, Costa Rica Udersol A tiempo completoRequisitos: Technical Requirements: - Bachelor-s degree in computer science, IT or other highly technical, scientific discipline. - 3+ Years experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and...
-
Site Reliability Engineer
hace 3 días
San José, San José, Costa Rica Modus Create A tiempo completoAbout Modus CreateModus Create is a fast-growing, remote-first company that specializes in emerging technologies. We are seeking an experienced and enthusiastic DevOps/SRE Engineer (Tooling and Site Reliability Engineer) to join our team.This senior-level position requires expertise in optimization and automation, as well as experience with software...
-
Site Reliability Engineer
hace 2 semanas
San José, San José, Costa Rica Hitachi Solutions A tiempo completoCompany DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...
-
Site Reliability Engineer
hace 3 semanas
San José, San José, Costa Rica Hitachi Solutions Ltd A tiempo completo**Company Description**Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...
-
Site Reliability Expert
hace 15 horas
San Francisco, Heredia, Costa Rica Sysco Costa Rica A tiempo completo**Company Overview:**Sysco Costa Rica is a leading provider of food and support services. Our team works diligently to ensure our customers receive the highest quality products and experiences.**Job Summary:**We are seeking an experienced Site Reliability Engineer to join our team. The successful candidate will be responsible for developing and refining...
-
Site Reliability Engineer
hace 1 semana
San José, San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles._- SREs in our team take...
-
Sr. Site Reliability Engineer
hace 6 días
San José, Costa Rica VS-Staffing A tiempo completoJob Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...
-
Site Reliability Engineer Leader
hace 10 horas
San José, San José, Costa Rica Akamai A tiempo completoInnovative Problem-Solving OpportunitiesAkamai is a leader in delivering fast, smart, and secure intelligent edge platforms. We are seeking a talented Senior Site Reliability Engineer to join our team.This role will involve deploying, managing, and operating scalable, highly available, and fault-tolerant systems on the Akamai Zero Trust Cloud Platform. You...
-
Sr. Site Reliability Engineer
hace 3 semanas
San José, Costa Rica Vs-Staffing A tiempo completoJob Description - Sr. Site Reliability Engineer**Title**:Sr. Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team.- Strategy Development: Lead the...