Site Reliability Engineer

hace 1 día


San Pedro, Costa Rica CRG Solutions A tiempo completo

Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and
process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge
of the engineering environments within the specific business unit and providing automated, stable, and
Automation Solutions Engineering, CI/CD Solutions Engineering, and Cloud Engineering on the DevOps
background in several types of engineering including: Software, Systems, Network, Security, Cloud
(Public/Private/Community/Hybrid), Automation, and Development Operations.

**ESSENTIAL DUTIES AND RESPONSIBILITIES**:
Our employees are tasked with delivering excellent business results through the efforts of their teams.
These results are achieved by:

- Monitoring and attending to site specific engineering IaaS and PaaS systems
- Working closely with site specific engineering teams to provide quality release management services
- Consumes automation from teams listed above to help provide an automated, stable, and consistent development environment.
- Accountable for working with engineering teams to help provide a minimum of 99.9% infrastructure uptime in accordance with current service level agreements.
- Helps to maintain standards for documentation, standard operating procedures, and work instructions.
- Address and coordinate efforts on escalated incidents.
- Maintain and enforce ITIL standards and procedures around service strategy, design, transition, operations, and continuous improvement.
- Assist in managing vendor relationships
- All other duties as assigned

Reasonable accommodations may be made to enable individuals with disabilities to perform the essential
functions of this position.

**MINIMUM KNOWLEDGE, SKILLS AND ABILITIES**:
The requirements listed below are representative of the experience, education, knowledge, skill and/or

abilities required.
- 2+ years in cloud operations, designing solutions leveraging public cloud IaaS and PaaS including

Virtual Machine, Virtual Networking, Load Balancing and other HA technologies, headless

architecture and containerization.
- 5+ years working in technical software or infrastructure engineering teams.
- Configuration management experience using popular automation products such as Azure

Automation, Powershell DSC, Chef, Ansible, Salt Stack, Puppet, or others.
- IaC (Infrastrucutre as Code) Automation provisioning experience using popular automation products

such as Azure Resource Manager Templates, Terraform, UCS Director, VMware vRealize

Automation, or others.
- Experience in developing or participating in cloud migration strategies
- Designing and reporting metrics and dashboards to drive SLA compliance.
- Knowledge and experience in ITIL framework-based operations and reporting.
- Advanced large-scale server and network management technology
- Procedure and policy design, documentation and dissemination
- Automation and analysis of performance and reliability data
- Understanding of global network design and inter-networking
- Security engineering, server and network hardening, auditing and reporting
- Participate in vendor and contract negotiation
- Strong technical, analytical, problem solving and verbal and written communication skills.

**PREFERRED QUALIFICATIONS**:

- Prior healthcare experience would be a plus.
- BA in a computer related field preferred but not required
- Industry certifications such as Cisco CCNA, VMware VCP, AWS Professional or Azure

MCSA/MCSE, Jenkins Certified Engineer, AWS Certified DevOps Engineer or Microsoft Azure

DevOps Solutions Certification

**PHYSICAL/MENTAL DEMANDS AND WORKING ENVIRONMENT**:
The physical and mental requirements along with the work environment characteristics described here are representative of those an individual encounters while performing the essential functions of this position.



  • San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-230001K1**Applicants are required to read, write, and speak the following languages***: English**Preferred Qualifications**Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and efficiency of...


  • San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-2200087I**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...


  • San José, Costa Rica VS-Staffing A tiempo completo

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...


  • San José, Costa Rica Vs-Staffing A tiempo completo

    Job Description - Site Reliability Engineer - Remote Costa Rica**Title**:Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact.- Procedure...


  • San José, Costa Rica Hitachi Solutions A tiempo completo

    Company DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...


  • Ubicación San José, San José, Costa Rica Udersol A tiempo completo

    Requisitos: Technical Requirements: - Bachelor-s degree in computer science, IT or other highly technical, scientific discipline. - 3+ Years experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and...


  • San José, Costa Rica VS-Staffing A tiempo completo

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...


  • San José, Costa Rica Scalable Systems A tiempo completo

    Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...


  • San José, Costa Rica Vs-Staffing A tiempo completo

    Job Description - Sr. Site Reliability Engineer**Title**:Sr. Site Reliability Engineer**Location**:Remote, based in Costa Rica**Job Overview**:**Key responsibilities include**:- Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team.- Strategy Development: Lead the...


  • San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...


  • San José, Costa Rica Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • San José, Costa Rica Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**:To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers.As a...


  • San Francisco, Heredia, Costa Rica Ibm A tiempo completo

    OverviewWelcome to IBM, where innovation meets reliability. As a Site Reliability Engineer, you will be at the forefront of building and maintaining systems that power our client business.


  • San José, Costa Rica Datasite A tiempo completo

    Datasite is where deals are made.We provide the data rooms and SaaS technology used in M&A and other high-value transactions, to deliver projects in more than 170 countries.Carrying that success into the future is all about you.Your useful skills, your unusual experience, your unique ideas.Everyone here brings something unexpected.What's yours?Invest your...


  • San José, Costa Rica Fullstack Labs A tiempo completo

    FullStack is the fastest-growing software consultancy in the Americas.We help organizations like Uber, GoDaddy, MGM, Siemens, Stanford University, and the State of California, build distributed software development teams, and deliver transformational digital solutions.As an employee-first company, we focus on hiring the most talented software designers and...


  • San José, Costa Rica Equifax A tiempo completo

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles._- SREs in our team take...


  • San José, Costa Rica Equifax A tiempo completo

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles._- SREs in our team take...


  • San José, Costa Rica Bairesdev A tiempo completo

    BairesDev is proud to be one of the fastest-growing companies in Latin America and a welcoming, highly rated employer (Glassdoor Employee Score: 4.3).With more than 3500 employees in 27 countries and world-class clients from start-ups to Fortune 500 companies, we're only as strong as the multicultural teams at the heart of our business.BairesDev runs on...


  • San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?****Are you a self-starting professional who thrives in a dynamic environment?****Join our Site Reliability team****Help us shape the future of the Internet**As a Site Reliability Engineer, you will be responsible for:- Deploying, managing, and operating scalable, highly...


  • San José, Costa Rica Equifax A tiempo completo

    **Equifax is where you can power your possible.If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.****Cyber Security Site Reliability Engineer (SRE Intermediate) **is a discipline that combines software and systems engineering for building and...