Site Reliability Engineer

2 weeks ago


San José, Costa Rica Oracle Full-time

Site Reliability Engineer-2200087I

**Applicants are required to read, write, and speak the following languages**: English

**Preferred Qualifications**

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.
Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance.
Authority for end-to-end performance and operability.
Partner with development teams in defining and implementing improvements in service architecture.
Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
Demonstrate clear understanding of automation and orchestration principles.
Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations.
Understand and explain the effect of product architecture decisions on distributed systems.
Professional curiosity and a desire to develop a deep understanding of services and technologies.
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
Develop designs, architectures, standards, and methods for large-scale distributed systems.
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
**Requirements**:

- A BS or MS in Computer Science, or equivalent.
- Solid knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, technology security and compliance.
- Experience running large-scale, customer-facing web services.
- Solid understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies.
- Work involves defining and documenting technical architecture of complex and highly scalable products.
- A minimum of 2 years of experience.

**Languages**: Bash, Python, Perl

**Services**: Java, Apache, Jetty, Kafka, Zookeeper, Oracle Database

**Operating Systems**: Any related Linux operating systems.



  • San José, San José, Costa Rica Pfizer Full-time

    **Job Summary:** We are seeking a highly skilled Site Reliability Engineer Leader to join our Digital Command Operations team. In this role, you will be responsible for ensuring the robustness, reliability, and performance of Pfizer's critical digital solutions. **Key Responsibilities:** Act as focal point for day-to-day operation of Cloud services at...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Oracle Full-time

    Site Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of...

  • Site Reliability Engineer

    3 weeks ago


    San José, Costa Rica Crg Solutions Full-time

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in-depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...

  • Site Reliability Engineer

    4 weeks ago


    San José, Costa Rica Oracle Full-time

    Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...


  • San Pedro, Costa Rica CRG Solutions Full-time

    Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in-depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...


  • San José, Costa Rica VS-Staffing Full-time

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Fullstack Labs Full-time

    FullStack is the fastest-growing software consultancy in the Americas. We help organizations like Uber, GoDaddy, MGM, Siemens, Stanford University, and the State of California build distributed software development teams and deliver transformational digital solutions. As an employee-first company, we focus on hiring the most talented software designers and...


  • San José, San José, Costa Rica Fullstack Labs Full-time

    About Us: FullStack Labs is a software consultancy with a strong presence in the Americas. Our team helps organizations build distributed software development teams and deliver transformational digital solutions. We focus on creating a positive, respectful, and supportive work environment where our employees can thrive. We're proud of: Offering life-changing...


  • San José, San José, Costa Rica Datasite Full-time

    **Job Summary** We are seeking an experienced Sr Site Reliability Engineer to mature our operational observability practices, prevent issues, and resolve enterprise incidents in our customer-facing platform. As a key member of our Engineering team, you will deliver mission-critical expertise and support to ensure high availability and performance.

  • Site Reliability Engineer

    3 weeks ago


    San José, Costa Rica Vs-Staffing Full-time

    Job Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. - Procedure...


  • Location: San José, San José, Costa Rica Udersol Full-time

    Requirements: Technical Requirements: - Bachelor's degree in computer science, IT or other highly technical, scientific discipline. - 3+ years of experience in a Site Reliability role. - Ability to program with one or more high-level languages, such as Python, Ruby, and JavaScript. Experience with automation and scripting languages, including CloudFormation and...


  • San José, San José, Costa Rica Modus Create Full-time

    About Modus Create: Modus Create is a fast-growing, remote-first company that specializes in emerging technologies. We are seeking an experienced and enthusiastic DevOps/SRE Engineer (Tooling and Site Reliability Engineer) to join our team. This senior-level position requires expertise in optimization and automation, as well as experience with software...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Hitachi Solutions Full-time

    Company Description: Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    3 weeks ago


    San José, San José, Costa Rica Hitachi Solutions Ltd Full-time

    **Company Description** Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a...

  • Site Reliability Engineer

    3 weeks ago


    San Francisco, Heredia, Costa Rica Sysco Costa Rica Full-time

    **Requirements**: - Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team. - Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite. - Partner with Level 1 support teams,...

  • Site Reliability Engineer

    2 weeks ago


    San José, San José, Costa Rica Equifax Full-time

    Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. - SREs in our team take...


  • San José, Costa Rica VS-Staffing Full-time

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...


  • San José, San José, Costa Rica Akamai Full-time

    Innovative Problem-Solving Opportunities: Akamai is a leader in delivering fast, smart, and secure intelligent edge platforms. We are seeking a talented Senior Site Reliability Engineer to join our team. This role will involve deploying, managing, and operating scalable, highly available, and fault-tolerant systems on the Akamai Zero Trust Cloud Platform. You...


  • San José, Costa Rica Sysdig Full-time

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...


  • San José, Costa Rica Vs-Staffing Full-time

    Job Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development: Lead the...