Site Reliability Engineer

hace 1 semana


San José, Costa Rica Hitachi Solutions Ltd A tiempo completo

**Company Description**

Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our industry focus, expertise, and intellectual property is what truly sets us apart. We have earned, and continue to maintain, a strategic relationship with Microsoft. Recognized for our achievements - teaming with our clients to deliver innovative digital solutions and services - is how we have achieved year after year recognition.

A part of Hitachi, Ltd., our company has a long and rich history of innovation, financial strength, and international presence of one of the world's largest companies. Since 1910, Hitachi, Ltd. has been a leader in manufacturing innovative products and solutions that support industry and social infrastructure around the globe supported by 303,000 employees in over 100 countries and across 864 companies.

Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals. Our leadership in Global Dynamics 365 Field Service and Manufacturing is what truly sets us apart and enables us to maintain a strategic relationship with Microsoft.

**SITE RELIABILITY ENGINEER**

This is a full-time role in our product organization for an expert in systems design with considerable skill and expertise in large software development. Designs and implements Continuous Integration/Continuous Deployment (CI/CD) tooling using GitHub Actions / Azure DevOps, and related technologies. This includes defining and implementing: build and test pipelines for containerized architectures, infrastructure as code (IaC) for the stateful deployment of environments, Role-Based Access Control (RBAC), linting and other code quality controls, gitops and kubernetes pipelines, and managing SaaS deployment APIs.

**Technologies (all optional; we are seeking a blend of skills)**
- Experience with containerized architectures, especially Kubernetes
- Experience with deployments into Kubernetes environments using tools such as Flux or Argo CD
- Infrastructure as Code (ARM, Terraform, or similar)
- Strong Cloud Knowledge (Azure Preferred)
- Logging frameworks (Azure Monitor, Splunk, Elastic, or similar)
- Telemetry Collection (Prometheus, Application Insights, etc.)
- Monitoring tools (Azure Monitor Application Insights, Sentry.io, Dynatrace, New Relic, Naggios, Zabbix)
- Azure DevOps (pipelines and releases) or Github Actions (Yaml build pipelines)
- Prior experience with large Jenkins or similar CI/CD technology is also acceptable.
- Complete command of source control (git), including branching strategies and policies
- Excellent command of GNU bash
- Experience with database deployment pipelines (i.e. dacpac's or similar technology)
- Experience with networking and security elements (VNets, Peering, Firewalls, NAT, etc.)
- Once or more unit testing (examples: Unitest, MS Test, Nunit) and mocking frameworks (examples:, RhinoMocks, Moq, Nsubstitute)
- (Bonus) Experience deploying Azure and/or Spark data components (Data Factory, Airflow, Data Lake (ACLs), Synapse)
- (Bonus) Experience in SSO (single sign-on), and federated security
- (Bonus) Experience with MLFlow and other MLOps pipeline technology

**Qualifications**

**Practices, Principles, Techniques**
- Continuous Integration/Continuous Deployment (CI/CD)
- Instrumentation strategy and Site Reliability Engineering (SRE)
- Release Communication and Collaboration
- Security and Compliance
- TDD (Test Driven Development, especially with respect to CI/CD and DevOps)

**EXAMS / CERTIFICATIONS: (OPTIONAL)**
- Microsoft Certified: Azure Solutions Architect Expert
- Microsoft Certified: DevOps Engineer Expert
- CKAD and/or CKA certifications

**Additional Information**

**We are an equal opportunity employer. All applicants will be considered for employment without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.**

**#LI-JH1**

**#REMOTE**

**Beware of scams



  • San José, Costa Rica Oracle A tiempo completo

    Site Reliability Engineer-230001K1 **Applicants are required to read, write, and speak the following languages***: English **Preferred Qualifications** Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and...

  • Site Reliability Engineer

    hace 2 semanas


    San José, Costa Rica Sysdig A tiempo completo

    Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...

  • Site Reliability

    hace 2 semanas


    San José, San José, Costa Rica Canonical - Jobs A tiempo completo

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • San José, Costa Rica Equifax A tiempo completo

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ - As a Site Reliability Engineer (SRE) you will combine software and systems engineering for building and running large-scale, distributed,...


  • San José, San José, Costa Rica Equifax A tiempo completo

    A Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering, security, and vulnerability management...


  • San José, Costa Rica BairesDev A tiempo completo

    BairesDev is proud to be one of the fastest-growing companies in Latin America and a welcoming, highly rated employer (Glassdoor Employee Score: 4.3). With more than 3500 employees in 27 countries and world-class clients from start-ups to Fortune 500 companies, we’re only as strong as the multicultural teams at the heart of our business. BairesDev runs on...


  • San José, Costa Rica Akamai A tiempo completo

    **Do you have a passion for cutting edge technologies and tackling system problems?** **Are you a self-starting professional who thrives in a dynamic environment?** **Join our Site Reliability team** **Help us shape the future of the Internet** As a Site Reliability Engineer, you will be responsible for: - Deploying, managing, and operating scalable,...


  • San José, Costa Rica Equifax A tiempo completo

    **Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate) **is a discipline that combines software and systems engineering for building...

  • Site Reliability Engineer

    hace 2 semanas


    San José, San José, Costa Rica Canonical - Jobs A tiempo completo

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and...


  • San José, Costa Rica Nucleus Health A tiempo completo

    A U.S.based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and...