Site Reliability Engineer
5 months ago
Reporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and
process guidance specific to a business unit. The key areas of impact for this role are in-depth knowledge
of the engineering environments within the specific business unit and providing automated, stable, and
Automation Solutions Engineering, CI/CD Solutions Engineering, and Cloud Engineering on the DevOps
background in several types of engineering including: Software, Systems, Network, Security, Cloud
(Public/Private/Community/Hybrid), Automation, and Development Operations.
**ESSENTIAL DUTIES AND RESPONSIBILITIES**:
Our employees are tasked with delivering excellent business results through the efforts of their teams.
These results are achieved by:
- Monitoring and attending to site-specific engineering IaaS and PaaS systems.
- Working closely with site-specific engineering teams to provide quality release management services.
- Consuming automation from the teams listed above to help provide an automated, stable, and consistent development environment.
- Being accountable for working with engineering teams to help provide a minimum of 99.9% infrastructure uptime in accordance with current service level agreements.
- Helping to maintain standards for documentation, standard operating procedures, and work instructions.
- Addressing and coordinating efforts on escalated incidents.
- Maintaining and enforcing ITIL standards and procedures around service strategy, design, transition, operations, and continuous improvement.
- Assisting in managing vendor relationships.
- Performing all other duties as assigned.
Reasonable accommodations may be made to enable individuals with disabilities to perform the essential
functions of this position.
**MINIMUM KNOWLEDGE, SKILLS AND ABILITIES**:
The requirements listed below are representative of the experience, education, knowledge, skill and/or
abilities required.
- 2+ years in cloud operations, designing solutions leveraging public cloud IaaS and PaaS including
Virtual Machine, Virtual Networking, Load Balancing and other HA technologies, headless
architecture and containerization.
- 5+ years working in technical software or infrastructure engineering teams.
- Configuration management experience using popular automation products such as Azure
Automation, PowerShell DSC, Chef, Ansible, SaltStack, Puppet, or others.
- IaC (Infrastructure as Code) automated provisioning experience using popular automation products
such as Azure Resource Manager Templates, Terraform, UCS Director, VMware vRealize
Automation, or others.
- Experience in developing or participating in cloud migration strategies
- Designing and reporting metrics and dashboards to drive SLA compliance.
- Knowledge and experience in ITIL framework-based operations and reporting.
- Advanced large-scale server and network management technology
- Procedure and policy design, documentation and dissemination
- Automation and analysis of performance and reliability data
- Understanding of global network design and inter-networking
- Security engineering, server and network hardening, auditing and reporting
- Participation in vendor and contract negotiation
- Strong technical, analytical, problem solving and verbal and written communication skills.
**PREFERRED QUALIFICATIONS**:
- Prior healthcare experience would be a plus.
- BA in a computer-related field preferred but not required
- Industry certifications such as Cisco CCNA, VMware VCP, AWS Professional or Azure
MCSA/MCSE, Jenkins Certified Engineer, AWS Certified DevOps Engineer or Microsoft Azure
DevOps Solutions Certification
**PHYSICAL/MENTAL DEMANDS AND WORKING ENVIRONMENT**:
The physical and mental requirements along with the work environment characteristics described here are representative of those an individual encounters while performing the essential functions of this position.