Cloud Kubernetes Reliability Specialist
hace 2 días
About Us
">VMware's Cross-Cloud SaaS Platform team is responsible for delivering the public cloud infrastructure and Managed Kubernetes clusters that host all of VMware's SaaS products consumed by our customers. The platform is globally distributed and built using a combination of industry-standard open-source solutions and VMware Products.
The Team
">We host VMware's most significant SaaS products in production, including VMware Cloud on AWS, CarbonBlack, Networking (NSBU) and Cloud Management Business Unit (CMBU) services. Our Site Reliability Engineering team ensures the uptime and reliability of our Managed Platform and provides value add services to our customers and internal VMware engineering teams.
Key Responsibilities
">- Assist the team in continuing to ensure that our uptime and reliability are aligned with the defined Service Level Objectives (SLOs).
- Design solutions, automate everything, reduce toil, and be a fantastic collaborator working well in a globally distributed team.
- Experience in managing and automating Public Cloud (AWS/GCP/AZURE) Infrastructure and Kubernetes is required.
- Inquisitive and determined to get to the root cause of a problem ensuring that we don't see repetitive issues, or better yet, have the foresight to address them before they occur.
- Assist with our Observability stack, such as logging, metrics, tracing and dashboards.
Pillars of Focus
">We have three functional pillars within the Platform Team:
- Managed Platform - Deploy, manage and support the uptime and reliability of our SaaS platform, including our public cloud configurations, 10's of thousands of pods running on hundreds of Kubernetes clusters.
- Managed Services - Provide value-add cloud-native services that run on Kubernetes. We currently support VMware's Managed Kafka, Observability stacks, logging services, secrets management and more.
- Security & Compliance - The platform is certified in FedRAMP High, PCI, HIPAA, SOC2 and more. We are constantly innovating, building new services and automating to secure our platform and increase our security posture with least privilege principles and zero trust.
What You'll Be Doing
">This engineering team moves at lightning speed adopting leading edge technologies. You'll be pulling things apart and tinkering, building new platforms, or playing in the cloud. Here, the engineering opportunities are endless.
You'll be working together and across the organization to ensure we maintain our SaaS Platform running on Kubernetes ensuring we meet our SLOs. Strong knowledge in SRE with experience in Kubernetes as an Admin would be ideal. You will ideally have contributed to opensource and keen to develop services that all of VMware SaaS engineering products and teams will consume.
You must be driven, understand and can demonstrate SRE best practices with strong expertise in troubleshooting Kubernetes clusters. Have a strong understanding of Observability, Release Management, exposure to Cloud services and excellent hands-on expertise in Python and ideally Golang.
This is an extremely exciting role working with a very strong team right at the heart of VMware's SaaS transformation journey.
-
Senior Cloud Reliability Engineer
hace 3 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at IBM. As a key member of our global infrastructure team, you will be responsible for designing, deploying, and maintaining scalable and highly available cloud-based systems.Key ResponsibilitiesLead the problem resolution process for our clients, from analysis and...
-
Senior Cloud Infrastructure Engineer
hace 1 semana
San Francisco, Heredia, Costa Rica Ibm A tiempo completo**Job Overview**We are seeking a skilled Senior Cloud Infrastructure Engineer to join our team in ensuring the performance, reliability, and scalability of AI & ML driven voice agent microservices, Kubernetes clusters, AWS cloud infrastructure, network services, and storage layers.**Key Responsibilities**The ideal candidate will work closely with other...
-
Site Reliability Engineer
hace 3 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key ResponsibilitiesIdentify and investigate issues with our cloud infrastructureDevelop and implement solutions to improve the...
-
Cloud Architecture Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Experian A tiempo completoWe're looking for a highly skilled Cloud Architecture Specialist to design and implement new products and features across the entire AWS stack.This is a permanent, remote role in Costa Rica with a competitive benefits package and a focus on diversity and inclusion.Key Responsibilities:Design and implement cloud-based architectures using AWS Serverless...
-
Cloud Platform Deployment Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Experian A tiempo completoExperian is a leading global information services company, empowering consumers and clients to manage their data with confidence. With over 20,000 employees operating across 44 countries, we invest in new technologies, talented people, and innovation to help our clients thrive.We are seeking an experienced Cloud Platform Deployment Specialist to support the...
-
Data Reliability Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Moody'S A tiempo completoRole OverviewAt Moody's, we are looking for a talented Data Reliability Specialist to join our team. As a key member of our data reliability team, you will be responsible for ensuring the accuracy and quality of our data systems. Key Responsibilities:Champion data integrity by being the guardian of data accuracy across our systems.Innovate in error...
-
Technical Support Specialist for Cloud Solutions
hace 2 semanas
San Francisco, Heredia, Costa Rica Cloud Software Group A tiempo completoAbout the Role">We are seeking a skilled Technical Support Specialist to join our team at Cloud Software Group. As a Technical Support Specialist, you will be responsible for providing top-notch support to our customers, ensuring they have a seamless experience with our cloud-based products.Key ResponsibilitiesAnswering incoming customer support requests in...
-
Cloud Technical Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoSRE Engineer Role Summary:As a Cloud Technical Specialist, you will work in an agile environment to build, deploy, configure and maintain systems for IBM clients.Be the primary point of contact for clients, responsible for client service requests and ensure timely resolution.Identify and investigate issues, provide expert advice and guidance to clients on...
-
Site Reliability Engineering Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoAbout IBMIBM is a leading technology company that enables businesses to innovate and succeed in the digital era. With a rich history of innovation, IBM has established itself as a trusted partner for organizations around the world.Job SummaryAs a Site Reliability Engineering Professional at IBM, you will play a critical role in ensuring the smooth operation...
-
Cloud Platform SRE Engineering Lead
hace 2 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoAbout the RoleThis position offers an exciting opportunity to lead a team of skilled SRE engineers, focusing on a global Cloud Platform solution. Key responsibilities include managing and mentoring team members, providing architectural guidance, and collaborating with cross-functional stakeholders to deliver high-quality solutions.Your ResponsibilitiesManage...
-
Site Reliability Engineer
hace 3 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at IBM. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and infrastructure.Key ResponsibilitiesLead the problem resolution process for our clients, from analysis and troubleshooting to deploying workarounds...
-
Enterprise Reliability Expert
hace 1 mes
San Francisco, Heredia, Costa Rica Ibm A tiempo completoAbout the RoleWe are seeking an experienced Enterprise Reliability Expert to join our global team managing one of IBM's leading security solutions. This is a unique opportunity to work in a fast-paced and rewarding environment with access to the latest education, tools, and technology.Your ResponsibilitiesDevelop and maintain automated processes, tools, and...
-
Cloud Software Group Technical Solutions Specialist
hace 19 horas
San Francisco, Heredia, Costa Rica Cloud Software Group A tiempo completoAbout Cloud Software GroupCloud Software Group is a leading provider of cloud software solutions, serving over 100 million users worldwide. We value diversity, collaboration, and innovation in our work environment.Our TeamWe are seeking an experienced Technical Solutions Specialist to join our team. This role will focus on advanced problem analysis and...
-
Reliability Engineering Specialist
hace 4 días
San Francisco, Heredia, Costa Rica Ibm A tiempo completoOverviewWelcome to IBM, where innovation meets reliability. As a Site Reliability Engineer, you will be at the forefront of building and maintaining systems that power our client business.
-
Cloud Software Group Technical Support Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Cloud Software Group A tiempo completoAbout Us:Citrix and TIBCO merged to create Cloud Software Group, one of the world's largest cloud solution providers.We value diverse lived experiences, passion for technology, and the courage to take risks. Everyone is empowered to learn, dream, and build the future of work.As a global leader in cloud computing, we are constantly innovating and pushing the...
-
Senior Site Reliability Engineer
hace 3 meses
San Francisco, Heredia, Costa Rica Ibm A tiempo completoJob SummaryWe are seeking a highly skilled Cloud Engineer to join our team as a Senior Site Reliability Engineer. As a key member of our infrastructure team, you will be responsible for designing, deploying, and maintaining large-scale cloud-based systems. Your expertise in cloud computing, DevOps, and system administration will enable you to identify and...
-
Cloud Infrastructure Engineer
hace 1 mes
San Francisco, Heredia, Costa Rica Aligntech A tiempo completoAbout UsAligntech is a leading medical device company transforming smiles and changing lives worldwide. Our team of innovative employees develops cutting-edge technology, tools, and treatment options for dental professionals.Job SummaryWe are seeking an experienced Cloud Infrastructure Engineer to support our organization in building and maintaining...
-
Cloud Operations Lead
hace 3 semanas
San Francisco, Heredia, Costa Rica Encora A tiempo completoCompany OverviewEncora is a global leader in software and digital engineering, providing innovative solutions to overcome the Software Engineering Talent shortage.We have 11 global offices and 36 innovation labs, working with leading-edge technology companies to improve their speed to impact.About This RoleWe are seeking an experienced Cloud Operations Lead...
-
Cloud Software Collection Specialist
hace 2 meses
San Francisco, Heredia, Costa Rica Cloud Software Group A tiempo completoAbout Cloud Software GroupAs a leading cloud software provider, Cloud Software Group combines the capabilities of Citrix and TIBCO to serve over 100 million users worldwide.We are committed to making a positive impact on people's lives by providing reliable cloud solutions. Our diverse team values different perspectives and encourages learning, innovation,...
-
Cloud Software Support Specialist
hace 19 horas
San Francisco, Heredia, Costa Rica Cloud Software Group A tiempo completoAbout Cloud Software GroupWe combine the capabilities of Citrix and TIBCO, creating a leading cloud software provider serving over 100 million users worldwide.Our team is dedicated to making a difference in people's lives by providing reliable cloud solutions that enable work from anywhere.We value diverse perspectives, courage to take risks, and encourage...