Site Reliability and Automation Engineer
hace 14 horas
**Introduction**
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, lets talk.
**Your Role and Responsibilities**
Are you passionate about technology? Do you love building new things? Do you want to develop the future of IBM's Cloud offerings? If you answered YES, then we have the right opportunity for you
The shift toward the consumption of IT as a service, i.e, the cloud, is one of the most important
changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud.
With industry leadership in analytics, security, commerce, and cognitive computing and with
unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.
We are looking for a dynamic, Site Reliability and Automation Engineer to join our Cloud
Operations Team, who is responsive to market needs, to deliver value to our clients in a fast
- changing cloud landscape. The Cloud team is dedicated to ensuring the IBM Cloud is at the
forefront of cloud technology, from data center design to network architecture to storage and
compute clusters to flexible infrastructure services. We are building and operating IBM's VMware Solutions cloud platform to deliver performance and predictability for our customers' most demanding workloads, at global scale and with leadership efficiency, resiliency and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.
In this Site Reliability and Automation Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure. Your focus will be the following key responsibilities:
- Support and Operate Cloud Service delivery
- Automate health monitoring of the production and test systems
- Automate return to service procedures for Cloud Service delivery
- Support the compliance and security integrity of the environment through your work
- Partner with other teams, functional managers and program managers to deliver mission-critical services to the market
- Support development of new and existing capabilities for our compute, storage and network services.
- Integrate automation with operational requirements
Work with Engineering and Development to:
- Define operational requirements
- Automate operational requirements
- Provide initial assessment and possible workaround of production issue
- Troubleshoot and resolve production issues
Work with Support and Infrastructure to:
- Identify and resolve complex issues
- Discuss and plan integration tasks
Qualifications:
- Excellent written and verbal communication skills
- Comfortable operating in fast paced environment
**Required Technical and Professional Expertise**
- 2-3 years of experience in data center infrastructure, engineering and support
- Minimum of 2 years’ experience with hands-on production administration of large virtual system environments using VMware vSphere, VMware vCenter
- Experience with VMware NSX, vRealize Operations Manager, vRealize Network Insight.
- Experience in establishing, following, and improving operational procedures within a mission critical environment
- Experience in IT Change, Incident, Problem, Asset management
- Must be efficient in writing, debugging and maintaining scripts (Bash, Python, Powershell)
- Ability to do low level debugging and problem analysis by examining logs and running Unix commands
- 2-3 years of experience with open-source products
- Hands on knowledge using vRealize Log Insight or LogDNA
- Excellent written and verbal communication skills
**Preferred Technical and Professional Expertise**
- Experience in maintaining cloud based solutions with VMware vCloud Director
- Experience with Veeam Backup
- Experience with replication/failover using Zerto Platform, VMware vCloud
- Availability or Veeam Cloud Connect
- (Extensive) Experience with scripting languages, such as Bash, Powershell and Python
- Working knowledge with SQL (PostgreSQL, MSSQL) and Cloudant
- Working knowledge with Networking, sub-netting and Storage technologies
- Working knowledge with ServiceNow, JIRA, Confluence, and GitHub
**About Business Unit**
Digitization is accelerating the ongoing evolution of business, and clouds - public, private, and hybrid - enable companies to extend their existing infrastructure and integrate across systems. IBM Cloud provides the security, control, and visibility that our
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica Sysco Costa Rica A tiempo completo**Requirements**: - Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team. - Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite. - Partner with Level 1 support teams,...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completo**Introduction** At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's...
-
Sr Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica Encora A tiempo completo**Our company**: Encora is a global Software and Digital Engineering company that helps businesses overcome the Software Engineering Talent shortage and provides next-gen services such as Predictive Analysis, Artificial Intelligence & Machine Learning, IoT, Cloud, and Test Automation. We have 11 global offices and 36 innovation labs. Our Software...
-
Site Reliability Engineer Manager
hace 14 horas
Heredia, Costa Rica IBM A tiempo completo**Introduction** At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's...
-
Compute Operations Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most...
-
Site Reliability Engineering Professional
hace 8 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most...
-
Site Reliability Engineering Professional
hace 8 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most...
-
Site Reliability Engineering Professional
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most...
-
Senior Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you will...
-
Sr Qlty
hace 2 días
Heredia, Costa Rica TE Connectivity A tiempo completoTE Connectivity’s Quality and Reliability Engineering Teams analyze the ability of product and production systems to comply with customer and contractual requirements through established reliability factors. They design, recommend revisions and install quality control systems, develop and document analytical methods for establishing reliability of products...
-
Senior Site Reliability Engineer Manager
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction The IBM Software is seeking a talented and motivated SRE Manager professional to lead and manage a team of engineers focused on a global Cloud Platform solution servicing multiple IBM offering. Your Role and Responsibilities - Manage and lead a team of SRE engineers. This involves hiring, training, and mentoring team members, assigning tasks,...
-
Site Reliability Engineer
hace 9 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you will...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completo**Introduction** As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completo**Introduction** As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you...
-
Automation Engineer
hace 1 semana
Heredia, Costa Rica SGF Global A tiempo completo**Automation Engineer (Remote)** **Heredia, Costa Rica** SGF Global is looking for a Business Analyst, for IT enterprise in Heredia, Costa Rica. **Requirements**: - Must be a Graduate. BTech/BE or any other technical degree preferred - Years of Experience: 4+ - Developer of Grade 6 ,grade 7 or above for automation development role - Should have hands-on...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completo**Introduction** As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you...
-
Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completo**Introduction** As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you...
-
Mid Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you will...
-
Mid Site Reliability Engineer
hace 7 meses
Heredia, Costa Rica IBM A tiempo completoIntroduction As a SRE Engineer, you will work in an agile, collaborative environment to build, deploy, configure and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying workarounds or fixes. Working closely with our worldwide teams, you will...