Manager, Site Reliability Engineer
hace 2 semanas
ROLE SUMMARY
As Pfizer strives to Win the Digital Race in Pharma, there is no time to lose, and no tolerance for downtime. In Digital Command Operations, we use best-in-class approaches to ensure that the infrastructure services that underpin Pfizer’s critical digital solutions are robust, reliable, performant and efficient. We partner closely with Product teams, Process teams and Third-party Service Providers to achieve this aim. We value our team members’ technical knowledge, but also their passion, teamwork, and commitment to growth.
- Ensuring services are running smoothly: identifying and regularly reviewing operational health indicators to ensure delivery to defined service levels.
- Continuously analysing service data to identify service improvement opportunities. Opportunities will be assessed on their potential to remove manual effort (‘toil’), improve service reliability, and enhance customer experience.
- Developing, or partnering with other teams to develop automation that will remove toil from the environment, and deliver a more reliable, cost-effective service to the company.
ROLE RESPONSIBILITIES
- Act as focal point for the day-to-day operation of Cloud services at Pfizer. Work to embed core principles and practices of Site Reliability Engineering for those services.
- Continuously review operational indicators, ensuring that the service is meeting quality and reliability outcomes expected by its customers. Oversee the delivery of services by outsource partners.
- Analyse service data to identify service improvement opportunities. Maintain a backlog of prioritized opportunities. Pursue items in the backlog in an agile fashion, favouring rapid iteration and continuous delivery of work products.
- Though having software development skills are not a hard requirement for this role, those skills would be an advantage. A shared development team will be available to work with the SRE on automation development. The SRE will ensure that any code is developed to agreed standards, aligned with other service areas, and released in a compliant manner.
- Constantly evaluate cost of service delivery for service in question. Seek to gain cost improvements as toil is removed.
- Assume a leadership role on major outages or planned events that impact the service. Work closely with Pfizer’s Command Center to ensure processes are optimised for management of major events including proactive monitoring, escalation plans and troubleshooting guides.
- Lead post-mortems for major events pertaining to the service. Coordinate reliability testing on services in scope.
BASIC QUALIFICATIONS
- Bachelor's degree in Computer Science or related technical field, or equivalent practical experience.
- 5+ years’ experience working in similar environments and/or roles.
- Experience of working with Cloud services, particularly AWS services.
- Familiarity with enterprise infrastructure concepts and technologies on which Cloud services rely (i.e., network connectivity, authentication, etc.)
- Strong data literacy and analytical ability. Must be able to aggregate data from different sources, derive insights and communicate actionable information.
- Proven ability to build and improve processes and workflows with a record of simplifying and streamlining. Relentless focus on removing toil (manual effort) through process re-design and automation.
- Skilful in navigating through and working with different functional teams to reach positive process outcomes.
- Able to communicate in a succinct, accurate and timely fashion. Ability to effectively communicate technical concepts to business recipients.
PREFERRED QUALIFICATIONS
- Exposure to agile ways of working including Scrum.
- Understanding of software development concepts and standards a distinct advantage.
- Experience of working in a regulated environment.
LI-PFE
NON-STANDARD WORK SCHEDULE, TRAVEL OR ENVIRONMENT REQUIREMENTS
Command Center & Ops is a 24x7 operations team, with team members spread across multiple geographies. Some flexibility in work hours in order to connect with these colleagues may be occasionally required.
ORGANIZATIONAL RELATIONSHIPS
- Peer SREs
- Product/Engineering teams
- Managed Service Providers/Vendors
- Application support teams
Work Location Assignment: Flexible
Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates.
Information & Business Tech
LI-PFE
-
Site Reliability Engineer
hace 2 semanas
San Pedro, Costa Rica CRG Solutions A tiempo completoReporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...
-
Site Reliability Engineer
hace 4 semanas
San Francisco, Heredia, Costa Rica Sysco Costa Rica A tiempo completo**Requirements**:- Develop and refine strategy and process for all support issue tracking from intake through resolution in conjunction with senior members of the team.- Contribute to, and occasionally lead, strategic discussions to continue the evolution of flexibility and sustainability of the entire product suite.- Partner with Level 1 support teams,...
-
Site Reliability Engineer
hace 4 semanas
San José, San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-230001K1**Applicants are required to read, write, and speak the following languages***: English**Preferred Qualifications**Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.Design, write, and deploy software to improve the availability, scalability, and efficiency of...
-
Site Reliability Engineer
hace 1 semana
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
hace 4 semanas
San José, Costa Rica Oracle A tiempo completoSite Reliability Engineer-2200087I**Applicants are required to read, write, and speak the following languages**: English**Preferred Qualifications**Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.Understand the end-to-end configuration, technical dependencies, and...
-
Site Reliability Engineer
hace 4 semanas
San José, San José, Costa Rica Fullstack Labs A tiempo completoFullStack is the fastest-growing software consultancy in the Americas.We help organizations like Uber, GoDaddy, MGM, Siemens, Stanford University, and the State of California, build distributed software development teams, and deliver transformational digital solutions.As an employee-first company, we focus on hiring the most talented software designers and...
-
Site Reliability Engineer
hace 3 semanas
San José, Costa Rica VS-Staffing A tiempo completoJob Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...
-
Site Reliability Engineer
hace 2 semanas
Ubicación San José, San José, Costa Rica Udersol A tiempo completoRequisitos: Technical Requirements: - Bachelor-s degree in computer science, IT or other highly technical, scientific discipline. - 3+ Years experience in a Site Reliability role. - Ability to program with one or more high level languages, such as Python, Ruby, and Javascript. Experience with automation and scripting languages, including CloudFormation and...
-
Site Reliability Engineer
hace 4 semanas
San José, San José, Costa Rica Hitachi Solutions A tiempo completoCompany DescriptionHitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...
-
Site Reliability Engineer
hace 4 semanas
San José, San José, Costa Rica Hitachi Solutions Ltd A tiempo completo**Company Description**Hitachi Solutions is a global Microsoft solutions integrator passionate about developing and delivering industry-focused solutions that support our clients to deliver on their business transformation goals.Our industry focus, expertise, and intellectual property is what truly sets us apart.We have earned, and continue to maintain, a...
-
Senior Site Reliability Engineer
hace 7 días
San José, San José, Costa Rica Equifax A tiempo completoKey ResponsibilitiesAs a Cyber Security Site Reliability Engineer, your key responsibilities will include:Influencing infrastructure, architecture, standards, and methods for large-scale systems.Supporting services prior to production via infrastructure design, software platform development, load testing, capacity planning, and launch reviews.Maintaining...
-
Site Reliability Engineer
hace 3 semanas
San José, San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles._- SREs in our team take...
-
Sr. Site Reliability Engineer
hace 3 semanas
San José, Costa Rica VS-Staffing A tiempo completoJob Description - Sr. Site Reliability Engineer **Title**: Sr. Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team. - Strategy Development:...
-
Site Reliability Engineer
hace 1 semana
San José, Costa Rica Sysdig A tiempo completoSysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run. We value...
-
Site Reliability Engineer
hace 2 semanas
San José, San José, Costa Rica Scalable Systems A tiempo completoScalable Systems?is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions.By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge.**Openings**:**Title**: Site Reliability...
-
Cloud Security Site Reliability Lead
hace 7 días
San José, San José, Costa Rica Equifax A tiempo completoJob DescriptionWe are seeking a highly skilled Cyber Security Site Reliability Engineer to join our team. As a member of our Engineering department, you will play a key role in ensuring the reliability and performance of our internal and external services.The ideal candidate will have a strong background in software development, systems engineering, and...
-
Senior Site Reliability Engineer
hace 3 semanas
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Site Reliability Engineer
hace 2 semanas
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Site Reliability Engineer
hace 3 semanas
San José, San José, Costa Rica Bairesdev A tiempo completoBairesDev is proud to be one of the fastest-growing companies in Latin America and a welcoming, highly rated employer (Glassdoor Employee Score: 4.3).With more than 3500 employees in 27 countries and world-class clients from start-ups to Fortune 500 companies, we're only as strong as the multicultural teams at the heart of our business.BairesDev runs on...
-
Manager, Site Reliability Engineer
hace 2 semanas
San José, San José, Costa Rica Pfizer A tiempo completoROLE SUMMARYAs Pfizer strives to Win the Digital Race in Pharma, there is no time to lose, and no tolerance for downtime.In Digital Command Operations, we use best-in-class approaches to ensure that the infrastructure services that underpin Pfizer's critical digital solutions are robust, reliable, performant and efficient.We partner closely with Product...