Sr. Site Reliability Engineer
hace 5 días
Job Description - Sr. Site Reliability Engineer
**Title**:
Sr. Site Reliability Engineer
**Location**:
Remote, based in Costa Rica
**Job Overview**:
**Key responsibilities include**:
- Leadership and Mentorship: Direct and mentor junior SREs, fostering a culture of excellence, continuous improvement, and learning within the team.
- Strategy Development: Lead the creation and execution of sophisticated strategies for system optimization, ensuring scalability, reliability, and security at all levels.
- System Architecture Contribution: Engage in the design and review of system architecture, advocating for security and reliability best practices.
- Advanced Incident Management: Manage complex security incidents with expertise, guiding the team during crisis situations and ensuring swift, effective resolutions.
- Cross-Functional Collaboration: Serve as a primary liaison between the SRE team, IT department, and other technical and business units, driving cohesive efforts towards shared organizational goals.
- Innovation and Research: Champion innovation by researching, advocating for, and implementing cutting-edge technologies and methodologies to enhance system reliability and security.
- Procedure Development: Formulate and maintain up-to-date incident response procedures and playbooks, ensuring their effectiveness and compliance with industry standards.
- Post-Incident Analysis: Conduct thorough post-incident reviews, deriving insights and recommendations to prevent recurrence and improve system security and reliability.
- Collaboration and Detection: Work closely with the vSOC to enhance detection and reporting mechanisms for timely incident response.
- Threat and Vulnerability Assessment: Provide expertise in threat analysis, conduct vulnerability assessments, and perform penetration testing using leading-edge tools and techniques.
- Security Measures Implementation: Partner with the IT team to deploy security controls and measures that safeguard against future incidents while ensuring system compliance and reliability.
- Stakeholder Engagement: Develop and maintain relationships with key external stakeholders, staying abreast of the latest security trends and practices.
- Technology Proficiency: Utilize and manage advanced incident response and reliability tools, including Splunk, Crowdstrike Falcon Complete, and MS Defender, among others.
**Preferred Qualifications and Experience**:
- Educational Background: Bachelor's degree in Computer Science, Information Technology, or equivalent experience. Advanced degrees or specialized certifications in site reliability engineering or cybersecurity are preferred.
- Professional Experience: A minimum of 5-7 years in cybersecurity, with extensive experience in site reliability engineering, including leadership roles or substantial project management experience.
- Technical Expertise: Deep understanding of cyber threats, attack methodologies, incident response techniques, and a solid grasp of NIST and ISO 27001 frameworks, with the ability to lead in architecture design, advanced troubleshooting, and performance optimization.
- Leadership Skills: Demonstrated leadership capabilities, with experience in guiding projects, mentoring team members, and leading by example in a high-stakes environment.
- Strategic Planning: Proven track record in strategic planning and execution, aligning technical projects with broader business objectives.
- Tools Proficiency: Expertise in using incident response tools and technologies such as SIEM, XDR, and threat intelligence platforms, with advanced knowledge in Splunk administration and other critical technologies.
- Analytical Skills: Exceptional analytical and problem-solving abilities, capable of sifting through large data sets to identify and address security incidents effectively.
- Communication: Strong communication skills, with the capacity to articulate complex technical information clearly to both technical and non-technical stakeholders.
- Adaptability: Ability to thrive in a fast-paced, ever-changing environment, showing flexibility and a commitment to continuous learning and improvement.
- Desirable Skills: Familiarity with Qualys, Contrast Security, KnowBe4 PhishER, PCI, and SOX compliance, along with experience in using Pager Duty, Jira, and Confluence, is advantageous.
**Desirable Skills**:
- Advanced Technical Skills: Experience with leading-edge technologies or methodologies, such as cloud-native technologies, Kubernetes, or advanced automation and orchestration platforms.
- Industry Leadership: Contributions to the field through speaking engagements, publications, or active participation in relevant professional communities are highly valued.
-
Site Reliability Engineer
hace 5 horas
San Pedro, Costa Rica CRG Solutions A tiempo completoReporting to the Director of Solutions Engineering, the Site Reliability Engineer provides technical and process guidance specific to a business unit. Key areas of impact this role provides are in depth knowledge of the engineering environments within the specific business unit and providing automated, stable, and Automation Solutions Engineering, CI/CD...
-
Site Reliability Engineer
hace 5 días
San José, Costa Rica VS-Staffing A tiempo completoJob Description - Site Reliability Engineer - Remote Costa Rica **Title**: Site Reliability Engineer **Location**: Remote, based in Costa Rica **Job Overview**: **Key responsibilities include**: - Incident Management: Lead the response to security incidents through identification, containment, analysis, and mitigation strategies to minimize impact. -...
-
Site Reliability Engineer
hace 7 días
San José, Costa Rica Scalable Systems A tiempo completoScalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...
-
Senior Site Reliability Engineer
hace 2 semanas
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Site Reliability Engineer
hace 2 días
San José, Costa Rica Equifax A tiempo completoEquifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. _ **What you’ll do**: - You will influence and design the infrastructure, architecture, standards, and methods for large-scale systems. - Will...
-
Site Reliability Engineer
hace 1 semana
San José, San José, Costa Rica Cisco Systems Costa Rica, Sociedad Anonima A tiempo completoMeet the TeamAt Cisco, we know that technology can connect, empower, and drive us. Our mission is to simplify technology so our customers can focus on what's most meaningful to them: their students, patients, customers, and businesses. We're making networking easier and faster with technology that simply works. Our CETO Sales Automation and Artificial...
-
Senior Site Reliability Engineer, Americas
hace 7 días
San José, Costa Rica Canonical - Jobs A tiempo completo**Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...
-
Site Reliability Engineer
hace 4 días
San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ - SREs in our team...
-
Site Reliability Engineer
hace 1 semana
San José, Costa Rica Equifax A tiempo completoSite Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. _ - SREs in our team...
-
Site Reliability Engineer
hace 2 semanas
San José, Costa Rica BairesDev A tiempo completoBairesDev is proud to be one of the fastest-growing companies in Latin America and a welcoming, highly rated employer (Glassdoor Employee Score: 4.3). With more than 3500 employees in 27 countries and world-class clients from start-ups to Fortune 500 companies, we’re only as strong as the multicultural teams at the heart of our business. BairesDev runs on...