Site Reliability Engineer

1 week ago


San José, San José, Costa Rica Splunk Inc Full-time

Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone.
We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers.
At Splunk, we're committed to our work, customers, having fun and most importantly to each other's success.
Learn more about Splunk careers and how you can become a part of our journey.

The Cloud organization at Splunk focuses on building and maintaining robust and resilient platform solutions for SaaS hosting of Splunk's enterprise software.
Our main technologies are Cloud Infrastructure based, focusing on Puppet and Terraform.

The Splunk Cloud TechOps team is globally distributed with teams based in SF, Plano, Sydney, London, Costa Rica and India.

It is the responsibility of the TechOps engagement team to monitor and resolve issues that affect the availability and performance of Splunk for our cloud customers 24/7.
As the authority on our customers' experience, the engagement team is the front line of defense in making sure each of our customers has an exceptional experience.

The TechOps engagement team provides a backstop for all staff on any questions or issues that arise during their shift related to their technical area of expertise.
The TechOps engineers lead their respective queue and ensure all requests coming into that queue are addressed in a timely manner.

Responsibilities:

We are looking for a TechOps Engineer to help maintain, contribute to, and improve the next generation of our large-scale Cloud offering.
You will be working with providers and supporting the infrastructure that powers Splunk's cloud offering.

- Provide technical support for the Splunk Cloud fleet
- Perform impact assessments and problem solving according to established procedures
- Document issues and remediation steps, and help with follow-up problem management
- Assist other TechOps engineers on your shift with complex tasks
- Use the internal tools to restore normal service operations as quickly as possible to minimize the impact to business operations during escalated incidents
- Lead by example and drive the core values of the company
- Always ensure a quality customer experience

You love large complex systems.
You have experience working on distributed systems or a passion for finding edge cases that appear at scale.
You're interested in how to take something from a small one-off task to implementing it across several thousand machines at once.
Data drives your decisions and excites you; you make decisions based on numbers rather than assumptions.
If an issue arises, you strive to be alerted before our customers notice.

What We Provide:

- Opportunities to develop and grow as an engineer. We are always expanding into new areas, working with cross-collaborative teams, and exploring new technologies.
- A team of awesome, capable and dedicated peers, all the way from engineering to product management and customer support.
- Breadth and depth. You are interested in working in an area that dynamically scales to meet the needs. You want to go deep into optimizing how we automate every manual process and tedious task we encounter.
- Growth and mentorship. We believe in growing engineers through ownership and leadership opportunities. We also believe that mentors help both sides of the equation.
- A stable, collaborative, and supportive work environment. Honesty and collaboration are values we see as a core of our team identity. We understand the value of open communication: working together to get things done and adapting to the changing needs of the team and individuals. This is reflected both in our internal communications and in how we interact with our customers.
- Balance. We're not expecting people to work 12-hour days. We want you to be successful outside of work too and welcome work-from-home days. We trust our colleagues to be responsible with their time and commitment, and believe that balance helps cultivate a positive environment.

Requirements:

Requires a minimum of 2 years of related experience with a technical Bachelor's degree; or equivalent practical experience.



  • San José, San José, Costa Rica Oracle Full-time

    Site Reliability Engineer-2200087E **Applicants are required to read, write, and speak the following languages**: English **Preferred Qualifications** Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and...

  • Site Reliability Engineer

    4 weeks ago


    San José, San José, Costa Rica Scalable Systems Full-time

    Scalable Systems is a USA-based Big Data, Analytics and Digital Transformation Company focused on vertical, innovative solutions. By providing next-generation technology solutions and services, we help organizations to identify risks & opportunities, achieve operational excellence, and gain an innovative edge. **Openings**: **Title**: Site Reliability...


  • San José, San José, Costa Rica Wikimedia Foundation Full-time

    **Staff Site Reliability Engineer (Traffic)** **Summary** We are looking for a Staff Site Reliability Engineer to support and develop the platform serving the world's favorite encyclopedia to millions of people around the globe. Wikimedia's Site Reliability Engineering (SRE) team is principally responsible for ensuring our global top-15 website, our...


  • San José, San José, Costa Rica Vs-Staffing Full-time

    About the Job: This role is an exciting opportunity to join our team as a Site Reliability Engineer. You will be working closely with our IT team to deploy security controls and measures that safeguard against future incidents while ensuring system compliance and reliability. The successful candidate will have experience in site reliability engineering and be...


  • San José, San José, Costa Rica Equifax Full-time

    **Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you.** **Cyber Security Site Reliability Engineer (SRE Intermediate)** is a discipline that combines software and systems engineering for building and...


  • San José, San José, Costa Rica Nucleus Health Full-time

    A U.S.-based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and...


  • San José, San José, Costa Rica Pfizer Full-time

    ROLE SUMMARY: As Pfizer strives to Win the Digital Race in Pharma, there is no time to lose, and no tolerance for downtime. In Digital Command Operations, we use best-in-class approaches to ensure that the infrastructure services that underpin Pfizer's critical digital solutions are robust, reliable, performant and efficient. We partner closely with Product...


  • San José, San José, Costa Rica Bairesdev Full-time

    Who We Are: BairesDev is proud to be the fastest-growing company in America. With people on five continents and world-class clients, we are only as strong as the multicultural teams at the heart of our business. To consistently deliver the highest quality solutions to our clients, we only hire the Top 1% of the best talents and nurture their professional growth...


  • San José, San José, Costa Rica Akamai Full-time

    **Accelerate Your Career with Akamai** We are seeking a highly skilled Site Reliability Engineer II to join our SRE Infrastructure team. As a key member of this team, you will play a vital role in ensuring the reliability and performance of our critical systems and services. Your primary focus will be on the operational stability and performance of our dev,...


  • San José, San José, Costa Rica Equifax Full-time

    At Equifax, Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. Our SRE team ensures that internal and external services meet or exceed reliability and performance expectations while adhering to our engineering principles. SREs take an...


  • San José, San José, Costa Rica Crg Solutions Full-time

    Crg Solutions is looking for a reliable and scalable engineer to work on our team. The ideal candidate will be responsible for providing technical and process guidance to a business unit, ensuring high levels of infrastructure uptime and reliability. Main Responsibilities: - Monitoring and maintaining site-specific engineering IaaS and PaaS systems - Collaborating...


  • San José, San José, Costa Rica Intel Full-time

    Develops, applies, and maintains quality and reliability standards for processing materials into partially finished or finished product. - Evaluate the materials, process and techniques used in production to meet the requirement of products and production equipment. - Specifies inspection and testing mechanism, conduct quality assessment (up to and including...


  • San José, San José, Costa Rica Canonical - Jobs Full-time

    This role is an opportunity for a hands-on technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have an affinity for open source development and a passion for technology, then you will enjoy working with some of the best people in the industry at...


  • San José, San José, Costa Rica Vertrical Gmbh Full-time

    Vertrical GmbH is a software start-up that understands the challenges faced by freelance DevOps and Site Reliability Engineers. We're not a staffing or recruiting agency; instead, we code ourselves and partner with clients in the healthcare industry to develop their digital solutions. Our ideal candidate will have experience with load balancers, target...


  • San José, San José, Costa Rica Intel Full-time

    Have you been waiting all year for a famous dish and that one time it just didn't taste the same as the last one? That's what Quality and Reliability is all about, putting a happy face on our customers every single time because they get exactly what they thought they would get. Do you want to be part of enhancing the life of every person on earth? If you are into...


  • San José, San José, Costa Rica Intel Full-time

    Microelectronic Quality Reliability Engineers provide project management, product, process design/development and sustaining support for integrated circuit or semiconductor assemblies, various other electronic components, sub systems and/or completed units. Responsible for physical understanding, model prediction and enhancement of quality and reliability for...


  • San José, San José, Costa Rica Splunk Inc Full-time

    Join us on the Splunk TechOps team, empowering our customers to execute our vision making machine data accessible, usable, and valuable to everyone. The Splunk TechOps organization runs Splunk cloud, blending SRE, Systems Engineering and Service Engineering disciplines, across functional global teams. Come join a team that is striving for operational...


  • San José, San José, Costa Rica Vertrical Gmbh Full-time

    **Our Mission** We are dedicated to shaping the future of superior healthcare through our digital innovations. As a Freelance DevOps / Site Reliability Engineer, you will be part of our in-house task force, working on projects that make a real difference in people's lives.


  • San José, San José, Costa Rica International Talent Resources Full-time

    Experience required: - CKAD/CKA certification - Excellent troubleshooting skills across many layers (storage, networking, hypervisor, OS) - Experience building and operating highly available and scalable infrastructure solutions: you'd probably have worked with Kafka, Zookeeper or similar tech - Experience with infrastructure-as-code tools: Terraform, Packer and...


  • San José, San José, Costa Rica Wikimedia Foundation Full-time

    **Summary** **You will be expected to**: - Manage one to two globally distributed teams within Wikimedia's Site Reliability Engineering organization - Recruit, hire, and help onboard new team members - Work with team members to set team objectives and individual performance goals, and support them in meeting and evolving their goals and career path - Triage...