Senior SRE

1 week ago


Escazú, Escazú, Costa Rica | ImagineX Consulting | Full-time

This remote role is based in Costa Rica or El Salvador and only open to citizens and permanent residents of Costa Rica or El Salvador who do not need visa sponsorship.  

Academic Level  
BS/MS degree in Computer Science, Engineering or a related subject.  


Description  
ImagineX Studio - Costa Rica is a product-oriented boutique software development company. We partner with our clients to become their product team, helping them from ideation to product launch. We commit to delivering quality, impactful, ground-breaking products that realize our clients' vision and make their lives easier.

We're looking for a highly skilled Senior SRE / Platform Engineering Lead to drive reliability, scalability, and operational excellence across our automation and platform ecosystem. This role blends advanced SRE practices with strong platform engineering expertise across distributed systems, observability, CI/CD, and infrastructure design.

You will partner closely with development, architecture, and operations teams to consolidate best practices, define standards, and guide the build-out of core platform capabilities. If you love designing systems that scale, creating guardrails, and mentoring teams toward modern engineering practices, this is the place to make a huge impact.

This person becomes the connective tissue between SRE, platform, and engineering teams: the one who sets the standard for observability, reliability, and reusable platform patterns.
 

Responsibilities 

- Lead the buildout of enterprise observability using Prometheus, Grafana, ELK, and Splunk, including dashboards, alerting rules, log pipelines, and distributed tracing.
- Maintain scalable monitoring, logging, and telemetry systems that support high-availability services.
- Partner with developers to improve system reliability, performance, and security.
- Drive automation to reduce manual work across operational workflows.
- Bring strong hands-on engineering skills while also mentoring teams on SRE and platform concepts.
- Collaborate across SRE, DevOps, Platform, and Product teams to align priorities.
- Help consolidate enterprise best practices for:
  - Infrastructure as Code
  - CI/CD pipelines
  - Security, secrets management, and RBAC
  - API governance & integration patterns


Required Skills

- Deep experience with Prometheus + Grafana for metrics and alerting.
- Strong hands-on knowledge of ELK/Elastic Stack and Splunk for log aggregation and observability.
- Strong understanding of distributed systems, scalability, and HA patterns.


Preferred Skills

- Working knowledge of Redis and Postgres in production environments.
- Working knowledge of MongoDB (document schema design, performance tuning).
- Experience operating production Kubernetes environments.
- A networking background is a strong plus but not required.
- Experience with Postgres or Mongo migrations, schema evolution, and performance testing.