hero

Career Opportunities at SJF Portfolio Companies

SJF Ventures
companies
Jobs

Staff Site Reliability Engineer

ShipMonk

ShipMonk

Software Engineering
Czechia
Posted on Mar 14, 2026

We are seeking an influential Staff SRE to help architect and drive the strategic evolution of our core cloud and deployment infrastructure, shifting our operations toward a more robust, self-service developer platform. This is a highly strategic, but hands-on role for an engineer ready to challenge inefficiencies and contribute to continuous improvement initiatives, from concept to production.


Key responsibilities and scop

  • ePlatform Architecture: Propose the design, implementation, and maintenance of core cloud and deployment systems, advocating for self-service patterns
  • .Kubernetes and Cloud Orchestration: Take ownership of the scalability, security, and optimization of production Kubernetes clusters and the underlying AWS accounts management structure
  • .CI/CD Strategy: Drive best practices across our CI/CD pipelines, optimizing performance and reliability of GitLab CI runners and standardizing deployment flows using ArgoCD
  • .Infrastructure Core Services: Provide administrative expertise and reliability improvements for critical services, including RabbitMQ and the enterprise VPN
  • .Observability Leadership: Improve the organization’s vision for monitoring, tracing, and logging, and manage the strategic use and optimization of Datadog management across all environments

.Skills and qualification

  • s6+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles
  • .Deep expertise in AWS multi-account environments (Networking, Security, IAM)
  • .Expert-level knowledge of Kubernetes administration, networking, and deployment strategies
  • .Strong operational experience with messaging systems (e.g., RabbitMQ) and GitOps tools (e.g., Argo CD)
  • .Proficiency in modern CI/CD tooling, specifically GitLab CI/CD
  • .Expertise in Infrastructure as Code (IaC), preferably Terraform
  • .Demonstrated experience managing large-scale observability platforms like Datadog

.Ideal candidat

  • eAn Evolution Driver: Possesses a strong internal drive and the conviction to push for continuous, significant improvements and strategically refine the status quo of existing processes and infrastructure
  • .Strategic Communicator: A great communicator who is skilled at listening to the needs of engineering teams, translating those needs into technical roadmaps, and then successfully persuading other engineers and management that their ideas are worth investing in
  • .Platform-Focused: Experienced in building internal developer platforms (IDPs) and services, focusing on APIs and tooling that enable developers to deploy and manage their services reliably and independently
  • .Technical innovation: Acts as a force multiplier by bringing fresh ideas, challenging conventions, and raising the technical bar across the entire organization
.