Site Reliability Engineer
Stockholm, AB

Come build, innovate, disrupt, and thrive!


KēSTA I.T. is actively seeking a Site Reliability Engineer for an immediate full-time opportunity with our industry-leading client.


Are you on the lookout for a unique career opportunity that offers leadership, responsibility, and the chance to make a significant impact? If you're eager to contribute to a thriving and stable organization while maintaining your confidentiality, continue reading.



Remote (European Union)


The Role

We are seeking a Site Reliability Engineer to support and evolve the operational integrity of a globally distributed platform that streams advanced 3D content to AR/VR devices. This individual will play a central role in strengthening production operations, improving the sustainability of on-call practices, and introducing automation that systematically reduces manual intervention.


You’ll join a reliability-focused team tasked with elevating operational maturity. Early priorities include improving the efficiency and health of the on-call rotation, refining alert quality, and strengthening troubleshooting capabilities. As stability increases, the role expands into forward-looking reliability initiatives including architectural input, production readiness standards, and scalable self-service infrastructure.


This position is best suited for someone who thrives in live production environments, enjoys solving complex distributed systems issues, and consistently looks for ways to convert repetitive operational work into durable automation.


Key Responsibilities

  • Respond to production incidents as part of a structured on-call rotation, ensuring continuous availability of streaming systems and data pipelines operating 24/7
  • Coordinate and drive incident resolution efforts, including detailed post-incident reviews and implementation of preventative improvements
  • Build, refine, and maintain alerting frameworks with a strong signal-to-noise ratio, reducing unnecessary escalations while detecting genuine service degradation early
  • Implement comprehensive observability practices, including metrics, telemetry, structured logging, distributed tracing, and cloud resource tagging
  • Collaborate with engineering teams to define and monitor SLIs and SLOs, manage error budgets, and formalize production-readiness expectations
  • Develop automation, internal tooling, and operational runbooks to eliminate repetitive manual tasks and steadily reduce operational toil
  • Improve system transparency and debuggability across complex distributed environments


Qualifications

  • 3–5+ years of hands-on production operations experience with direct responsibility in on-call rotations and real-time incident response
  • Demonstrated success enhancing operational workflows through tooling improvements, automation, and refined response procedures
  • Strong debugging capabilities across distributed systems in cloud-native environments
  • Experience designing and tuning alert thresholds to reduce alert fatigue while maintaining service reliability
  • Proficiency with observability platforms such as Prometheus, Grafana, and the ELK stack (or similar)
  • Experience operating within cloud environments such as AWS and CoreWeave
  • Familiarity with infrastructure-as-code and container orchestration practices, including Kubernetes, Helm, and Terraform


Benefits & Perks

  • Equity participation program
  • Flexible work schedule within a remote-first environment
  • Generous paid time off
  • 401(k) retirement savings plan
  • Comprehensive Medical, Dental, and Vision coverage
  • Flexible Spending Accounts for healthcare and dependent care
  • Technology allowance
  • Wellness stipend



About KēSTA I.T.:


Our name says it all: KēSTA I.T. (Keys-to-I.T.) and our people are the keys to our success!


KēSTA I.T. is a premier Utah-based technical staffing and consulting services firm. We specialize in temporary and permanent placement of Software, Hardware, Network, Cloud, CRM/ERP, Data, End-User Support, Web, and Executive/Leadership positions on a full-time and consulting basis. If you're interested in a role where top performance is rewarded, personal time is valued, and excellence is demanded at every level, we want to talk to you today!


Where do you want to go? We've got the keys! ~ KēSTA I.T.


WWW.KeSTAIT.COM
