Runware

Staff DevOps Engineer

About the Company

Runware is a cutting-edge technology company based in the United Kingdom, dedicated to building the API layer for the next generation of AI products. Our platform provides teams with fast, reliable access to real-time inference across thousands of models through a single flexible API. We enable customers to build and scale media generation products with better performance, lower cost, and less operational complexity.

Behind this is an infrastructure platform built for speed, reliability, and GPU scale. New models launch constantly, and customer traffic can grow quickly. Performance matters at every layer. Our mission is to empower innovators to create and deploy AI solutions that transform industries and improve lives.

At Runware, we value a culture of innovation, collaboration, and continuous learning. We believe in giving our team members the autonomy to make decisions, the freedom to experiment, and the support to grow in their careers. This role matters to our company's growth because it will help us build, operate, and scale the infrastructure behind our global AI inference platform, ensuring that our systems are faster, more resilient, and easier to operate.

Key Responsibilities

As a Staff DevOps Engineer at Runware, you will play a critical role in designing, building, and operating the systems that power real-time AI inference across large-scale GPU fleets and a global production platform. Your work will directly shape how quickly we can launch new models, scale customer traffic, recover from failures, and deliver low-latency AI experiences to millions of users.

Requirements & Qualifications

Must-Have

Nice-to-Have

Technical Skills

Cloud & Infrastructure

We use a combination of bare-metal servers, serverless and containerised production systems to power our platform. Our infrastructure is built for speed, reliability, and GPU scale, and we are looking for someone who can help us optimize and scale it further.

Databases

We use a variety of databases to store and manage data for our platform, including relational and NoSQL databases. Our ideal candidate will have experience with database design, deployment, and operations.

CI/CD & Automation

We are looking for someone who can help us automate the hard parts of infrastructure operations, from provisioning and configuration through to CI/CD, deployment safety, progressive rollouts and rapid rollback.

What We Offer

We offer a competitive annual salary ranging from 160000 to 200000, depending on experience. In addition to a competitive salary, we also offer a range of benefits, including:

We believe in giving our team members the autonomy to make decisions, the freedom to experiment, and the support to grow in their careers. If you are looking for a challenging and rewarding role that will help you grow as a professional, we encourage you to apply.

Frequently Asked Questions

What is the remote work setup like?

We are a fully remote company, and we believe in giving our team members the flexibility to work from anywhere in the world. We use a range of tools to facilitate communication and collaboration, including Slack, Zoom and GitHub.

What is the hiring process and timeline?

We are looking to fill this role as soon as possible, and we will be conducting interviews on a rolling basis. The hiring process typically takes 2-3 weeks, and we will be in touch with you throughout the process to keep you updated on your status.

What is the team size and tech stack?

We are a small but growing team, and we are looking for someone who can help us scale our infrastructure and operations. Our tech stack includes a range of tools and technologies, including AWS, GCP, Azure, Docker, Kubernetes, Jenkins, GitLab CI/CD, Ansible and Terraform.

Apply Now