Site Reliability Engineer

London, England, United Kingdom · Engineering


At Curve, we believe that an SRE is someone attracted by the blurring lines between development and operations, with strong fundamentals in both worlds: someone not scared by the complexity of code or by the configuration nuances of an operating system.
We’re a small team but growing quickly, and we intend to keep growing while keeping the emphasis on the “Engineering” part. This is an exceptional opportunity for you to come on board and shape the future of this system.

Scaling this resiliently across the millions of transactions happening globally requires a sophisticated microservice architecture and more than the usual big data lip service. And that's just for starters. Providing customer spending insights and battling fraud will require world-leading machine learning techniques.

Role: Tasks & Responsibilities include

  • Analysing, planning and maintaining production systems on AWS as they scale in capacity and complexity
  • Helping to define internal and external SLOs and SLAs
  • NOT doing routine administration BUT engineering an automated solution!
  • Working with development teams and management to define an auditable and compliant production system
  • Participating in a 24/7 on-call rotation, responding to system problems and emergencies

What we offer

This is a unique opportunity. And real. At Curve the SRE team is a core part of Engineering. We are very small but we are involved in the design and the scalability of every feature developed. We are not working hidden in the background: we are doing distributed systems engineering every day, whilst designing a PCI-compliant Kubernetes cluster in a well-funded “one to watch” fintech startup with zero legacy. You will be one of the very few engineers who are doing this.



  • You have 2+ years' experience as a DevOps / SRE engineer on innovative applications
  • You are experienced with Infrastructure as Code (Terraform, CloudFormation)
  • You are experienced with a modern programming language (Java/Python/Node.js/C++)
  • You are experienced with Cloud Native solutions deployed to AWS


  • Open-source contributions
  • Knowledge of the principles of databases and distributed systems
  • Computer Science degree
  • Experience with Kubernetes/cluster schedulers
  • Experience with data analysis systems
  • Knowledge of Android, iOS and mobile application pipelines

Core competencies / person profile:

  • Excellent troubleshooting and problem-solving skills
  • Enthusiastic team player
  • Not scared by complexity, with a healthy “can-do” attitude
  • Personal interest in reading and studying about distributed systems and system reliability
  • We believe everyone has ideas to contribute to our objective of continuous improvement, so you will be expected to take ownership and bring ideas to the table, and also inspire others in the team to do the same
  • An unwavering ability to embrace a positive culture of blameless post-mortems, admitting mistakes and continuous improvement


Perks & Benefits

  • Monthly health & wellbeing budget for gym, etc.
  • Learning & Development annual budget
  • Supper & Taxis home should you work late
  • Work from home
  • Ride to Work Scheme
  • Season Ticket Loan
  • ‘Lunch Fridays’ and ‘Friday Drinks’
