Join us.

Engineering
London

Lead Site Reliability Engineer

With over 35 nationalities and a range of backgrounds represented in our Benevolent team, we aim to build an inclusive environment where our people can bring their authentic selves to work, be respected for who they are and the exceptional work they do. We welcome and actively encourage applications from all sections of society and are committed to offering equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, marital, domestic or civil partnership status, sexual orientation, gender identity, parental status, disability, age, citizenship, or any other basis. We see our diversity as an asset as we tackle challenging problems that bridge the gap between drug discovery and technology.

Apply

The Role

As the Lead Site Reliability Engineer, you will build a team around you and have line management responsibilities whilst remaining hands-on and steering the direction of Benevolent’s cutting-edge infrastructure. You will lead the team of up to seven engineers building and maintaining cloud and Kubernetes-based platforms that form the foundation of our drug discovery pipeline. You must be a strong communicator who can lead by example and guide your team to deliver robust, secure and reliable infrastructure solutions.

Your team will work alongside other infrastructure squads to promote industry best practices and ensure the software is resilient enough for our scientists to rely upon. You will also be adding your input into diverse areas such as cloud services, container technologies, authentication, network topology, sharded databases, scalable web services, interfaces to external data sources and APIs.

Primary Responsibilities

  • Co-ownership of the overall Benevolent cloud architecture.
  • Ownership of the company's site reliability goals, formulation of objectives in alignment with high-level organisation strategy.
  • Approving the defined targets for SLOs and SLIs. Participation in the negotiations to define SLAs.
  • Driving large-scale infrastructure projects to delivery through coordination with engineering and security teams in order to achieve a common goal.
  • Incident response management. Ownership of incident response and disaster recovery policies.
  • Influencing the direction of infrastructure technology advancements. Designing around challenges associated with large-scale distributed systems and driving the harmonisation of technology support layer to promote reuse across the organisation.
  • Conceiving and driving infrastructure solutions to achieve business continuity goals
  • Constantly refining processes and working practices to remove obstacles and empower engineering teams to supply our users with ample infrastructure solutions.
  • Designing infrastructure solutions and maintaining specification.

We are looking for someone with

  • Evidence of creative thinking and problem solving, confidently applying novel strategies to move projects to important decision points quickly and efficiently.
  • Excellent oral and written communication skills e.g. can tailor the complexity of communications as and when required, whilst maintaining clarity of communication.
  • Ability to work under pressure, manage different projects and deliver to defined timelines.
  • Experience successfully leading a Site Reliability, DevOps or engineering team with excellent communication skills and the ability to forge productive relationships and collaborations both internally and externally.
  • Excellent understanding of AWS and Kubernetes. Knowledge of scalability challenges associated with containers, distributed systems and large-scale web applications.
  • Experience with programming languages(any, bonus points for Python/Java/Go/C++).
  • Comfortable with availability out of working hours in the event of a high severity incident.
  • Experience with monitoring and alerting solutions(for example Grafana/Prometheus).
  • Extensive knowledge of cloud networking architecture, cloud operations, automation and orchestration.
  • Good knowledge of network protocols and components such as BGP, TCP, HTTP/S and Load Balancing.

We share a passion for being part of a mission that matters, and we are always looking for curious and collaborative people who share our values and want to be part of our journey.  If that sounds like a fit for you, hit the apply button and join us.

About us

BenevolentAI (AMS: BAI) is a leading, clinical-stage AI-enabled drug discovery and development company listed on the Euronext Amsterdam stock exchange. Through the combined capabilities of its AI platform, scientific expertise, and wet-lab facilities, BenevolentAI is well-positioned to deliver novel drug candidates with a higher probability of clinical success than those developed using traditional methods. The Benevolent Platform™ powers a growing in-house pipeline of 13 named drug programmes and over 10 exploratory programmes, and it maintains successful collaborations with AstraZeneca, as well as leading research and charitable institutions. BenevolentAI is headquartered in London, with a research facility in Cambridge (UK) and a further office in New York.

Want to do a little more research before you apply?

Head over to our Glassdoor page to learn about our benefits, culture and to find out what our team thinks about life at Benevolent. You can also find out more about us on LinkedIn and Twitter.

Apply
Important Note

Our team will only contact you from the domain @benevolent.ai. If you receive a suspicious contact request, please email hello@benevolent.ai. Thank you.