Join us.

Engineering

Lead Site Reliability Engineer

With over 35 nationalities and a range of backgrounds represented in our Benevolent team, we aim to build an inclusive environment where our people can bring their authentic selves to work, be respected for who they are and the exceptional work they do. We welcome and actively encourage applications from all sections of society and are committed to offering equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, marital, domestic or civil partnership status, sexual orientation, gender identity, parental status, disability, age, citizenship, or any other basis. We see our diversity as an asset as we tackle challenging problems that bridge the gap between drug discovery and technology.

The Role

As the Lead Site Reliability Engineer, you will build a team around you and have line management responsibilities whilst remaining hands-on and steering the direction of Benevolent’s cutting-edge infrastructure. You will lead a team of up to seven engineers building and maintaining the cloud and Kubernetes-based platforms that form the foundation of our drug discovery pipeline. You must be a strong communicator who can lead by example and guide your team to deliver robust, secure and reliable infrastructure solutions.

Your team will work alongside other infrastructure squads to promote industry best practices and ensure the software is resilient enough for our scientists to rely upon. You will also contribute to diverse areas such as cloud services, container technologies, authentication, network topology, sharded databases, scalable web services, and interfaces to external data sources and APIs.

Primary Responsibilities

  • Co-ownership of the overall Benevolent cloud architecture.
  • Ownership of the company's site reliability goals, and formulation of objectives in alignment with high-level organisational strategy.
  • Approving the defined SLIs and SLO targets. Participating in the negotiations to define SLAs.
  • Driving large-scale infrastructure projects to delivery through coordination with engineering and security teams in order to achieve a common goal.
  • Incident response management. Ownership of incident response and disaster recovery policies.
  • Influencing the direction of infrastructure technology advancements. Designing around challenges associated with large-scale distributed systems and driving the harmonisation of the technology support layer to promote reuse across the organisation.
  • Conceiving and driving infrastructure solutions to achieve business continuity goals.
  • Constantly refining processes and working practices to remove obstacles and empower engineering teams to supply our users with ample infrastructure solutions.
  • Designing infrastructure solutions and maintaining their specifications.

We are looking for someone with

  • Evidence of creative thinking and problem solving, confidently applying novel strategies to move projects to important decision points quickly and efficiently.
  • Excellent oral and written communication skills, e.g. the ability to tailor the complexity of communications as required whilst maintaining clarity.
  • Ability to work under pressure, manage different projects and deliver to defined timelines.
  • Experience successfully leading a Site Reliability, DevOps or engineering team, with excellent communication skills and the ability to forge productive relationships and collaborations both internally and externally.
  • Excellent understanding of AWS and Kubernetes. Knowledge of scalability challenges associated with containers, distributed systems and large-scale web applications.
  • Experience with programming languages (any; bonus points for Python/Java/Go/C++).
  • Comfortable being available outside working hours in the event of a high-severity incident.
  • Experience with monitoring and alerting solutions (for example Grafana/Prometheus).
  • Extensive knowledge of cloud networking architecture, cloud operations, automation and orchestration.
  • Good knowledge of network protocols and components such as BGP, TCP, HTTP/S and load balancing.

We share a passion for being part of a mission that matters, and we are always looking for curious and collaborative people who share our values and want to be part of our journey. If that sounds like a fit for you, hit the apply button and join us.

About us

BenevolentAI unites AI with human expertise to discover new and more effective medicines. Our unique computational R&D platform spans every step of the drug discovery process, powering an in-house pipeline of over 25 drug programmes. We advance our mission to reinvent drug discovery by harnessing the power of a diverse team, rich with different backgrounds, experiences, opinions and personalities. In our offices in London and New York and our research facility in Cambridge (UK), we work in highly collaborative, multidisciplinary teams, harnessing skills across biology, chemistry, engineering, AI, machine learning, informatics, precision medicine and drug discovery.

Want to do a little more research before you apply?

Head over to our Glassdoor page to learn about our benefits and culture, and to find out what our team think about life at Benevolent. You can also find out more about us on LinkedIn and Twitter.

Important Note

Our team will only contact you from the domain @benevolent.ai. If you receive a suspicious contact request, please email hello@benevolent.ai. Thank you.