Join us.


Bioinformatics Data Scientist · London

The Company

BenevolentAI unites technology with human intelligence to re-engineer drug discovery and deliver life-changing medicines. We have developed the Benevolent Platform®, a drug discovery platform built on powerful data foundations with state of the art machine learning and AI technology. Our technology empowers scientists to decipher the vast and complex code underlying human biology, find new ways to treat disease and personalise medicines to patients. Benevolent has active in-house R&D drug programmes in disease areas such as neurodegeneration, immunology, oncology and inflammation and has research and commercial collaborations with leading pharmaceutical and research organisations. The company is headquartered in London with a research facility in Cambridge (UK) and a further office in New York.

Who we are

We are an eclectic bunch at Benevolent, united by our belief that innovative thinking and purposeful technology can truly change outcomes for the better. Our mission is to re-engineer drug discovery and deliver life-changing medicines for patients in need and we do this by applying AI, machine learning and other advanced technologies to reinvent the ways drugs are discovered and developed. We strive to bring together unique skills and perspectives across biology, chemistry, engineering, AI research, informatics, precision medicine and drug discovery.

The Role

We are seeking bioinformatics data scientists to join our London teams and apply advanced bioinformatics methods and data analytics to real world drug discovery challenges. We have identified 3 projects across our Product Areas at BenevolentAI, which each give successful candidates the opportunity to learn more about leveraging biomedical data, developing targeting identification processes and establishing precision medicine workflows, all within an industry setting.

Primary Responsibilities

Project 1: Knowledge

A significant amount of the pre-clinical data accumulated by BenevolentAI from the development of biological assays to characterise drug targets and evaluate potential therapeutic molecules. Databases dedicated to manage bioassay data could provide a rich resource for drug target identification. As such, the aim of this project is to build an automatic data processing pipeline to standardise assay data ingestion into BenevolentAI’s Knowledge Graph. The opportunities offered by this project include:  

  • Learning and applying FAIR data principles (Findable, Accessible, Interoperable and Reusable)
  • Inspecting assay data content features to understand how this information can support the Drug Discovery process
  • Designing and implementing assay data models and data standardisation workflows
  • Developing data engineering skills to support the integration of assay data into the BenevolentAI’s Knowledge Graph

Project 2: Target Identification

At BenevolentAI, our teams have developed performant models by collecting and encoding biological and chemical information around biomedical entities (e.g. genes, drugs, diseases, mechanisms). We are looking for an intern to help build a catalogue of discrete latent vector representations in our feature store; a central repository of features for biomedical entities in the problems we face day-to-day. This set of representations will help facilitate rapid experimentation of new models and improve existing workflows used for target identification. Due to the nature of the work, you will be expected to liaise with various teams across the business including chemistry, target identification, and precision medicine.  You will be supported by bioinformaticians, data scientists and engineers. The opportunities offered by this project include: 

  • Using the existing data in our internal feature store to train and evaluate machine learning models
  • Developing new features based on bio/chemoinformatic and machine learning approaches
  • Supporting your team, you will implement classification models consuming biomedical data
  • Performing benchmarking, validation, and evaluation of the created models and the features they use.

Project 3: Precision Medicine

The Precision Medicine team at Benevolent is applying latent variable models (LVMs) to transcriptomics data in order to define pathologically distinct patient subgroups and their key gene regulatory mechanisms. Within the Precision Medicine Omics pipeline, the latent variables are analysed in the context of a priori defined genesets, representing known biological processes or metabolic pathways. Given the availability of high resolution single cell transcriptomics data (scRNA-seq) for many human tissues and diseases, the team is planning to use these data and identify de-novo genesets, i.e. groups of genes representing disease-specific mechanisms that would provide an enriched view of the results of our AI models in the context of drug discovery. As an intern, you will be helping the Precision Medicine team to tackle this goal. You will be supported by bioinformatics / AI data scientists and will interact with engineers and Translational Medicine Scientists. This opportunities offered by this project include: 

  • Exploring the data available at the Human Cell Atlas or analogous databases and select appropriate datasets for one disease of interest at Benevolent; 
  • Running LVMs, adapting and/or developing other AI models to identify disease-specific mechanisms; 
  • Collaborating with biology experts to analyse and evaluate the results. 
  • Contributing to Python codebases to process and analyse the data

We are looking for someone with

  • A PhD (or should be currently working towards completion of a PhD) in the field of computational biology, bioinformatics or related field with an interest in data science (or vice versa);
  • Familiarity of biomedical datasets and bioinformatics techniques referred to in project descriptions
  • Familiarity in coding with Python and commonly used informatics, data science, machine learning packages
  • Good communication skills who enjoys working in a collaborative team environment


Applications close on February 12th, 2021. In addition to your CV, please send us a cover letter.

Terms and Conditions

Salary: We offer remuneration for the duration of the internships. However, remuneration remains subject to the University regulations & T&C’s - should they not comply with our policy, we reserve the right to review the salary allowance.

Together, we envision a world in which no disease goes untreated. If you are benevolent, curious, want to tackle real world problems and are willing to embrace new ideas, hit that ‘apply’ button and join us.

Important Note

It has come to our attention that unfortunately fraudsters have been falsely offering jobs and set-up fake interviews under the guise of being a Benevolent recruiter. We therefore advise you to be very stringent about who is reaching out to you. Any enquiry from our team will only be made via the email or an email from this domain Please flag any suspicious contact request to