Medical data: the key to unlocking the true potential of AI in healthcare

It is well documented that the lack of representation in biomedical research is leading to a data gap that can no longer be overlooked if we are to avoid exacerbating existing health inequalities in the age of digital health.

Advances in machine learning (ML) techniques are allowing the scientific community to unlock the potential of biomedical data and extract valuable insights. Yet amidst the hope sits a certain uncomfortable reality: not everyone is set to benefit from these advances. At the heart of innovation in healthcare lies the datasets used to train the algorithms, such as data from scientific literature, clinical trials, omics, and patient real world data. These datasets are the lifeblood of new technologies, and are extremely meaningful. Yet, they have significant shortcomings, since the majority of medical research is conducted on white and predominantly male populations of European descent. This lack of diversity in data has serious consequences for medical care, as the products discovered through use of this data may not benefit everyone.

As a data-driven artificial intelligence company, BenevolentAI recognises the value of having access to diverse datasets and has launched the Data Diversity Initiative (DDI) to address the challenge, raise awareness and work with stakeholders in the wider scientific and research community on possible solutions. At our inaugural DDI event in November 2019, our panel of experts discussed why we must drastically improve the way we design, collect and process biomedical research to ensure that health data fulfills its true potential.

Firstly, the panel discussed the need to ask the right questions when designing research studies, in particular: does the study represent the patient population accurately? The rich diversity of the UK’s NHS clinical population represents huge untapped potential for increasing diversity in data, however, Jackie Hunter of BenevolentAI (CE, Clinical and Strategic Partnerships) argues that this is often not fully embraced, especially in clinical trials.

When it comes to collecting diverse data, Dawn Duhaney, Product Manager at the Wellcome Trust, highlighted the need to build trust and increase transparency in the research recruitment process. People from diverse backgrounds may not trust medical or research institutions enough to engage with them, so work needs to be done around engaging with underrepresented groups and communicating how companies or institutions plan to use their data, and for what purpose.

This leads to another critical challenge: health data’s poor interoperability. Clinical medical records on diverse populations exist to a certain extent, however this data is held in siloed repositories such as electronic medical records, laboratory and imaging systems and physician notes. In fact, the World Health Organisation estimates that less than 20% of medical data is available in a form that ML and AI can ingest and learn from. The highly fragmented nature of medical data makes it very difficult and time consuming to access, share or combine. Improving the way we process data by improving data interoperability would more easily allow for health technology innovators to merge clinical medical records, thereby enabling access to a more diverse data pool.

Some countries have seen more coordinated efforts to enable data accessibility. As Diane Harbison, CEO of Decipher Analytics explains, in Scotland electronic health records have been used for longer, enabling the access to more longitudinal patient data since the Scotland healthcare board uses similar systems to track patients across the country. These coordinated efforts make it easier to access high quality datasets and maximise the chance to produce research that will have a positive impact on patient health. While Scotland does not have the most diverse patient population, this is a model which could be imitated elsewhere in the world.

We must be cognisant of the uncomfortable reality surrounding the application of AI in healthcare: if we feed new technologies imbalanced datasets, we will create an imbalance in those who benefit from tech-powered innovation. To ensure transferability and equality of medical treatment, we must improve the way we design, collect and process biomedical research.

More Posts

You Might Also Like

Baricitinib, first identified by BenevolentAI as a COVID-19 treatment, is granted emergency use in India in response to its escalating crisis
Baricitinib - first identified by BenevolentAI as a potential COVID-19 treatment - is accelerated for use in hospitalised patients in India following a continued surge in cases and fatalities.
May 5, 2021
A New Era in Target Discovery: Collaborating with AstraZeneca on CKD and IPF
Finding the right target underpins the success of the entire drug discovery process. Learn how BenevolentAI’s collaboration with AstraZeneca is making a difference in CKD and IPF.
Apr 23, 2021
Data from Eli Lilly’s COV-BARRIER trial shows baricitinib reduced deaths in hospitalised COVID-19 patients by 38%
The latest data published in Eli Lilly’s Phase 3 randomised, double-blind, placebo-controlled study (COV-BARRIER) shows the largest clinical effect reported to date for a reduction in mortality in the COVID-19 patient population
Apr 8, 2021
BenevolentAI named as one of Fierce Medtech’s Fierce 15 of 2020
BenevolentAI was selected as one of the most promising private companies in the industry by Fierce Medtech in its Fierce 15 2020 list.
Mar 8, 2021
Tech Nation Visa: the gateway to world-leading UK tech jobs
Drawing attention to the Tech Nation Visa, a great initiative that enables the brightest international talent to live and work in the UK.
Feb 19, 2021
BenevolentAI announces first patient dosed in its Atopic Dermatitis clinical trial
A molecule designed and developed by BenevolentAI to treat mild to moderate Atopic Dermatitis has entered clinical trials.
Feb 11, 2021