Accelerating Clinical Trials

Developing AI and analytics to understand the drivers of study or clinical trial efficiency.

Overview

Clinical trials (CT) are considered the gold standard method of studying whether new drugs or interventions are safe and effective in humans. Still, there are several known challenges associated with trials that introduce inefficiencies and make it difficult, expensive, and slow for researchers to execute trials and determine outcomes. Our aim is to develop Machine Learning (ML) and Artificial Intelligence (AI) techniques to help transform key steps in clinical trials, accelerating their efficiency through improved design, recruitment, or engagement. We do this by identifying or creating novel composite features or biomarkers that are important to trial efficiency. Subsequently, we develop models to study the impact of these features in practice using real-world data. We also aim to identify and extract new features that span across domains (e.g. social and behavioral features that slow down trial execution or skew outcomes). Our research can be summarized in three parts:

Knowledge Representation and Organization – Defining novel representations that integrate clinical trial data with other relevant data (social, behavioral, clinical, etc.) to support, for example, a composite 360° view of an individual patient, cohort or study.
Information Extraction and Discovery of New Features – Developing advanced analytics and language models that can help extract new biomarkers, features or patterns from multiple sources and types of data.
Advanced Analytics and Machine Learning – Developing AI models that leverage our representation and extraction tools and lead to a better understanding of trial inefficiencies (e.g., recruitment of a more diverse trial population, avoiding dropouts).

Technologies

Together with IBM’s Deep Search platform we are working on a number of ML and AI technologies to address the unmet needs of studies and trials:

Knowledge Graphs and Ontologies
Natural Language Processing Tools and Language-based Models
Statistical Graph Networks (Inference on Bayesian Networks and - Functional Graphical Models)
Predictive Modeling and Feature Selection, Deep Search Frameworks and Graph Neural Networks

Selected Assets

RCT-extract: extractor of more than 80 entities in clinical trials (available via Deep search)
Health & Social Person-centric Ontology: an ontology connecting the clinical and social determinants of health around a 360-view of an individual (available via HSPO)
UMLS Tagger: a UMLS-based annotator to detect concepts from text (available via Deep search)
Relation Extraction Module: classifier detecting semantic relations in text (available via Deep search)

Research Collaborations

Cleveland Clinic Foundation Our team is working together with experts at the Cleveland Clinic as part of the Accelerate Discovery Ecosystem for Healthcare. The goal is helping to identify, extract and understand the impact of new biomarker features (such as the social and behavioral determinants of health) in clinical trials.
Scaling up Proactive Digital Integrated Care (SEURO) SEURO is a EU funded Horizon 2020 project, where a team from IBM research Dublin will investigate novel techniques to examine outputs of clinical trials for predicting the long-term impact of adopting new technologies. We are developing new tools to identify the success of clinical trials (by measure of engagement or clinical endpoints). Tools developed through this work can contribute to risk stratification for patient selection and understanding features that may be linked to engagement.
Human Behavior Change Project Our teams have also been working together with behavioral scientists, computer scientists, and systems architects on the Human Behavior Change Project (HBCP). Through this we have leveraged Natural Language Processing and Machine Learning to extract information from intervention evaluation reports and to answer key questions about the evidence.

Publications

Accelerating the Discovery of Semantic Associations from Medical Literature: Mining Relations Between Diseases and Symptoms
- - Alberto Purpura
  - Francesca Bonin
  - et al.
- 2022
- EMNLP 2022
Discovering Associations between Social Determinants and Health Outcomes: Merging Knowledge Graphs from Literature and Electronic Health Data
- - Yoonyoung Park
  - Natasha Mulligan
  - et al.
- 2021
- AMIA Annual Symposium 2021
Social Determinant Trends of COVID-19: an analysis using Knowledge Graphs from Published Evidence and Online Trends
- - Martin Gleize
  - Natasha Mulligan
  - et al.
- 2021
- MIE 2021
Exploring the Social Drivers of Health during a Pandemic: Leveraging Knowledge Graphs and Population Trends in COVID-19
- - Joao Bettencourt-Silva
  - Natasha Mulligan
  - et al.
- 2020
- EFMI STC 2020
Knowledge Extraction and Prediction from Behavior Science Randomized Controlled Trials: A Case Study in Smoking Cessation
- - Francesca Bonin
  - Martin Gleize
  - et al.
- 2020
- AMIA Annual Symposium 2020
HBCP corpus: A new resource for the analysis of behaviour change intervention reports
- - Francesca Bonin
  - Ailbhe N. Finnerty
  - et al.
- 2020
- LREC 2020
Discovering new Social Determinants of Health concepts from Unstructured Data: Framework and Evaluation
- - Joao Bettencourt-Silva
  - Natasha Mulligan
  - et al.
- 2020
- MIE 2020

Contributors

Deep Search

Accelerating Clinical Trials

Overview

Technologies

Selected Assets

Research Collaborations

Publications

Accelerating the Discovery of Semantic Associations from Medical Literature: Mining Relations Between Diseases and Symptoms

Discovering Associations between Social Determinants and Health Outcomes: Merging Knowledge Graphs from Literature and Electronic Health Data

Social Determinant Trends of COVID-19: an analysis using Knowledge Graphs from Published Evidence and Online Trends

Exploring the Social Drivers of Health during a Pandemic: Leveraging Knowledge Graphs and Population Trends in COVID-19

Knowledge Extraction and Prediction from Behavior Science Randomized Controlled Trials: A Case Study in Smoking Cessation

HBCP corpus: A new resource for the analysis of behaviour change intervention reports

Discovering new Social Determinants of Health concepts from Unstructured Data: Framework and Evaluation

Contributors

Joao Bettencourt

Natasha Mulligan

Tobia Boschi

Alessandra Pascale

Vanessa Lopez Garcia

Deep Search

Overview

Technologies

Selected Assets

Research Collaborations

Publications

Contributors

Related projects