Among the key data-specific drivers of differentiation and value are:
-
Data quality: Data that can be delivered in a form that is ready for analytics to be performed, including requiring minimal scrubbing/cleaning
-
Data robustness: Data that is reliably accurate/complete across each of the data dimensions available
-
Data content: Valuable data assets are typically differentiated along one or more content-related dimensions, including data that is:
-
Unique/scarce: Data includes measurements of a unique treatment or response variable (e.g., scarce/specialized genomic data), allowing researchers to test new/innovative hypotheses
-
Real time: Data is up to date, allowing researchers to test novel hypotheses in real time
-
Deep: Data measures a large and diverse patient population, allowing researchers to obtain greater statistical power in their analyses
-
Broad: Data measures numerous variables per patient, enabling researchers to perform multidimensional analyses
-
Data breadth may be supported by the integration of a provider’s core data with other internal and external data sources (e.g., cross-referenced with other vendor partners’ data), enabling researchers to have a more comprehensive view of a patient
-
Data breadth may be impacted by the level of anonymization, which depends on to what degree patient consent has been obtained
-
Longitudinal: Data repeatedly measures a patient population across a time interval (e.g., before/during/after a treatment), allowing researchers to estimate treatment effects across a patient’s care journey
The importance of these drivers is illustrated by the variation in commercial value observed for various patient records.1 For example, episodic patient data can sell for approximately $75 per record, while an individual’s genomics data can sell for about $1,300 per record. Each of these patient records may be worth two to three times more, depending on levels of quality, robustness, detail or degree of integration with other datasets.
Data-specific drivers of differentiation and value can be further supported by vendor-specific drivers related to provider investments and strategy, including therapeutic area expertise (e.g., vendor specialization), customer service/training (e.g., quality of support staff and ability to address customer concerns) and competitive pricing. Each of these vendor-specific drivers is an additional area that providers can focus on to drive incremental value for their customers.
Data monetization in oncology
The relative value proposition of rich datasets and the challenges associated with limited access to quality data have varying levels of applicability across therapeutic areas. The opportunity for data monetization is particularly applicable in the oncology space due to:
-
The complexity of cancer care, including the variability of treatment plans and the variable efficacy of treatments across patient groups (e.g., patients with certain biomarkers and certain cancer types)
-
The high costs associated with cancer treatments and clinical trials that stakeholders seek to manage/control
-
The duration and the number of patient touchpoints during cancer treatment that generate large amounts of data
The field of biomarker research is a salient example illustrating the opportunity for data monetization among oncology providers. Biomarker discovery continues to be an active area for research across therapeutic areas (including chronic pain, cardiovascular and metabolic diseases, Alzheimer’s) and uses (e.g., diagnostic, prognostic, predictive, pharmacodynamic).
Biomarkers are used in cancer treatment to provide more targeted therapies and treatment strategies. The use of biomarkers in oncology clinical trials has continued to accelerate, accounting for approximately three-fourths of new trial starts in 2022 (see Figure 5).