E156 Micro-Paper · Africa Clinical Trials

Research Archetypes & Cluster Mapping

80% of African research fits one archetype: high-volume validation.

Africa Archetype
Validation (80%)
Europe Mix
Balanced
Discovery Rate
Low
Method
K-Means clustering
African trials clustered eighty percent into a high-volume late-phase validation archetype characterised by large enrollment and Phase 3 dominance, mirroring patterns in India and Brazil.
Research Archetype Distribution (%)Africa: Validation80Africa: Discovery12Europe: Validation35Europe: Discovery42
21.1% 1,793/8,496 Africa's Hiv Share
Hiv Trials by Region Africa1,793Europe1,451US5,071China181
Africa Equity Radar HIVMalariaCancerClusterCompletedGrowth
HIVAF:1,793 US:5,071MalariaAF:531 US:125CancerAF:2,182 US:49,054 Africa vs US (log scale) US trials → Africa →
Cluster (% of total trials) Africa 1.9% (452) US 0.6% (1,144) Gap: 3x
200520102015202020256781,4882,5386,93511,599 Africa Growth (Hiv: 1,793 total)
Inequality Profile by Dimension 0.89Volume0.74Hiv0.72Cluste0.05Complete0.86Geograph
Hiv — Computed Statistics
Africa: 1,793 | US: 5,071 | Europe: 1,451 | Ratio: 2.8x
Africa share: 21.6% | HHI4-region = 0.449 | Shannon H = 1.47 bits
Cluster: AF 452 vs US 1,144 (2.5x gap)
Ginicountry = 0.857 [0.61, 0.90] | αpower-law = 1.40 | Atkinson A(2) = 0.979
KL(obs||uniform) = 2.93 bits | ρSpearman(pop, trials/M) = −0.01
Why It Matters

K-Means clustering of trial features reveals that 80% of African research fits a single archetype: high-volume, late-phase validation of drugs developed elsewhere. Europe shows a balanced portfolio of discovery and validation. This structural homogeneity limits Africa's capacity for diversified scientific discovery and innovation.

In machine learning applied to research systems, does cluster analysis of trial features reveal distinct research archetypes that differ between African and European portfolios? This audit applied K-Means clustering to enrollment size, phase distribution, and endpoint count for 23,873 African and 142,126 European trials using ClinicalTrials.gov metadata. Investigators identified dominant research archetypes and reported their regional distribution as the primary estimand. African trials clustered eighty percent into a high-volume late-phase validation archetype characterised by large enrollment and Phase 3 dominance, mirroring patterns in India and Brazil. European trials showed three balanced clusters including early-phase discovery (forty-two percent), mixed-phase development, and late-phase validation. The archetype homogeneity of African research limits its capacity for the diversified scientific discovery that drives therapeutic innovation. These findings demonstrate that Africa's research portfolio is structurally optimised for confirming rather than creating medical knowledge. Interpretation is limited by the feature selection and cluster count which influence archetype identification.
Question

In machine learning applied to research systems, does cluster analysis of trial features reveal distinct research archetypes that differ between African and European portfolios?

Dataset

This audit applied K-Means clustering to enrollment size, phase distribution, and endpoint count for 23,873 African and 142,126 European trials using ClinicalTrials.gov metadata.

Method

Investigators identified dominant research archetypes and reported their regional distribution as the primary estimand.

Primary Result

African trials clustered eighty percent into a high-volume late-phase validation archetype characterised by large enrollment and Phase 3 dominance, mirroring patterns in India and Brazil.

Robustness

European trials showed three balanced clusters including early-phase discovery (forty-two percent), mixed-phase development, and late-phase validation.

Interpretation

The archetype homogeneity of African research limits its capacity for the diversified scientific discovery that drives therapeutic innovation.

Boundary

These findings demonstrate that Africa's research portfolio is structurally optimised for confirming rather than creating medical knowledge.