CT.gov Structural Missingness
2026-03-29 | full-registry ct.gov audit | plots, figures, and e156 bundle
Series
E156 Micro-Paper

CT.gov Structural Missingness

A 156-word micro-paper on what disappears from the registry before results reporting is even considered.

Publication links
IPD statements
Descriptions
Locations

Paper

The registry can look complete enough to browse while still lacking the fields needed for interpretation, replication, and scrutiny.

Reading note

What information disappears from ClinicalTrials.gov even before one asks whether results were posted? We analysed the March 29, 2026 full-registry snapshot and quantified structural missingness in publication links, IPD statements, detailed descriptions, locations, and outcome fields across sponsor groups. The source universe included 578,109 studies, allowing field-level omission rates and sponsor-specific sparsity patterns to be estimated without sampling. Across the full registry, 63.4 percent of records lacked publication links, 48.3 percent lacked IPD sharing statements, 32.7 percent lacked detailed descriptions, and 10.2 percent lacked locations. Structural sparsity was not evenly distributed: industry remained heavily affected, NIH had the highest average hiddenness score among named sponsor classes, and UNKNOWN mostly reflected malformed metadata. Missingness therefore extends beyond results reporting into the descriptive fields needed for interpretation, replication, and scrutiny, with the loss being less context for appraisal, replication, accountability, and public scrutiny across therapeutic areas. These metrics capture registry-visible information loss rather than proven intent to conceal.

No publication link
63.4%
All studies
No IPD statement
48.3%
All studies
No description
32.7%
All studies
No locations
10.2%
All studies