Included in the WHI Investigators' Data is a file indicating which participants have genetic data on dbGaP. (See the dbGaP Data Dictionary).
Genetic and phenotypic data are routinely submitted to dbGaP. Please refer to the dbGaP website for detailed information on what WHI datasets are currently available, instructions on how to access and download data, searchable FAQs, and dbGaP contact information. Note that we can provide only a limited amount of help with questions related to data on dbGaP. For genetic data that are not yet available on dbGaP, or for very limited genetic datasets, data may be available from the WHI Clinical Coordinating Center (CCC) as detailed in our policy for accessing WHI genetic data.
The information below may be useful for understanding the types of WHI participants who were included in various WHI Core GWAS and baseline clinical CVD biomarker projects. For additional information about each study, please go to the individual WHI Study Pages and search for the Study ID number.
Table 1: Approximate # of participants with both GWAS1 and baseline clinical biomarkers2 by WHI study component
WHI Study Population
~ N Ppts3
~ N in MRC4
~ N in LLS5
Dietary Modification (DM) only, DM/CaD trial only, OS
Hormone Trial (HT)
Both E+P and E-Alone
1 – Only ‘dbGaP-eligible’ participants were included in GWAS projects. GWAS projects included here are: SHARe (M5), GARNET (M13), ‘WHIMS+’ (W63), and LLS GWAS (W66). GWAS data from M5, M13, and W63 were included in a GWAS imputation project in 2013, the data from which will be uploaded to dbGaP eventually.
2 – Baseline biomarkers from W54, W58, and W66 are in this table. The ‘standard’ clinical biomarkers include: insulin, glucose, CRP, creatinine, cholesterol, HDL, LDL, triglycerides.
3 – (Ppts = Participants) See Table 2 below and the WHI Study Pages for participant selection criteria.
4 – (MRC = Medical Record Cohort) MRC ppts include HT, African-American, & Hispanic ppts enrolled in WHI Extension II (2010-2015).
5 – (LLS = Long Life Study; also known as the In-person Visit.) Between 2012 and 2013, a subset of MRC ppts were invited to join an in-person data/blood collection project.
Table 2: Approximate numbers with both GWAS* and baseline** clinical biomarkers by WHI study number
Baseline Clinical Biomarkers2
M5 - SHARe1
~8,405 African Americans ~3,602 Hispanic
W54 - SHARe baseline biomarkers
~8,405 African Americans ~3,602 Hispanic
M13 - GARNET
European Americans (EA) HT with CHD, stroke, VTE, diabetes, and matching controls (includes ~1,400 EA from WHIMS4)
W58 – EA HT baseline biomarkers
All EA GARNET controls (~2,208) plus a stratified random selection of EA GARNET cases3 (~859)
W635 – WHIMS+ GWAS
All dbGaP-eligible EA WHIMS ppts who were not included in the GARNET GWAS
All dbGaP-eligible EA WHIMS ppts (includes 1,400 from M13 and 4,661 from W63)
Selected EA HT ppts in neither GARNET nor WHIMS
W666 – LLS GWAS
LLS-eligibles not included in previous GWAS and baseline CVD biomarker projects
W666 – LLS baseline biomarkers
1 – SHARe = SNP Health Association Resource
2 – The ‘standard’ set of baseline clinical biomarkers includes the following: insulin, glucose, CRP, creatinine, cholesterol, HDL, LDL, triglycerides.
3 – Not all GARNET cases were in the W58 Biomarkers project. Some EA HT ppts who were in neither GARNET nor WHIMS were included in W58 to provide a more balanced cross-section of EA HT ppts. ~4,684 GARNET participants have both GWAS and baseline clinical biomarkers.
4 – WHIMS = WHI Memory Study (AS39). All dbGaP-eligible WHIMS participants have GWAS data (from either M13 or W63) and baseline biomarkers from W58.
5 – W63 is the ‘WHIMS +’ GWAS. W63 included WHIMS participants who were not genotyped in GARNET. W63 also included an age-stratified random selection of HT participants to provide a more balanced cross-section of HT participants.
6 – LLS = Long Life Study. Approximately 1,500 of the LLS-eligibles did not have GWAS and baseline biomarkers. W66 funded these measurements.
* GWAS data available for all participants (with sample available)
** Baseline clinical biomarker data are available for all EA, AA, and H participants in SHARe, GARNET, and “WHIMS+” (with sample available)
This flowchart describes the subset of participants with CVD biomarkers and GWAS data.
Table 3: Other large WHI genetic studies
Study Name (ID)
Study population (synopsis)
PAGE I (M6)
Y1: N~31,000 OS/CT
Y1: AIMs, SNPs 6, Metabochip
Y2: SNPs 384a and SNPs 384b
Y1-Y2 available on dbGaP. Note: ~10% of the Y1 participants are not dbGaP eligible.
Y3/4 Metabochip and phenotype data on AA, Hispanics, Asians, and Native Americans was uploaded to dbGaP in 2014.
N~2,230 for sequencing
N~8,900 for replication
Exome Chip for replication
Long Life Study (W64)
Data collected: 7,875
Blood collected: 7,481
Baseline: Clinical biomarkers, GWAS
LLS: CBC, WBC diff, clinical biomarkers