This document can be used to prepare or evaluate feasibility of ancillary study proposals and paper proposals, but is NOT intended for publication.

Demographics and Study Membership

File NameData as ofPopulationData collectedOne row perRows
dem_ctos_inv.dat3/6/2021CT+OSBaseline, MainParticipant161,808
ID - WHI Participant Common ID
Col 1
NMissing
161,8080
CTFLAG - CT Participant
Col 2
Indicates if a participant has been randomized to one or more of the Clinical Trial components (HRT, CAD, DM).
ValueDescriptionN%
0No93,67657.9
1Yes68,13242.1
HRTFLAG - HRT Participant
Col 3
Indicates if a participant has been randomized to the Hormone Replacement Therapy Trial.
ValueDescriptionN%
0No134,46183.1
1Yes27,34716.9
DMFLAG - DM Participant
Col 4
Indicates if a participant has been randomized to the Dietary Modification Trial.
ValueDescriptionN%
0No112,97369.8
1Yes48,83530.2
CADFLAG - CAD Participant
Col 5
Indicates if a participant has been randomized to the Calcium and Vitamin D Trial.
ValueDescriptionN%
0No125,52677.6
1Yes36,28222.4
OSFLAG - OS Participant
Col 6
Indicates if a participant has been enrolled into the Observational Study.
ValueDescriptionN%
0No68,13242.1
1Yes93,67657.9
EXTFLAG - Enrolled in WHI Extension 1
Col 7
Indicates if a participant is enrolled in the WHI Extension 1
ValueDescriptionN%
0No46,40128.7
1Yes115,40771.3
EXTDAYS - Days since randomization/enrollment to WHI Extension 1 enrollment
Col 8
Days between Main Study randomization and Extension 1 enrollment. The date of the Extension 1 enrollment is the receival date of the participant s consent to be part of the WHI Extension 1.
NMissingMinMaxMeanStdDev
115,40746,4012,1854,7523,056.095405.01
EXTSTARTYR - Years in WHI at start of WHI Extension 1
Col 9
The number of years in WHI at the time the WHI Extension 1 started. This may or may not not be the year the participant consented to the WHI Extension 1. In general a participant will have consented to the Extension 1 prior to the start of the Extension 1, but many also consented after the start of the Extension 1.
NMissingMinMaxMeanStdDev
115,40746,4017129.2881.182
EXT2FLAG - Enrolled in WHI Extension 2
Col 10
Indicates if a participant is enrolled in the WHI Extension 2
ValueDescriptionN%
0No68,24142.2
1Yes93,56757.8
EXT2DAYS - Days since randomization/enrollment to WHI Extension 2 enrollment
Col 11
Days between Main Study randomization and Extension 2 enrollment. The date of the Extension 2 enrollment is the receival date of the participant s Extension 2 consent form.
NMissingMinMaxMeanStdDev
93,56768,2414,1706,7075,031.122404.77
EXT2MRC - Extension 2 MRC Participant
Col 12
Indicates whether the participant is in Extension 2 and is in the Medical Record Cohort.
ValueDescriptionN%
0No139,49286.2
1Yes22,31613.8
EXT2SRC - Extension 2 SRC Participant
Col 13
Indicates whether the participant is in Extension 2 and is in the Self Report Cohort.
ValueDescriptionN%
0No90,55756
1Yes71,25144
DBGAPCONSENT - Current dbGaP consent status
Col 14
ValueDescriptionN%
0No DbGap Consent18,59511.5
1General Research Use117,67572.7
2Non Profit Use Only25,53815.8
AGE - Age at screening
Col 15
Age at screening. Computed from Form 2 birth date.
Usage Notes: Age at screening may differ from the age stratum to which a participant was randomized, due to participants requesting a correction to their Form 2 birth date after randomization. Once a participant was randomized to a specific age stratum, they remained in that stratum despite corrections to their birth date. Therefore, the "Age stratum at randomization or enrollment" variable may differ from the age reflected in this variable.
NMissingMinMaxMeanStdDev
161,8080498163.2387.236
AGER - Age group at screening
Col 16
Computed from Form 2 birth date. Age categorized into three 10-year intervals (<50-59, 60-69 and 70-79+).
Usage Notes: Categorization of this variable may differ from the "Age stratum at randomization or enrollment" variable due to participants requesting a correction to their Form 2 birth date after randomization/enrollment.
ValueDescriptionN%
1<50-5953,55933.1
260-6972,58944.9
370-79+35,66022
AGESTRAT - Age stratum at randomization or enrollment
Col 17
Usage Notes: Based on the birth date reported by the participant at the time of randomization or enrollment. Categorization of the Age at Screening variable may differ due to participants requesting a correction to their Form 2 birth date after randomization.
ValueDescriptionN%
150 to 5421,57013.3
255 to 5931,98319.8
360 to 6972,58844.9
470 to 7935,66722
REGION - U. S. Region at randomization or enrollment
Col 18
Residence at the time of randomization or enrollment. Four categories based on US Census definition.
ValueDescriptionN%
1Northeast36,91322.8
2South41,91925.9
3Midwest35,56322
4West47,41329.3
LANG - Current preferred language
Col 19
ValueDescriptionN%
1English160,14999
2Spanish1,6591
EDUC - Education at screening
Col 20
ValueDescriptionN%
1Didn't go to school1310.1
10Master's Degree23,66714.6
11Doctoral Degree (Ph.D,M.D.,J.D.,etc.)3,9602.4
2Grade school (1-4 years)5990.4
3Grade school (5-8 years)1,9351.2
4Some high school (9-11 years)5,9793.7
5High school diploma or GED27,62417.1
6Vocational or training school16,42910.2
7Some college or Associate Degree44,48027.5
8College graduate or Baccalaureate Degree17,56610.9
9Some post-graduate or professional18,22211.3
Missing1,2160.8%
INCOME - Family Income at screening
Col 21
ValueDescriptionN%
1Less than $10,0006,9374.3
2$10,000 to $19,99918,49911.4
3$20,000 to $34,99936,66522.7
4$35,000 to $49,99930,91219.1
5$50,000 to $74,99929,94818.5
6$75,000 to $99,99913,6138.4
7$100,000 to $149,9999,4375.8
8$150,000 or more4,9233
9Don't know4,3842.7
Missing6,4904%
HRTARM - HRT Arm
Col 22
Hormone Replacement Therapy study arm to which the participant was randomized
ValueDescriptionN%
0Not randomized to HRT134,46183.1
1E-alone intervention5,3103.3
2E-alone control5,4293.4
3E+P intervention8,5065.3
4E+P control8,1025
DMARM - DM Arm
Col 23
Dietary Modification study arm to which the participant was randomized
ValueDescriptionN%
0Not randomized to DM112,97369.8
1Intervention19,54112.1
2Control29,29418.1
CADARM - CaD Arm
Col 24
Calcium and vitamin D study arm to which the participant was randomized
ValueDescriptionN%
0Not randomized to CaD125,52677.6
1Intervention18,17611.2
2Control18,10611.2
CADDAYS - Days since CT randomization to CaD randomization
Col 25
NMissingMinMaxMeanStdDev
36,282125,526169833402.549103.873
BMDFLAG - Rand to BMD
Col 26
Indicates if the participant was randomized or enrolled at a bone density clinic and was not randomized or enrolled as a part of enhanced recruitment for that clinic.
ValueDescriptionN%
0No150,78893.2
1Yes11,0206.8
SHAREPPT - SHARe Analytic Sample Flag
Col 27
Indicates if a participant is part of the WHI SHARe (SNP Health Association Resource)
ValueDescriptionN%
0No149,80092.6
1Yes12,0087.4
LATREGION - Latitude (degrees N) of CC at CT randomization/OS enrollment
Col 28
Usage Notes: Region of residence at the time of randomization or enrollment, based on the latitude of the responsible clinical center.
ValueDescriptionN%
1Southern: < 35 degrees N51,26531.7
2Middle: 35-40 degrees N44,15027.3
3Northern: > 40 degrees N66,39341
WATTSCAT - Watts ((J/s) per m2) of CC at CT randomization/OS enrollment
Col 29
Usage Notes: The WATT is a unit of solar irradiance and measures the daily UVB flux reaching the earth, within the wavelength range necessary for vitamin D synthesis. The information is in: Lubin D, Jensen EH, Gies HP. Global surface ultraviolet radiation climatology from TOMS and ERBE data. Journal of Geophysical Research 1998;103 (D20):26061-26091. Categories are based on the WATTs of the clinical center at the time of randomization or enrollment.
ValueDescriptionN%
10.4 - 0.537,47723.2
20.735,01921.6
31.029,93218.5
41.436,46722.5
51.5-1.922,91314.2
LANGLEYSCAT - Langleys (g-cal per cm2) of CC at CT randomization/OS enrollment
Col 30
Usage Notes: The Langley is a unit of solar irradiance and relates to the amount that reaches a given area of the earth’s surface. The information is from national weather data on total solar irradiance in the United States and is adapted from Garland and Garland (Do sunlight and vitamin D reduce the likelihood of colon cancer? Int J Epidemiol 1980;9:227-31.) Categories are based on the Langleys of the clinical center at the time of randomization or enrollment.
ValueDescriptionN%
1300-32545,68028.2
235034,04321
3375-38019,06511.8
4400-43027,55117
5475-50035,46921.9
LATREGIONCAD - Latitude (degrees N) of CC at CaD randomization
Col 31
Usage Notes: Region of residence at the time of CaD trial randomization, based on the latitude of the responsible clinical center.
ValueDescriptionN%
1Southern: < 35 degrees N10,8786.7
2Middle: 35-40 degrees N10,0436.2
3Northern: > 40 degrees N15,3619.5
Missing125,52677.6%
WATTSCATCAD - Watts ((J/s) per m2) of CC at CaD randomization
Col 32
Usage Notes: The WATT is a unit of solar irradiance and measures the daily UVB flux reaching the earth, within the wavelength range necessary for vitamin D synthesis. The information is in: Lubin D, Jensen EH, Gies HP. Global surface ultraviolet radiation climatology from TOMS and ERBE data. Journal of Geophysical Research 1998;103 (D20):26061-26091. Categories are based on the WATTs of the clinical center at the time of CaD trial randomization.
ValueDescriptionN%
10.4 - 0.58,2575.1
20.78,4665.2
31.06,4744
41.48,2035.1
51.5-1.94,8823
Missing125,52677.6%
LANGLEYSCATCAD - Langleys (g-cal per cm2) of CC at CaD randomization
Col 33
Usage Notes: The Langley is a unit of solar irradiance and relates to the amount that reaches a given area of the earth’s surface. The information is from national weather data on total solar irradiance in the United States and is adapted from Garland and Garland (Do sunlight and vitamin D reduce the likelihood of colon cancer? Int J Epidemiol 1980;9:227-31.) Categories are based on the Langleys of the clinical center at the time of CaD trial randomization.
ValueDescriptionN%
4400-4306,0333.7
5475-5007,7114.8
1300-32510,7176.6
23507,8004.8
3375-3804,0212.5
Missing125,52677.6%