This document can be used to prepare or evaluate feasibility of ancillary study proposals and paper proposals, but is NOT intended for publication.

Breast Cancer Outcome Details (Form 122/130)

File NameData as ofPopulationData collectedOne row perRows
outc_bc_inv.dat2/17/2024CT+OSMain, Ext1, Ext2Outcome Form14,267

This file has one row for every form 122 or 130 recording in situ or invasive breast cancer outcome(s). A participant may have more than one of these forms. Only the first occurrence of each outcome is counted, e.g. Limiting the file to rows where BREASTINV=1 will find a single row for each participant with a invasive breast cancer outcome. Breast Cancers derived solely from a cause of death do not have an associated form 122 or 130 so are not included.

ID - WHI Participant Common ID
Col 1
NMissing
14,2670
ASCSOURCE - Ascertainment Source
Col 2
ValueDescriptionN%
1Local Form 12210
2Central Form 13014,266100
BREASTINV - Breast Cancer Invasive
Col 3
Usage Notes:

Defined as a breast cancer from a Death or the first adjudication where the form 122 or 130 details indicate an invasive cancer. Breast Cancer Invasive is adjudicated for CT and OS participants. WHI adjudicated all reports of breast cancer for a ppt until the first invasive Breast Cancer. i.e. The first Invasive Breast Cancer may not be the first Breast Cancer. This file does not include cancers determined solely by the cause of death.

ValueDescriptionN%
0No00
1Yes11,67681.8
Missing2,59118.2%
BREASTINSITU - Breast Cancer In Situ
Col 4
Usage Notes:

Defined as the first breast cancer where the form 122 or 130 details indicate an in situ cancer. Breast Cancer In Situ is adjudicated for CT and OS participants. Breast Cancer In Situ occurring on or after a Breast Cancer Invasive is not counted.

ValueDescriptionN%
0No00
1Yes2,59118.2
Missing11,67681.8%
ICDCODE - ICD-O-2 site code
Col 5
Usage Notes:

All centrally adjudicated cancers (F130) have an ICD-O-2 code. The only locally adjudicated cancers with a code are those marked as "other" on F122. F122 codes do not include a decimal. See the SEER coding section of the data prep/use document for details. A description of the ICD site codes can be found in the reference file seer_icd_site_codes.dat by merging on the ICDCODE variable.

NMissing
14,2661
BEHAVIOR - Tumor Behavior
Col 6
Usage Notes:

For cancers that have been SEER coded, tumor behavior comes from the 5th digit of the morphology (see MRPHHISTB). For cancers that have not been SEER coded, F130/Question 3 or F122/Question 4 is used.

ValueDescriptionN%
1Invasive11,67681.8
2In Situ2,59118.2
3Borderline00
9Unknown00
RPRTSRC - Reporting source
Col 7

Reporting Source: (Mark only one. If more than one category applies, mark the first applicable category.)

ValueDescriptionN%
1Hospital inpatient5,25136.8
2Hospital outpatient/radiation/chemo, surgical center, clinic8,98563
3Laboratory only including pathology office250.2
4Physician's office/private medical practitioner40
5Nursing/convalescent home/hospice10
6Autopsy only00
7Death certificate only00
Missing10%
DIAGSTAT - Diagnostic confirmation status
Col 8

Diagnostic Confirmation Status: (Mark only one. If more than one category applies, mark the first applicable category.)

ValueDescriptionN%
1Positive histology (pathology)14,13699.1
2Positive exfoliative cytology, no positive histology190.1
3Positive histology (pathology), regional/distant meta. site900.6
4Positive micro confirmation, method not specified10
5Positive laboratory test/marker study10
6Direct visualization w/o microscopic confirm00
7Radiography & other imaging techniques w/o micro confirm110.1
8Clinical diagnosis only (other than 5, 6, 7)80.1
9Unknown if microscopically confirmed00
Missing10%
LATERAL - Laterality
Col 9
ValueDescriptionN%
0Not a paired site00
1Right - origin of primary6,95748.8
2Left - origin of primary7,28251
3Only one side right or left unspecified10
4Bilateral involvement, lateral origin unknown single primary00
5Paired site, no lateral info, midline tumor260.2
Missing10%
MRPHHISTB - Morphology - histology/behavior
Col 10

Morphology: first 5 digits

Usage Notes:

ICD-O-2 codes were used for all cancers except Lymphoma and Leukemia. WHI used ICD-O-3 codes for those cancers. WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details. A description of the histology/behavior codes can be found in the reference file seer_icd_hist_codes.dat by merging on the MRPHHISTB variable.

NMissing
14,2661
GRADING - Morphology - grading
Col 11

Morphology: 6th digit

Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

ValueDescriptionN%
1Well differentiated3,21622.5
2Moderately differentiated5,83940.9
3Poorly differentiated3,28923.1
4Anaplastic6594.6
5T-Cell00
6B-Cell00
7Null cell00
8NK Cell00
9Unknown/not done1,2638.9
Missing10%
SIZE - F130 EOD SEER - Size
Col 12

Evidence of Disease (SEER): digits 1-3

Usage Notes:

When size doesnt make sense, this field is used for other purposes (e.g. Leukemias have no size, this field records AIDS status). Size is usually measured in mm, though it may vary by cancer type. Even for cancers that do have a size, not all values indicate an actual size (e.g. 2 for many cancers means "<=2mm", but for breast cancer it means "Mammographic diagnosis only, tumor not clinically palpable" ). WHI used the SEER 1988, 2nd edition rules for coding cancers. See the SEER coding section of the data prep/use document for details.

NMissingMinMaxMeanStdDev
14,2661099981.105244.713
EXTENSION - F130 EOD SEER - Extension
Col 13

Evidence of Disease (SEER): digits 4-5

Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

NMissingMinMaxMeanStdDev
14,266109910.65813.604
INVOLVE - F130 EOD (SEER) - Lymph node involvement
Col 14

Evidence of Disease (SEER): digit 6

Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

NMissingMinMaxMeanStdDev
14,2661090.8242.009
POSLYMPH - F130 EOD (SEER) - Number of positive lymph nodes
Col 15

Evidence of Disease (SEER): digits 7-8

Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

NMissingMinMaxMeanStdDev
14,266109926.52742.971
NUMLYMPH - F130 EOD (SEER) - Number of lymph nodes examined
Col 16

Evidence of Disease (SEER): digits 9-10

Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

NMissingMinMaxMeanStdDev
14,26610996.39111.71
STAGE - Summary Stage (SEER)
Col 17
Usage Notes:

WHI used SEER 1988, 2nd edition rules when coding cancers, see the SEER coding section of the data prep/use document for details.

ValueDescriptionN%
1In Situ2,58518.1
2Localized8,79861.7
3Regional2,55417.9
4Distant1981.4
9Unknown1300.9
Missing20%
HIST8522 - F130 Breast Histology 8522 Subclass
Col 18

Subclassification for Breast Histology 8522.

Usage Notes:

Not collected on early versions of F130.

ValueDescriptionN%
0Not Applicable8,05056.4
1Ductal in situ plus lobular in situ1391
2Ductal invasive plus lobular in situ3372.4
3Ductal invasive plus lobular invasive7155
4Lobular invasive plus ductal in situ2071.5
5Invasive cancer, ductal and lobular nos3272.3
Missing4,49231.5%
ERASSAY - F130 Estrogen receptor assay
Col 19
ValueDescriptionN%
1Positive10,63174.5
2Negative1,84112.9
3Borderline150.1
8Ordered/Results not available1791.3
9Unknown/Not done1,59711.2
Missing40%
ERDY - F130 Days to Estrogen Assay
Col 20
NMissingMinMaxMeanStdDev
12,4991,768110,5433,977.4972,539.52
ERTYPE - F130 Type of Estrogen Assay
Col 21
ValueDescriptionN%
1fmol/mg protein1210.8
2ICC/IHC11,27879
8Other1431
9Unknown9526.7
Missing1,77312.4%
PRASSAY - Progesterone receptor assay
Col 22
ValueDescriptionN%
1Positive8,89762.4
2Negative3,31023.2
3Borderline530.4
8Ordered/Results not available1871.3
9Unknown/Not done1,81512.7
Missing50%
PRDY - F130 Days to Progesterone receptor assay
Col 23
NMissingMinMaxMeanStdDev
12,2771,990110,5433,972.3312,533.428
PRTYPE - F130 Type of Progesterone receptor assay
Col 24
ValueDescriptionN%
1fmol/mg protein1190.8
2ICC/IHC11,09977.8
8Other1411
9Unknown9146.4
Missing1,99414%
HER2NEU - F130 Her 2/Neu
Col 25
Usage Notes:

Not collected on early versions of F130.

ValueDescriptionN%
1Positive1,2849
2Negative8,10856.8
3Borderline1010.7
8Ordered/Results not available1160.8
9Unknown/Not done2,49917.5
Missing2,15915.1%
HER2NEUDY - F130 Days to HER 2/NEU
Col 26
Usage Notes:

Not collected on early versions of F130.

NMissingMinMaxMeanStdDev
9,5324,735710,5434,355.712,432.418
AGEDX - Age at diagnosis
Col 27
NMissingMinMaxMeanStdDev
14,26705010372.8068.306