ResearchPublications

Modeling the potential impact of missing race and ethnicity data in infectious disease surveillance systems on disparity measures: Scenario analysis of different imputation strategies
Abstract

BACKGROUND: Monitoring progress toward population health equity goals requires developing robust disparity indicators. However, surveillance data gaps that result in undercounting racial and ethnic minority groups might influence the observed disparity measures.

OBJECTIVE: This study aimed to assess the impact of missing race and ethnicity data in surveillance systems on disparity measures.

METHODS: We explored variations in missing race and ethnicity information in reported annual chlamydia and gonorrhea diagnoses in the United States from 2007 to 2018 by state, year, reported sex, and infection. For diagnoses with incomplete demographic information in 2018, we estimated disparity measures (relative rate ratio and rate difference) with 5 imputation scenarios compared with the base case (no adjustments). The 5 scenarios used the racial and ethnic distribution of chlamydia or gonorrhea diagnoses in the same state, chlamydia or gonorrhea diagnoses in neighboring states, chlamydia or gonorrhea diagnoses within the geographic region, HIV diagnoses, and syphilis diagnoses.

RESULTS: In 2018, a total of 31.93% (560,551/1,755,510) of chlamydia and 22.11% (128,790/582,475) of gonorrhea diagnoses had missing race and ethnicity information. Missingness differed by infection type but not by reported sex. Missing race and ethnicity information varied widely across states and times (range across state-years: from 0.0% to 96.2%). The rate ratio remained similar in the imputation scenarios, although the rate difference differed nationally and in some states.

CONCLUSIONS: We found that missing race and ethnicity information affects measured disparities, which is important to consider when interpreting disparity metrics. Addressing missing information in surveillance systems requires system-level solutions, such as collecting more complete laboratory data, improving the linkage of data systems, and designing more efficient data collection procedures. As a short-term solution, local public health agencies can adapt these imputation scenarios to their aggregate data to adjust surveillance data for use in population indicators of health equity.

Download PDF

Full citation:
Ansari B, Hart-Malloy R, Rosenberg ES, Trigg M, Martin EG (2022).
Modeling the potential impact of missing race and ethnicity data in infectious disease surveillance systems on disparity measures: Scenario analysis of different imputation strategies
JMIR Public Health and Surveillance, 8 (11), e38037. doi: 10.2196/38037. PMCID: PMC9685511.