TY - JOUR
T1 - Quality of ethnicity data within Scottish health records and implications of misclassification for ethnic inequalities in severe COVID-19: a national linked data study.
AU - Amele, Sarah
AU - McCabe, Ronan
AU - Kibuchi, Eliud
AU - Pearce, Anna
AU - Hainey, K
AU - Demou, E
AU - Pattaro, S
AU - Irizar, Patricia
AU - Kapadia, Dharmi
AU - Nazroo, James
AU - Taylor, Harry
AU - Becares, Laia
AU - Buchanan, D
AU - Jayacodi, S
AU - Woolford, L
AU - Simpson, C
AU - Jeffrey, K
AU - Shi, T
AU - Daines, L
AU - Tibble, H
AU - Almaghrabi, F
AU - Sheikh, A
AU - Fagbamigbe, A
AU - Kurdi, A
AU - Robertson, C
AU - Katikireddi, Srinivasa Vittal
PY - 2024/3/1
Y1 - 2024/3/1
N2 - Background We compared the quality of ethnicity coding within the Public Health Scotland Ethnicity Look-up (PHS-EL) dataset, and other National Health Service datasets, with the 2011 Scottish Census. Methods Measures of quality included the level of missingness and misclassification. We examined the impact of misclassification using Cox proportional hazards to compare the risk of severe coronavirus disease (COVID-19) (hospitalization & death) by ethnic group. Results Misclassification within PHS-EL was higher for all minority ethnic groups [12.5 to 69.1%] compared with the White Scottish majority [5.1%] and highest in the White Gypsy/Traveller group [69.1%]. Missingness in PHS-EL was highest among the White Other British group [39%] and lowest among the Pakistani group [17%]. PHS-EL data often underestimated severe COVID-19 risk compared with Census data. e.g. in the White Gypsy/Traveller group the Hazard Ratio (HR) was 1.68 [95% Confidence Intervals (CI): 1.03, 2.74] compared with the White Scottish majority using Census ethnicity data and 0.73 [95% CI: 0.10, 5.15] using PHS-EL data; and HR was 2.03 [95% CI: 1.20, 3.44] in the Census for the Bangladeshi group versus 1.45 [95% CI: 0.75, 2.78] in PHS-EL. Conclusions Poor quality ethnicity coding in health records can bias estimates, thereby threatening monitoring and understanding ethnic inequalities in health.
AB - Background We compared the quality of ethnicity coding within the Public Health Scotland Ethnicity Look-up (PHS-EL) dataset, and other National Health Service datasets, with the 2011 Scottish Census. Methods Measures of quality included the level of missingness and misclassification. We examined the impact of misclassification using Cox proportional hazards to compare the risk of severe coronavirus disease (COVID-19) (hospitalization & death) by ethnic group. Results Misclassification within PHS-EL was higher for all minority ethnic groups [12.5 to 69.1%] compared with the White Scottish majority [5.1%] and highest in the White Gypsy/Traveller group [69.1%]. Missingness in PHS-EL was highest among the White Other British group [39%] and lowest among the Pakistani group [17%]. PHS-EL data often underestimated severe COVID-19 risk compared with Census data. e.g. in the White Gypsy/Traveller group the Hazard Ratio (HR) was 1.68 [95% Confidence Intervals (CI): 1.03, 2.74] compared with the White Scottish majority using Census ethnicity data and 0.73 [95% CI: 0.10, 5.15] using PHS-EL data; and HR was 2.03 [95% CI: 1.20, 3.44] in the Census for the Bangladeshi group versus 1.45 [95% CI: 0.75, 2.78] in PHS-EL. Conclusions Poor quality ethnicity coding in health records can bias estimates, thereby threatening monitoring and understanding ethnic inequalities in health.
KW - COVID-19
KW - ethnicity
KW - quality
UR - http://www.scopus.com/inward/record.url?scp=85186480194&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/bcce77ca-eb6f-3707-ba0d-9173ec1862b6/
U2 - 10.1093/pubmed/fdad196
DO - 10.1093/pubmed/fdad196
M3 - Article
C2 - 37861114
SN - 1741-3842
VL - 46
SP - 116
EP - 122
JO - Journal of Public Health
JF - Journal of Public Health
IS - 1
ER -