Facilitating Harmonization of Variables in Framingham, MESA, ARIC, and REGARDS Studies Through a Metadata Repository.
Pubmed ID: 37850400
Pubmed Central ID: PMC10841164
Journal: Circulation. Cardiovascular quality and outcomes
Publication Date: Nov. 1, 2023
MeSH Terms: Humans, Cohort Studies, Longitudinal Studies, Atherosclerosis, Reproducibility of Results, Metadata
Grants: N01 HC025195, N01 HC095159, U01 NS041588, UL1 TR001079, P30 DK063491, N01 HC095167, N01 HC095161, N01 HC095164, N01 HC095166, N01 HC095160, N01 HC095169, N01 HC095165, N01 HC095168, UL1 TR000040, N01 HC095163, N01 HC095162, HHSN268201700004I, UL1 TR001881, UL1 TR001420, HHSN268201500001I, HHSN268201500001C, HHSN268201500003I, HHSN268201700001I, HHSN268201700003I, HHSN268201700005I, HHSN268201700002I, HHSN268201500003C, HHSN268201700002C, HHSN268201700005C, HHSN268201700001C, HHSN268201700003C, HHSN268201700004C, R01 HL136666, R61 NS120246, R33 NS120246
Authors: Zhao J, Pencina MJ, Stevens LM, Hall JL, Henao R, Mallya P, Hong C, Schibler T, Manchanda V, Economou-Zavlanos N, Wojdyla DM
Cite As: Mallya P, Stevens LM, Zhao J, Hong C, Henao R, Economou-Zavlanos N, Wojdyla DM, Schibler T, Manchanda V, Pencina MJ, Hall JL. Facilitating Harmonization of Variables in Framingham, MESA, ARIC, and REGARDS Studies Through a Metadata Repository. Circ Cardiovasc Qual Outcomes 2023 Nov;16(11):e009938. Epub 2023 Oct 18.
Studies:
Abstract
BACKGROUND: High-quality research in cardiovascular prevention, as in other fields, requires inclusion of a broad range of data sets from different sources. Integrating and harmonizing different data sources are essential to increase generalizability, sample size, and representation of understudied populations-strengthening the evidence for the scientific questions being addressed. METHODS: Here, we describe an effort to build an open-access repository and interactive online portal for researchers to access the metadata and code harmonizing data from 4 well-known cohort studies-the REGARDS (Reasons for Geographic and Racial Differences in Stroke) study, FHS (Framingham Heart Study), MESA (Multi-Ethnic Study of Atherosclerosis), and ARIC (Atherosclerosis Risk in Communities) study. We introduce a methodology and a framework used for preprocessing and harmonizing variables from multiple studies. RESULTS: We provide a real-case study and step-by-step guidance to demonstrate the practical utility of our repository and interactive web page. In addition to our successful development of such an open-access repository and interactive web page, this exercise in harmonizing data from multiple cohort studies has revealed several key themes. These themes include the importance of careful preprocessing and harmonization of variables, the value of creating an open-access repository to facilitate collaboration and reproducibility, and the potential for using harmonized data to address important scientific questions and disparities in cardiovascular disease research. CONCLUSIONS: By integrating and harmonizing these large-scale cohort studies, such a repository may improve the statistical power and representation of understudied cohorts, enabling development and validation of risk prediction models, identification and investigation of risk factors, and creating a platform for racial disparities research. REGISTRATION: URL: https://precision.heart.org/duke-ninds.