Design and analysis of outcomes following SARS-CoV-2 infection in Veterans

Abstract: Background: Understanding how SARS-CoV-2 infection impacts long-term patient outcomes requires identification of comparable persons with and without infection. We report the design and implementation of a matching strategy employed by the Department of Veterans Affairs’ (VA) COVID-19 Observational Research Collaboratory (CORC) to develop comparable cohorts of SARS-CoV-2 infected and uninfected persons for the purpose of inferring potential causative long-term adverse effects of SARS-CoV-2 infection in the Veteran population. Methods: In a retrospective cohort study, we identified VA health care system patients who were and were not infected with SARS-CoV-2 on a rolling monthly basis. We generated matched cohorts within each month utilizing a combination of exact and time-varying propensity score matching based on electronic health record (EHR)-derived covariates that can be confounders or risk factors across a range of outcomes. Results: From an initial pool of 126,689,864 person-months of observation, we generated final matched cohorts of 208,536 Veterans infected between March 2020-April 2021 and 3,014,091 uninfected Veterans. Matched cohorts were well-balanced on all 39 covariates used in matching after excluding patients for: no VA health care utilization; implausible age, weight, or height; living outside of the 50 states or Washington, D.C.; prior SARS-CoV-2 diagnosis per Medicare claims; or lack of a suitable match. Most Veterans in the matched cohort were male (88.3%), non-Hispanic (87.1%), white (67.2%), and living in urban areas (71.5%), with a mean age of 60.6, BMI of 31.3, Gagne comorbidity score of 1.4 and a mean of 2.3 CDC high-risk conditions. The most common diagnoses were hypertension (61.4%), diabetes (34.3%), major depression (32.2%), coronary heart disease (28.5%), PTSD (25.5%), anxiety (22.5%), and chronic kidney disease (22.5%). Conclusion: This successful creation of matched SARS-CoV-2 infected and uninfected patient cohorts from the largest integrated health system in the United States will support cohort studies of outcomes derived from EHRs and sample selection for qualitative interviews and patient surveys. These studies will increase our understanding of the long-term outcomes of Veterans who were infected with SARS-CoV-2.

Read the full article
Report a problem with this article

Related articles

  • More for Researchers

    Identifying opioid relapse during COVID-19 using natural language processing of nationwide Veterans Health Administration electronic medical record data

    Abstract: Novel and automated means of opioid use and relapse risk detection are needed. Unstructured electronic medical record data, including written progress notes, can be mined for clinically relevant information, including the presence of substance use and relapse-critical markers of risk and recovery from opioid use disorder (OUD). In this study, we used natural language processing (NLP) to automate the extraction of opioid relapses, and the timing of these occurrences, from veteran patients' electronic medical record. We then demonstrated the utility of our NLP tool via analysis of pre-/post-COVID-19 opioid relapse trends among veterans with OUD. For this demonstration, we analyzed data from 107,606 veterans OUD enrolled in Veterans Health Administration, comparing a pandemic-exposed cohort (n = 53,803; January 2019-March 2021) to a matched prepandemic cohort (n = 53,803; October 2017-December 2019). The recall of our NLP tool was 75% and our precision was 94%, demonstrating moderate sensitivity and excellent specificity. Using the NLP tool, we found that the odds of opioid relapse postpandemic onset were proportionally higher compared to prepandemic trends, despite patients having fewer mental health encounters from which to derive instances of relapse postpandemic onset. In this research application of the tool, and as hypothesized, we found that opioid relapse risk was elevated postpandemic. The application of NLP Methods: to identify and monitor relapse risk holds promise for future surveillance, risk prevention, and clinical outcome research.