Computing 3-step theory of suicide factor scores from Veterans Health Administration clinical progress notes

Abstract: Background: Literature on how to translate information extracted from clinical progress notes into numeric scores for 3-step theory of suicide (3ST) factors is nonexistent. We determined which scoring option would best discriminate between patients who will attempt or die by suicide and patients with neither suicidal ideation nor attempts, and we tested hypotheses related to the 3ST. Methods: We used terminology-driven natural language processing (NLP) to extract information from Veterans Health Administration (VHA) clinical progress notes. Counts of those extractions served as input to evaluate candidate scoring options for each 3ST factor (psychological pain, hopelessness, connectedness, capability for suicide). Logistic regression models adjusted for common demographic characteristics were used to test the 3ST hypotheses. Results: Optimal contrasts between groups were obtained with P - A for psychological pain, hopelessness, and capability for suicide, and A/(P + 1) for connectedness, where P and A, respectively, indicate the patient-level number of extractions indicating presence and absence of the factor. Limitations: Additional research is necessary to verify whether our conclusions hold in a cohort that is more reflective of the general VHA population. Conclusion: Terminology-driven 3ST factor scores discriminate patients who attempt or die by suicide from patients without suicidal ideation or attempts. Our results corroborate the validity of the 3ST for VHA patients.

Read the full article
Report a problem with this article

Related articles

  • More for Researchers

    Identifying opioid relapse during COVID-19 using natural language processing of nationwide Veterans Health Administration electronic medical record data

    Abstract: Novel and automated means of opioid use and relapse risk detection are needed. Unstructured electronic medical record data, including written progress notes, can be mined for clinically relevant information, including the presence of substance use and relapse-critical markers of risk and recovery from opioid use disorder (OUD). In this study, we used natural language processing (NLP) to automate the extraction of opioid relapses, and the timing of these occurrences, from veteran patients' electronic medical record. We then demonstrated the utility of our NLP tool via analysis of pre-/post-COVID-19 opioid relapse trends among veterans with OUD. For this demonstration, we analyzed data from 107,606 veterans OUD enrolled in Veterans Health Administration, comparing a pandemic-exposed cohort (n = 53,803; January 2019-March 2021) to a matched prepandemic cohort (n = 53,803; October 2017-December 2019). The recall of our NLP tool was 75% and our precision was 94%, demonstrating moderate sensitivity and excellent specificity. Using the NLP tool, we found that the odds of opioid relapse postpandemic onset were proportionally higher compared to prepandemic trends, despite patients having fewer mental health encounters from which to derive instances of relapse postpandemic onset. In this research application of the tool, and as hypothesized, we found that opioid relapse risk was elevated postpandemic. The application of NLP Methods: to identify and monitor relapse risk holds promise for future surveillance, risk prevention, and clinical outcome research.