Leticia Cuellar-Hengartner

Los Alamos National Laboratory

Journal article Open access Peer reviewed

High dimensional predictions of suicide risk in 4.2 million US Veterans using ensemble transfer learning

by Sayera Dhaubhadel, Kumkum Ganguly, Ruy M Ribeiro, Judith D Cohn, James M Hyman, Nicolas W Hengartner, Beauty Kolade, Anna Singley, Tanmoy Bhattacharya and Patrick Finley ... (30 authors)

Published 01/20/2024

Scientific Reports, 14, 1, 1793 - 19

We present an ensemble transfer learning method to predict suicide from Veterans Affairs (VA) electronic medical records (EMR). A diverse set of base models was trained to predict a binary outcome constructed from reported suicide, suicide attempt, and overdose diagnoses with varying choices of study design and prediction methodology. Each model used twenty cross-sectional and 190 longitudinal variables observed in eight time intervals covering 7.5 years prior to the time of prediction. Ensembles of seven base models were created and fine-tuned with ten variables expected to change with study design and outcome definition in order to predict suicide and the combined outcome in a prospective cohort. The ensemble models achieved c-statistics of 0.73 on 2-year suicide risk and 0.83 on the combined outcome when predicting on a prospective cohort of approximately 4.2 M veterans. The ensembles rely on nonlinear base models trained using a matched retrospective nested case-control (Rcc) study cohort and show good calibration across a diversity of subgroups, including risk strata, age, sex, race, and level of healthcare utilization. In addition, a linear Rcc base model provided a rich set of biological predictors, including indicators of suicide, substance use disorder, mental health diagnoses and treatments, hypoxia and vascular damage, and demographics.
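For readers unfamiliar with the c-statistic reported in the abstract above, the following is a minimal sketch of how it can be computed for a binary outcome: it is the fraction of case/control pairs in which the case receives the higher risk score, with ties counted as one half. The function name `c_statistic` and the toy scores are illustrative assumptions, not the authors' code or data.

```python
def c_statistic(scores, outcomes):
    """Concordance (c-statistic) for binary outcomes.

    scores:   predicted risk scores, one per subject
    outcomes: 1 for a case, 0 for a control
    """
    cases = [s for s, y in zip(scores, outcomes) if y == 1]
    controls = [s for s, y in zip(scores, outcomes) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")

    concordant = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                concordant += 1.0   # case ranked above control
            elif case_score == control_score:
                concordant += 0.5   # tie counts as half
    return concordant / (len(cases) * len(controls))


# Toy data, made up for illustration only.
toy_scores = [0.9, 0.4, 0.6, 0.2, 0.6]
toy_outcomes = [1, 0, 1, 0, 0]
toy_c = c_statistic(toy_scores, toy_outcomes)
```

This pairwise loop is quadratic in the cohort size, so it is only meant to make the definition concrete; a cohort of millions would call for a rank-based computation instead.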

Journal article Open access

Assessing the impact of human mobility to predict regional excess death in Ecuador

by Nicolas W. Hengartner, Irene Torres, Ethan Romero-Severson, Riya Mahesh, Nathaniel Robert Ortega, Ruian Ke and Leticia Cuellar-Hengartner

Published 12/2022

Scientific Reports, 12, 1, 370

Journal article Open access

Excess deaths reveal the true spatial, temporal and demographic impact of COVID-19 on mortality in Ecuador

by Leticia Cuellar, Irene Torres, Ethan Romero-Severson, Riya Mahesh, Nathaniel Robert Ortega, Sarah Ann Pungitore, Nicolas W. Hengartner and Ruian Ke

Published 02/2022

International Journal of Epidemiology, 51, 1, 54-62

Journal article Open access

Excess deaths reveal unequal impact of COVID-19 in Ecuador

by Leticia Cuellar-Hengartner, Irene Torres, Ethan Romero-Severson, Riya Mahesh, Nathaniel Robert Ortega, Sarah Ann Pungitore, Ruian Ke and Nicolas W. Hengartner

Published 09/28/2021

BMJ Global Health, 6, 9, e006446

Journal article Open access Peer reviewed

CAT: computer aided triage improving upon the Bayes risk through epsilon-refusal triage rules

by Nicolas Hengartner, Leticia Cuellar, Xiao-Cheng Wu, Georgia Tourassi, John Qiu, Blair Christian and Tanmoy Bhattacharya

Published 12/21/2018

BMC Bioinformatics, 19, Suppl 18, 485

Background: Manual extraction of information from electronic pathology (epath) reports to populate the Surveillance, Epidemiology, and End Results (SEER) database is labor intensive. Systematizing the data extraction automatically using machine-learning (ML) and natural language processing (NLP) is desirable to reduce the human labor required to populate the SEER database and to improve the timeliness of the data. This enables scaling up registry efficiency and collection of new data elements. To ensure the integrity, quality, and continuity of the SEER data, the misclassification error of ML and NLP algorithms needs to be negligible. Current algorithms fail to achieve the precision of human experts, who can bring additional information to their assessments. Differences in registry format and the desire to develop a common information extraction platform further complicate the ML/NLP tasks. The purpose of our study is to develop triage rules to partially automate registry workflow to improve the precision of the auto-extracted information.

Results: This paper presents a mathematical framework to improve the precision of a classifier beyond that of the Bayes classifier by selectively classifying items that are most likely to be correct. This results in a triage rule that only classifies a subset of the items. We characterize the optimal triage rule and demonstrate its usefulness in the problem of classifying cancer site from electronic pathology reports to achieve a desired precision.

Conclusions: From the mathematical formalism, we propose a heuristic estimate for the triage rule based on post-processing the soft-max output from standard machine learning algorithms. We show, in test cases, that the triage rule significantly improves the classification accuracy.
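The idea of a triage rule that post-processes soft-max output can be illustrated with a simple confidence threshold: an item is classified only when its largest class probability reaches a cutoff, and is otherwise refused and left for manual review. This is a minimal sketch under that assumption, not the optimal epsilon-refusal rule characterized in the paper; the function name `triage`, the threshold value, and the toy probabilities are all hypothetical.

```python
def triage(probs, threshold):
    """Return a predicted class index per item, or None when the item is refused.

    probs:     list of per-item class-probability lists (e.g. soft-max output)
    threshold: minimum top probability required to auto-classify (assumed cutoff)
    """
    labels = []
    for row in probs:
        top = max(row)
        # Classify only confident items; refuse the rest for human review.
        labels.append(row.index(top) if top >= threshold else None)
    return labels


# Toy soft-max outputs and true labels, made up for illustration only.
toy_probs = [
    [0.95, 0.03, 0.02],
    [0.40, 0.35, 0.25],
    [0.10, 0.85, 0.05],
    [0.50, 0.45, 0.05],
]
toy_truth = [0, 1, 1, 1]

labels = triage(toy_probs, threshold=0.8)
accepted = [(p, t) for p, t in zip(labels, toy_truth) if p is not None]

# Fraction of items that were auto-classified, and accuracy on that subset.
coverage = len(accepted) / len(labels)
accuracy_accepted = sum(p == t for p, t in accepted) / len(accepted)

# Accuracy when every item is forced into its most probable class.
forced = [row.index(max(row)) for row in toy_probs]
accuracy_all = sum(p == t for p, t in zip(forced, toy_truth)) / len(toy_truth)
```

On this toy input the rule trades coverage for precision: fewer items are classified automatically, but the ones that are classified are more often correct than when every item is forced into a class.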

Journal article Open access Peer reviewed

Identifying Security Checkpoints Locations to Protect the Major U.S. Urban Areas

by Leticia Cuellar-Hengartner, Daniel Watkins, Deborah Ann Kubicek, Erick Rodriguez and Phillip David Stroud

Published 09/01/2015

Homeland Security Affairs, 11
