An AI model predicting acute kidney injury works, but not without some tweaking

The model identified AKI 48 hours in advance, allowing ample time for clinicians to intervene and provide treatment.

10:39 AM

Author | Clarissa Piatek

scientist examining a kidney
Getty Images

In 2019, Google AI subsidiary DeepMind used a large dataset of patient records from the Veterans Health Administration to develop a predictive model for acute kidney injury—a potentially fatal condition whose prognosis improves the earlier a treatment intervention is administered. The DeepMind model purported to predict AKI 48 hours in advance, allowing ample lead time for clinicians to intervene and administer treatment.

The study team reviewed electronic health record data over a five-year period from more than 700,000 individuals.

“This is a phenomenal model because it can predict AKI up to 48 hours in advance, in a continuous manner, and has the best model performance compared to all previously published models,” said Jie Cao, a Ph.D., student in the Department of Computational Medicine and Bioinformatics. Cao is a researcher in the ML4LHS Lab, run by Karandeep Singh, M.D., MMSc., assistant professor in the Department of Learning Health Sciences and the Biomedical & Clinical Informatics Lab run by Kayvan Najarian, Ph.D., professor in the Department of Computational Medicine and Bioinformatics. All are of the University of Michigan. However, “concerns were raised about the generalizability of a model like this given the predominantly male [VA] population that it was trained on,” Cao added.

This led Cao and her colleagues to evaluate the model’s generalizability in a non-VA, more sex-balanced population. Their findings have been published in Nature Machine Intelligence. 

The researchers reconstructed aspects of the DeepMind model, then trained and validated this model on two cohorts: one comprising 278,813 VA hospitalizations (from 118 VA hospitals) and the other 165,359 U-M hospitalizations. Not surprisingly, given the 94% male population with which the original model was developed, the reconstructed model performed worse for female patients in both cohorts.

To mitigate the model’s sex-based discrepancies, researchers updated the model with data from U-M’s more sex-balanced patient population, which extended the original model from 160 decision trees to 170. This small extension improved performance in the U-M cohort both overall and between sexes.

Like Podcasts? Add the Michigan Medicine News Break on Spotify, Apple Podcasts or anywhere you listen to podcasts.

“The extended model was successful at U-M. It used the VA model as the backbone, added information from U-M, and the final product worked well for the U-M patient population,” said Cao, lead author of the paper. “When researchers would like to benefit from the rich information contained in the original model and do not want to build a new local model from scratch, our study is a good example of how ‘fine-tuning’ could work when the original model was not trained on a diverse population,” she explained.

When the extended model was applied to VA patients, however, the discrepancies in model performance between males and females actually worsened.

“This finding surprised us to some extent,” said Cao, “but it is also reasonable and helps us understand the problem better. Difference in patient characteristics is one common factor contributing to model performance discrepancy. By matching the female patients at two different health systems and still finding discrepant model performance, we actually show that difference in patient characteristics wasn’t the only reason contributing to model performance discrepancy.”

Lower performance of the extended model in female VA patients, then, was not a factor of patient characteristics or low sample size, but likely attributable to variables such as differences in practice patterns between males and females at the VA.

Overall, the study demonstrated the value of updating existing models with data from the population to which the model will be applied.  

“If a predictive model is to be taken out of one healthcare system and applied to another, the population the model was trained on is often different from the population it is going to be applied to. Even if the training population is diverse, we could observe a drop in model performance if nothing is done,” said Cao. “Our ‘extended model’ approach is to provide a solution to partially address this issue.”

To achieve peak performance, a model would, in theory, be applied only to a population matching the population it was trained on. But this is often not the case in practice.

“In the real world,” said Cao, this approach is “infeasible due to limited resources, time, expertise, etc. Our ‘extended model’ strategy is a workaround in these scenarios.”

This research is significant, said Cao, because it shows “the complexity of discrepancies in model performance in subgroups that cannot be explained simply on the basis of sample size.” It also offers “a potential strategy to mitigate the generalizability issue,” she said, and, finally, it demonstrates “the importance of reproducing and evaluating artificial intelligence studies.”

Live your healthiest life: Get tips from top experts weekly. Subscribe to the Michigan Health blog newsletter

Headlines from the frontlines: The power of scientific discovery harnessed and delivered to your inbox every week. Subscribe to the Michigan Health Lab blog newsletter


More Articles About: All Research Topics Kidney Disease Kidney Failure Preventative health and wellness Future Think Hospitals & Centers Health Care Quality Health Tech
Health Lab word mark overlaying blue cells
Health Lab

Explore a variety of health care news & stories by visiting the Health Lab home page for more articles.

Media Contact Public Relations

Department of Communication at Michigan Medicine

[email protected]

734-764-2220

Stay Informed

Want top health & research news weekly? Sign up for Health Lab’s newsletters today!

Subscribe
Featured News & Stories deflated orange balloon on white background close up on the part that's blown into
Health Lab
The truth about penile implants
Erectile dysfunction is a common issue in the United States, but most people aren't aware of a penile prosthesis being a potential, life changing, option. An urologist, who performs these procedures, discusses what to look for in a surgeon, why high volume centers are best and dispels misinformation on the topic.
man walking on treadmill in open space with person in blue scrubs and scrub hat
Health Lab
Less than 1% of COPD patients receive pulmonary rehab following hospitalization
Rehabilitation can help patients who are recovering from an injury or illness. However, new data suggests patients hospitalized for chronic obstructive pulmonary disorder, or COPD, aren't receiving it.
rx bottle orange see through person paying in background
Health Lab
New law regulating out-of-pocket drug spending saves cancer patients more than $7,000 a year
A study finds that efforts to cap prescription drug spending are yielding significant out-of-pocket savings for these patients.
emergency room front doors with sign and blurred motion of people and a vechile
Health Lab
Black stroke patients arrive later to hospitals, EMS less likely to notify
Research found that it took approximately 28 minutes longer for a Black patient to be brought in for emergency care after displaying symptoms of a stroke.
blue cells close up with orangey see through circle going in and an explosion of bright yellow orange to the far left in that circle with purple circles floating in background
Health Lab
Researchers identify factor that drives prostate cancer-causing genes
Researchers have uncovered a key reason why a typically normal protein goes awry and fuels cancer. They found the protein NSD2 alters the function of the androgen receptor, an important regulator of normal prostate development.
microscope
Health Lab
Antibody targeting CD38 enzyme shows positive impact in treating systemic sclerosis
Using the Ab38 antibody to target the CD38 enzyme could be a new gateway to treat and prevent fibrosis for patients with systemic sclerosis.