AI Tool Uses Medical Records to Identify High-Risk Pancreatic Cancer Patients

By News Release

 

A deep-learning algorithm has successfully identified people at the highest risk for pancreatic cancer up to three years before diagnosis using solely the patients’ medical records. The research was led by investigators at Harvard Medical School and the University of Copenhagen, in collaboration with VA Boston Healthcare System, Dana-Farber Cancer Institute, and the Harvard T.H. Chan School of Public Health and published in Nature Medicine.

The findings suggest that AI-based population screening could be valuable in finding those at elevated risk for the disease and could expedite the diagnosis of a condition found all too often at advanced stages when treatment is less effective and outcomes are dismal, the researchers said. Pancreatic cancer is one of the deadliest cancers in the world, and its toll projected to increase.

Currently, there are no population-based tools to screen broadly for pancreatic cancer. Those with a family history and certain genetic mutations that predispose them to pancreatic cancer are screened in a targeted fashion. But such targeted screenings can miss other cases that fall outside of those categories, the researchers said.

“One of the most important decisions clinicians face day to day is who is at high risk for a disease, and who would benefit from further testing, which can also mean more invasive and more expensive procedures that carry their own risks,” said study co-senior investigator Chris Sander, faculty member in the Department of Systems Biology in the Blavatnik Institute at HMS. “An AI tool that can zero in on those at highest risk for pancreatic cancer who stand to benefit most from further tests could go a long way toward improving clinical decision-making.”

Applied at scale, Sander added, such an approach could expedite detection of pancreatic cancer, lead to earlier treatment, and improve outcomes and prolong patients’ life spans. “Many types of cancer, especially those hard to identify and treat early, exert a disproportionate toll on patients, families and the healthcare system as a whole,” said study co-senior investigator Søren Brunak, professor of disease systems biology and director of research at the Novo Nordisk Foundation Center for Protein Research at the University of Copenhagen. “AI-based screening is an opportunity to alter the trajectory of pancreatic cancer, an aggressive disease that is notoriously hard to diagnose early and treat promptly when the chances for success are highest.”

In the new study, the AI algorithm was trained on two separate data sets totaling 9 million patient records from Denmark and the United States. The researchers “asked” the AI model to look for telltale signs based on the data contained in the records. Based on combinations of disease codes and the timing of their occurrence, the model was able to predict which patients are likely to develop pancreatic cancer in the future. Notably, many of the symptoms and disease codes were not directly related to or stemming from the pancreas.

The researchers tested different versions of the AI models for their ability to detect people at elevated risk for disease development within different time scales — 6 months, one year, two years, and three years. Overall, each version of the AI algorithm was substantially more accurate at predicting who would develop pancreatic cancer than current population-wide estimates of disease incidence — defined as how often a condition develops in a population over a specific period of time. The researchers said they believe the model is at least as accurate in predicting disease occurrence as are current genetic sequencing tests that are usually available only for a small subset of patients in data sets.

The AI model was trained on the health records of 6.2 million patients from Denmark’s national health system spanning 41 years. Of those patients, 23,985 developed pancreatic cancer over time. During the training, the algorithm discerned patterns indicative of future pancreatic cancer risk based on disease trajectories, that is, whether the patient had certain conditions that occurred in a certain sequence over time.

Unlike other cancers, pancreatic cancer is harder and more expensive to screen and test for. Physicians look mainly at family history and the presence of genetic mutations, which, while important indicators of future risk, often miss many patients. One particular advantage of the AI tool is that it could be used on any and all patients for whom health records and medical history are available, not just in those with known family history or genetic predisposition for the disease. This is especially important, the researchers add, because many patients at high risk may not even be aware of their genetic predisposition or family history.

In the absence of symptoms and without a clear indication that someone is at high risk for pancreatic cancer, clinicians may be understandably cautious to recommend more sophisticated and more expensive testing, such as CT scans, MRI or endoscopic ultrasound. When these tests are used and suspicious lesions discovered, the patient must undergo a procedure to obtain a biopsy. Positioned deep inside the abdomen, the organ is hard to access and easy to provoke and inflame. Its irritability has earned it the moniker “the angry organ.” An AI tool that identifies those at the highest risk for pancreatic cancer would ensure that clinicians test the right population, while sparing others unnecessary testing and additional procedures, the researchers said.

About 44% of people diagnosed in the early stages of pancreatic cancer survive five years after diagnosis, but only 12% of cases are diagnosed that early. The survival rate drops to 2 to 9% in those whose tumors have grown beyond their site of origin, researchers estimate.

“That low survival rate is despite marked advances in surgical techniques, chemotherapy, and immunotherapy,” Sander said. “So, in addition to sophisticated treatments, there is a clear need for better screening, more targeted testing, and earlier diagnosis, and this where the AI-based approach comes in as the first critical step in this continuum.”