Forget the Score: How You Take A Memory Test May Predict Dementia

Credit: Getty Images For Unsplash+

It’s Not About Getting It Right, It’s About How You Get It Wrong

In A Nutshell

How precisely someone recalls the exact wording of a short story, not just the general idea, is associated with significantly lower odds of developing dementia years later.
Analyzing the types of mistakes people make on cognitive tests, rather than just their scores, dramatically improves the ability to identify who will develop dementia.
Machine learning models trained on detailed error data correctly flagged more than 99% of people who later developed dementia, within the study’s dataset, though external validation is still needed.
A formal diagnosis of mild cognitive impairment was a surprisingly weak predictor, correctly identifying only 22.5% of people who eventually developed dementia.

Most people would not think twice about remembering the gist of a story but fumbling the exact words. A sweeping new study says they should. How precisely someone recalls the specific phrasing of a short story, not just the general idea, may be a potentially powerful early signal that dementia is approaching, detectable years before any formal diagnosis.

Researchers analyzing decades of data from the Framingham Heart Study found that people who scored higher on a measure of word-for-word story recall had about 76% lower odds (a statistical measure, not direct risk reduction) of developing dementia over the following years compared to those who struggled with exact details. A broader measure of overall cognitive ability, captured through the specific types of mistakes people made during a battery of brain tests, was linked to roughly 85% lower odds for each standard unit of improvement.

When researchers fed this error data into machine learning models, those models correctly flagged more than 99% of the 222 people who would eventually receive a dementia diagnosis, within this dataset using cross-validation, though the researchers caution that independent validation would be needed before any clinical application.

Published in the Journal of the International Neuropsychological Society, the findings argue that the standard approach to cognitive testing, boiling performance down to a single score on each task, misses critical clues hiding in plain sight. Patterns of errors, thinking strategies, and even partial successes on memory tasks all carry diagnostic weight that conventional scoring leaves on the table.

How Memory Tests Miss What Matters

Most cognitive tests work like a school exam: a patient completes a task, and a clinician records a summary score. Did the person name 25 out of 30 pictures correctly? Could they repeat seven digits backward? These scores judge whether memory, attention, or language abilities fall within a normal range. But a single number cannot capture how someone arrived at the answer.

Two people might score identically on a story recall test, yet one remembered the exact wording while the other filled in gaps with related but inaccurate details. Under traditional scoring, those differences vanish.

Developed by the late neuropsychologist Edith Kaplan, the Boston Process Approach was designed to fix this problem. Rather than tallying correct answers, clinicians trained in this method document the types of errors a person makes. Did they mix up words from different parts of the test? Did they describe an object they could not name, circling around the right word without landing on it? Each behavior offers a window into which brain systems might be struggling.

Since 2005, the Framingham Heart Study, a landmark community health study that began in 1948, has been collecting this detailed error-and-strategy data using an extensive scoring system and weekly team meetings to maintain consistency. The result is an unusually rich dataset: 169 distinct variables drawn from a battery of well-known brain tests.

memory test infographic — Word-for-word story recall and error patterns on memory tests may predict dementia years before diagnosis, research shows. (Image generated by StudyFinds)

What Mistakes on Memory Tests Reveal About Dementia Risk

Researchers analyzed data from 2,363 participants who were free of dementia at the time of their cognitive assessment. These were primarily older adults with an average age of about 71.5 years; 54.2% were female and 96.5% identified as non-Hispanic White.

Using a specialized modeling technique suited to uneven, count-based data, the team uncovered hidden patterns in how errors cluster together. One model met all predetermined quality standards, revealing a general cognitive factor, essentially a person’s overall tendency to produce correct or strategic responses and avoid errors, plus seven specific factors tied to individual tests. Across subgroups, the model performed largely consistently, whether comparing younger versus older participants, men versus women, or those with more versus less education, though not perfectly identical in all respects.

Among the 2,311 participants with at least one follow-up evaluation, 222, about 9.6%, were eventually diagnosed with dementia over a median of 5.2 years. Two factors stood out as protective: higher scores on the general cognitive factor and stronger word-for-word story recall.

One counterintuitive finding deserves attention. “Near miss” errors on story memory, cases where a person recalled something conceptually related to the correct detail but got it slightly wrong, appeared to reflect some preserved memory capacity according to the model. Even getting it almost right was a meaningful signal. Prior research suggests that gist-based memory may function as a backup system when precise recall begins to fail, a system that is itself compromised in Alzheimer’s disease.

Machine Learning Models Caught 99% of Future Dementia Cases Within This Dataset

When the team trained machine learning models to predict dementia, models using only traditional summary scores correctly identified 95.9% of people who eventually developed dementia. Adding detailed error data pushed that figure to 99.4%, within this dataset using cross-validation. The researchers stress that multiple independent samples would need to validate these models before they could be considered reliable for clinical use.

One particularly telling comparison: a clinical diagnosis of mild cognitive impairment at the time of testing was a surprisingly poor standalone marker. Only 46.3% of those participants later developed dementia, and the diagnosis correctly identified just 22.5% of all eventual cases.

Mistakes on cognitive tests are not noise to be discarded. They are patterned data that clinicians have been systematically setting aside for decades. Paying closer attention to how people think, not just how well, may turn out to be one of the most practical tools available in the push for earlier detection.

Disclaimer: This article is based on a published peer-reviewed study and is intended for informational purposes only. It should not be taken as medical advice. If you have concerns about memory or cognitive health, consult a qualified healthcare professional.

Paper Notes

Limitations

Models used in the study do not permit survival analyses, meaning researchers could predict whether someone would develop dementia but not precisely when. Because 96.5% of participants identified as non-Hispanic White, the team could not assess whether the model measures cognition equivalently across different racial and ethnic groups. Many of the 169 available process variables were not retained in the final model, likely because certain error types were too rare to model reliably. All-cause dementia was examined rather than specific conditions, such as Alzheimer’s versus vascular dementia, which may show different patterns of sensitivity to particular error types. Strict measurement invariance did not fully hold across age, sex, and education groups. Additionally, while machine learning models showed strong performance, the researchers note that external validation in multiple independent samples would be required before such models could be considered reliable for clinical use.

Funding and Disclosures

This work was supported by the Framingham Heart Study of the National Heart, Lung, and Blood Institute of the National Institutes of Health and Boston University School of Medicine, and was funded in part with federal funds from the National Heart, Lung, and Blood Institute under Contract No. 75N92019D00031. Additional support came from the National Institute on Aging (grants U19 AG068753, R01-AG072654, R01-AG062109, R01AG083735, U01AG068221, K07-AG066813, and K01-AG057798), the U.S. Department of Veterans Affairs Merit Award (CX002400), and a VA Career Development Award (CX002625). Rhoda Au is a Scientific Advisor to Signant Health and NovoNordisk. All other authors report no conflicts of interest. All participants provided written informed consent, and the study was approved by the institutional review board at Boston University in accordance with the Helsinki Declaration of 1975. This article is published as Open Access under a Creative Commons Attribution licence.

Publication Details

Title: Psychometric modeling of Boston process approach data for dementia prediction in the Framingham Heart Study | Authors: Brandon Frank, Ashita Gurnani, Landon Hurley, Calvin Guan, Stacy L. Andersen, Sherral A. Devine, Maureen K. O’Connor, Andrew Budson, Chunyu Liu, Honghuang Lin, Sanford Auerbach, Yulin Liu, David J. Libon, Catherine C. Price, Lindsay Farrer, Jesse Mez, Alvin Ang, and Rhoda Au. Brandon Frank and Ashita Gurnani contributed equally as shared first authors. | Journal: Journal of the International Neuropsychological Society (2026), 1–12 | DOI: https://doi.org/10.1017/S1355617726101921 | Affiliations include: VA Boston Healthcare System; Alzheimer’s Disease Research Center and Department of Neurology, Boston University Chobanian & Avedisian School of Medicine; the Framingham Heart Study; VA Bedford Healthcare System; Boston University School of Public Health; University of Massachusetts Chan Medical School; Rowan University School of Osteopathic Medicine; University of Florida; and Slone Epidemiology Center at Boston University, among others.