This site is a work in progress and has not been widely shared. Content may contain errors. Feedback is welcome.
This site is undergoing review. Some annotations were human-generated, some AI-generated — all are being verified.
Back to papers

Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing Task

Kosmyna, Hauptmann, Yuan, Situ, Liao, Beresnitzky, Braunstein, Maes

2025arXiv27 citations
Experimental evidenceComputer Science / AICausal
LLM / Generative AIWriting / contentEducationHuman-AI collaborationDecision-making
Abstract

This study explores the neural and behavioral consequences of LLM-assisted essay writing. Participants were divided into three groups: LLM, Search Engine, and Brain-only (no tools). Each completed three sessions under the same condition. In a fourth session, LLM users were reassigned to Brain-only group (LLM-to-Brain), and Brain-only users were reassigned to LLM condition (Brain-to-LLM). A total of 54 participants took part in Sessions 1-3, with 18 completing session 4. We used electroencephalography (EEG) to assess cognitive load during essay writing, and analyzed essays using NLP, as well as scoring essays with the help from human teachers and an AI judge. Across groups, NERs, n-gram patterns, and topic ontology showed within-group homogeneity. EEG revealed significant differences in brain connectivity: Brain-only participants exhibited the strongest, most distributed networks; Search Engine users showed moderate engagement; and LLM users displayed the weakest connectivity. Cognitive activity scaled down in relation to external tool use. In session 4, LLM-to-Brain participants showed reduced alpha and beta connectivity, indicating under-engagement. Brain-to-LLM users exhibited higher memory recall and activation of occipito-parietal and prefrontal areas, similar to Search Engine users. Self-reported ownership of essays was the lowest in the LLM group and the highest in the Brain-only group. LLM users also struggled to accurately quote their own work. While LLMs offer immediate convenience, our findings highlight potential cognitive costs. Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels. These results raise concerns about the long-term educational implications of LLM reliance and underscore the need for deeper inquiry into AI's role in learning.

Summary

Kosmyna et al. conduct a lab experiment with 54 university students writing essays under three conditions (ChatGPT assistance, search engine, or unassisted) across four sessions, using EEG to measure brain activity, NLP to analyze essay content, and interviews to assess memory and ownership.

Main Finding

LLM-assisted essay writing significantly reduced neural connectivity across all frequency bands compared to unassisted writing, with the LLM group showing 83% inability to quote their essays accurately versus 11% in unassisted groups, and impaired essay ownership despite higher structural scores.

Primary Datasets

EEG recordings (Neuroelectrics Enobio 32-channel headset); Participant-written essays; ChatGPT interaction logs; Post-session interview recordings; Background questionnaires

Secondary Datasets

None

Key Methods
Randomized controlled lab experiment with EEG recording (32-channel), dynamic Directed Transfer Function (dDTF) analysis across frequency bands, NLP analysis of essays (named entity recognition, n-grams, ontology), human teacher and AI judge scoring, and post-session interviews.
Sample Period
2024
Geographic Coverage
United States (Greater Boston area: MIT, Wellesley, Harvard, Tufts, Northeastern)
Sample Size
54 participants for sessions 1-3 (18 per group); 18 participants for session 4
Level of Analysis
Individual
Occupation Classification
None
Industry Classification
None
Replication Package
Partial
Notes
MIT Media Lab; arXiv:2506.08872 [Claude classification]: Study focuses on cognitive neuroscience of AI-assisted writing using EEG analysis. Sample is highly educated (university students, ages 18-39, mean 22.9). Four sessions over 4 months with 20-minute essay writing tasks. Session 4 involved group reassignment (LLM-to-Brain and Brain-to-LLM). Key neural findings: Brain-only group showed significantly stronger connectivity in alpha (semantic processing), beta (cognitive control), theta (working memory), and delta (executive monitoring) bands. Statistical analysis used repeated measures ANOVA with False Discovery Rate correction. Essays scored by both human teachers and AI judge, with notable disagreements on uniqueness metrics.