Your Brain on ChatGPT: Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing Task
Kosmyna, Hauptmann, Yuan, Situ, Liao, Beresnitzky, Braunstein, Maes
This study explores the neural and behavioral consequences of LLM-assisted essay writing. Participants were divided into three groups: LLM, Search Engine, and Brain-only (no tools). Each completed three sessions under the same condition. In a fourth session, LLM users were reassigned to Brain-only group (LLM-to-Brain), and Brain-only users were reassigned to LLM condition (Brain-to-LLM). A total of 54 participants took part in Sessions 1-3, with 18 completing session 4. We used electroencephalography (EEG) to assess cognitive load during essay writing, and analyzed essays using NLP, as well as scoring essays with the help from human teachers and an AI judge. Across groups, NERs, n-gram patterns, and topic ontology showed within-group homogeneity. EEG revealed significant differences in brain connectivity: Brain-only participants exhibited the strongest, most distributed networks; Search Engine users showed moderate engagement; and LLM users displayed the weakest connectivity. Cognitive activity scaled down in relation to external tool use. In session 4, LLM-to-Brain participants showed reduced alpha and beta connectivity, indicating under-engagement. Brain-to-LLM users exhibited higher memory recall and activation of occipito-parietal and prefrontal areas, similar to Search Engine users. Self-reported ownership of essays was the lowest in the LLM group and the highest in the Brain-only group. LLM users also struggled to accurately quote their own work. While LLMs offer immediate convenience, our findings highlight potential cognitive costs. Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels. These results raise concerns about the long-term educational implications of LLM reliance and underscore the need for deeper inquiry into AI's role in learning.
Kosmyna et al. conduct a lab experiment with 54 university students writing essays under three conditions (ChatGPT assistance, search engine, or unassisted) across four sessions, using EEG to measure brain activity, NLP to analyze essay content, and interviews to assess memory and ownership.
LLM-assisted essay writing significantly reduced neural connectivity across all frequency bands compared to unassisted writing, with the LLM group showing 83% inability to quote their essays accurately versus 11% in unassisted groups, and impaired essay ownership despite higher structural scores.
Primary Datasets
EEG recordings (Neuroelectrics Enobio 32-channel headset); Participant-written essays; ChatGPT interaction logs; Post-session interview recordings; Background questionnaires
Secondary Datasets
None
- Key Methods
- Randomized controlled lab experiment with EEG recording (32-channel), dynamic Directed Transfer Function (dDTF) analysis across frequency bands, NLP analysis of essays (named entity recognition, n-grams, ontology), human teacher and AI judge scoring, and post-session interviews.
- Sample Period
- 2024
- Geographic Coverage
- United States (Greater Boston area: MIT, Wellesley, Harvard, Tufts, Northeastern)
- Sample Size
- 54 participants for sessions 1-3 (18 per group); 18 participants for session 4
- Level of Analysis
- Individual
- Occupation Classification
- None
- Industry Classification
- None
- Replication Package
- Partial