This site is a work in progress and has not been widely shared. Content may contain errors. Feedback is welcome.
This site is undergoing review. Some annotations were human-generated, some AI-generated — all are being verified.
Back to papers

Effects of AI Feedback on Learning, the Skill Gap, and Intellectual Diversity

Riedl, Bogert

2024arXiv preprint1 citations
Observational labor marketManagement / Organizational BehaviorCausal
AI (General)Decision-makingTraining / upskillingAugmentation vs. substitution
Abstract

Can human decision-makers learn from AI feedback? Using data on 52,000 decision-makers from a large online chess platform, we investigate how their AI use affects three interrelated long-term outcomes: Learning, skill gap, and diversity of decision strategies. First, we show that individuals are far more likely to seek AI feedback in situations in which they experienced success rather than failure. This AI feedback seeking strategy turns out to be detrimental to learning: Feedback on successes decreases future performance, while feedback on failures increases it. Second, higher-skilled decision-makers seek AI feedback more often and are far more likely to seek AI feedback after a failure, and benefit more from AI feedback than lower-skilled individuals. As a result, access to AI feedback increases, rather than decreases, the skill gap between high- and low-skilled individuals. Finally, we leverage 42 major platform updates as natural experiments to show that access to AI feedback causes a decrease in intellectual diversity of the population as individuals tend to specialize in the same areas. Together, those results indicate that learning from AI feedback is not automatic and using AI correctly seems to be a skill itself. Furthermore, despite its individual-level benefits, access to AI feedback can have significant population-level downsides including loss of intellectual diversity and an increasing skill gap.

Summary

Riedl and Bogert use panel data from 52,000 chess players on lichess.org and 42 platform updates as natural experiments to study how endogenous AI feedback seeking affects learning, skill gaps, and intellectual diversity in strategic decision-making

Main Finding

AI feedback on failures increases future performance while feedback on successes decreases it; higher-skilled individuals seek more AI feedback (especially on failures) and learn faster, thus AI access increases the skill gap; access to AI feedback causally decreases intellectual diversity at the population level

Primary Datasets

Lichess.org platform data (games, AI analysis records, player ratings)

Secondary Datasets

None

Key Methods
Panel regression with individual and time fixed effects; generalized random forest for heterogeneous treatment effects; regression discontinuity in time combined with natural experiments (42 platform updates); control function approach for endogeneity
Sample Period
2017-2023
Geographic Coverage
Global (online chess platform lichess.org)
Sample Size
403,010 games from 52,251 human players (up to 50 games per player against bots)
Level of Analysis
Individual, Task
Occupation Classification
None
Industry Classification
None
Notes
arXiv:2409.18660 [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games. [Claude classification]: Uses chess engine Stockfish for AI feedback analysis. Leverages 42 platform updates as natural experiments combined with regression discontinuity in time. The AI studied is a chess engine, not LLM/generative AI. Control function approach addresses endogeneity of losing games.