Roediger & Karpicke (2006)
Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249–255.
In their landmark experiment, Roediger and Karpicke had college students study brief prose passages (about 250 words on subjects like sea otters or the sun) under one of two conditions. The "study-study" group read the passage four times, with brief pauses between each reading. The "study-test" group read the passage once and then took three free-recall tests, writing down everything they could remember without seeing the text again.
Five minutes after the final session, the study-study group performed slightly better. They had just re-read the material, after all. But when both groups were tested again one week later, the result reversed dramatically: the study-test group recalled around 60% of the material, while the study-study group recalled only about 40%. The act of retrieval, even without feedback, had cemented the material in long-term memory in a way that re-reading had not.
The authors called this the "testing effect" and argued it ran counter to how most educational practice was designed. Students typically prepare for exams by re-reading notes and highlighting passages, activities that produce a feeling of mastery during the act but leave little residue a week later. Active retrieval, by contrast, feels harder in the moment but builds durable memory traces.
Adesope, Trevisan & Sundararajan (2017)
Rethinking the use of tests: A meta-analysis of practice testing. Review of Educational Research, 87(3), 659–701.
This meta-analysis pulled together 272 effect sizes from 118 independent experiments published between 1924 and 2014. The authors compared practice testing against alternative study activities (re-studying, note-taking, concept mapping, doing nothing), across age groups from elementary school through university, across subjects from biology to history to a foreign language, and across retention intervals from minutes to weeks.
The main finding: practice testing produced a Hedges' g effect size of 0.61, which in educational research is considered medium-to-large and roughly translates to moving an average student from the 50th to the 73rd percentile. The benefit was robust across virtually every condition tested. It held for free-recall, short-answer, and multiple-choice formats. It held with feedback and (with smaller effect) without. It held for novices and experts. It transferred to new contexts: students tested on one set of facts performed better on related, untested material than students who had simply re-studied.
The meta-analysis also found that practice testing reduced the metacognitive overconfidence that often accompanies re-reading. Students who study via testing develop a more accurate sense of what they actually know.
How TradeInTune applies it
Every module ends with a decision you have to commit to. Quizzes, scenario setups, chart identifications. There is no "watch the lesson, you'll pick it up". There is "make the call, see what the market did." Wrong answers don't end the lesson; they queue the concept for tomorrow's drill, where retrieval happens again. The platform is built around the testing effect, not around the illusion of fluency that comes from re-watching.