Tuesday, February 25, 2020

Fight the Power!: Retrieval Practice

Learning By Doing

Let's start with a handful of questions. Without looking back at any of the previous posts, try to answer the following questions:

  1. What are the three processes involved in memory? 
  2. What is the shape of the forgetting curve? 
  3. How many items can be held in working memory at the same time? 
  4. What is the capacity of long-term memory? 
  5. Are there memories that we never forget?
The answers can be found at at the end of this post [1]. 

Wait...what was I going to say? 

Do you remember sliding down the memory curve? If not, it's okay. It’s been a while. Forgetting is a normal (and adaptive!) part of memory. Forgetting is non-linear, meaning it decays quickly and eventually slows down. If you plot it on a graph, then it might look something like this (see Fig. 1). The y-axis is the probability of successfully recalling a memory, and the x-axis is the amount of time that has elapsed since the last time you tried to recall that same memory. 



Figure 1. An idealized forgetting graph.

Notice the shape of the graph. It resembles a power function. In fact, most mathematical models of forgetting follow a power function, P = at−b , where P is the probability of accurately recalling an item, t represents time, and b is the forgetting rate [2]

In another past post, we tried to address the question of why the forgetting curve looks like this. John Anderson and his colleague Lael Schooler put forth the argument that memory is adapted to our informational environment. We forget because the environment does not demand that we remember. Put another way, memory, and therefore forgetting, is a reflection of the environment. That's an interesting argument because it means we can structure the environment in such a way that guards against forgetting.


Inoculating Against Forgetting

If Anderson and Schooler's argument is accurate, what can we do to improve our memory? Burr Settles, Research Director at Duolingo, has an excellent suggestion. In his blog post, he suggests that we treat forgetting by administering little booster shots over time [3]. If you remember a vocabulary word accurately, then the system waits a longer time span than if you forget. If you forget, then the system asks you to recall that word more frequently. It's pretty ingenious, and it's an excellent example of using technology to solve a tricky educational problem.

The concept behind the recommendation is called retrieval practice. In other words, you give your students an opportunity to retrieve a word, concept, or fact from long-term memory. Merely attempting to recall an item ends up helping to boost that item's strength in memory. The critical component is that you try. If you fail, however, then you are going to need feedback (i.e., you need to see the item you were trying to recall). Retrieval practice has been shown to be more effective than rereading or reviewing the same material [4].

It seems weird, but that's how memory works. By the fact that you are trying to recall something signals to the memory system that this item is important, and that I need to remember it for next time.


The S.T.E.M. Connection

How do we harness Dr. Settle's suggestion in a classroom environment, where specific items (such as words) are not being tracked by a computer for each individual student? Is there a way to help teachers administer those memory booster shots to their students? ðŸ’‰

The traditional method of teaching is to introduce a topic, solve a few illustrative problems that relate to that topic in class, assign some homework problems, and then give a test a few days or weeks later to see if the students retained the material. For highly important topics, the same items might make a reappearance on the final exam. Wouldn't the unit test and final exam count as a booster? 

Depending on the time series, probably not. There are two potential problems. First, if a topic hasn't been discussed in several weeks, then it is likely the memory system is going to treat that memory as unimportant, and it will find itself on the fast side of the forgetting curve. Second, if too much time elapses between the presentation and evaluation, then the probability of successful recall is going to be very low.

There are a couple of ways to combat this situation. First, if you are an educator, and you are in complete control over the homework items assigned to your students, then you can "sneak" an old item into the current problem set. The problem, of course, is that if you do this too often, then your inoculation graph might look like this:



Figure 2. Spaced practice for multiple items with different decay rates.

As you can see, this can get really messy, really fast. One way to deal with that complexity is to schedule homework assignments where all of the problems are review items.

Second, if your domain has facts or skills that build on older ideas, then students will automatically receive practice on the foundational material. Math is a great example. Learning about ratios can help students understand slope, which then leads into solving linear equations. By exercising the more complex skills, such as solving linear equations, student receive practice on ratio reasoning.

I understand that implementing these suggestions is difficult because there are a lot of factors at play in the classroom, but I hope it is helpful to think about forgetting in terms of multiple, overlapping power functions. With that image in mind, we can keep giving doses of anti-forgetting shots [5].

Share and Enjoy!

Dr. Bob

Going Beyond the Information Given

[1] Answers are: 1) encoding, storage, and retrieval; 2) it's a decelerating power curve; 3) between five and nine items; 4) extremely large; 5) there is evidence that we have permanent memories for some items.

[2] Of course, there is some debate about that. Two of undergraduate professors argue that the empirically observable power law might be an artifact of averaging over multiple exponential functions. I know. Your mind is blown, right? Mine was too when I first heard their argument. All of the gory details can be found in: Anderson, R. B. & Tweney, R. D. (1997). Artifactual power curves in forgetting. Memory & Cognition, 25, 724–730.

[3] Burr Settles, B. (2016, December 14How we learn how you learn. Retrieved from https://making.duolingo.com/how-we-learn-how-you-learn.

[4] Roediger III, H. L., & Butler, A. C. (2011). The critical role of retrieval practice in long-term retention. Trends in cognitive sciences, 15(1), 20-27.

[5] If you've been following this blog, you might notice that booster shots show up every so often. This post is at attempt to boost your memory of the forgetting curve and the environmental factors that influence memory!