## Teaching for Understanding vs. Teaching for Reasoning Skills

In my last post, Dan Meyer and I discussed whether having students make and test their own conjectures can lead to poor long-term content learning. I think it easily can, not because of poor teaching but because of humans’ limited working memory. Dan’s reply captured the inquiry-learning perspective perfectly, and I made a first pass at replying in the comments but promised to reply with more later. Here goes:

**Conceptual Understanding is Fragile**

Dan writes:

If calculating LCMs were my highest goal here, I would turn to other strategies, including lecture and definition. But calculating LCMs is secondary to conjecturing and testing your conjectures. That’s the higher goal here.

Can you tell me what help you see direct instruction offering me there?

In short, Dan seems to be saying that lecture is great for teaching computational skills, but that he’s willing to sacrifice some efficiency in computational learning in order to develop students’ reasoning skills (e.g., testing conjectures). That’s great, but it skirts the question of improving conceptual understanding, which is a totally separate dimension. Even students with good reasoning skills (the habits of mind that lead to productive inquiry) and strong computational fluency can have poor conceptual understanding. This happens when they regularly go through the instructional sequence Dan lays out: inquiry until students make the desired discovery, followed by notes and drill practice on the skill. The assumption underlying this approach is that once students have made a discovery for themselves, they understand it deeply enough to move on to application and drill practice–that having discovered a concept naturally leads to having a strong, durable conceptual understanding of it.

In reality, conceptual understanding is fragile. Students need practice retrieving the reasons for their conclusions in different contexts to establish them in long-term memory and get them connected to related conceptual schema that are already there. The mere fact of having made a discovery doesn’t guarantee I’ll remember the reasons for it tomorrow, nor that I’ll think to transfer that understanding to related situations. I still need lots of practice explaining “why?” and “how would it be different if…”, as well as “would the same pattern apply in this situation?” and “how could you represent that another way”.

Too often in inquiry lessons (including my own), this practicing of the reasons is relegated to the whole-class debrief in which small groups describe their thinking while sharing out from the investigation. I know this is an attempt to provide conceptual practice, but it’s nowhere close to what’s needed. For any given “how would it be different if…” question, at least 50% of students probably don’t understand, but because they successfully made the desired discovery, they (and the teacher, often including me) accept it. How much formative assessment do teachers do in this debrief phase? If incorrect reasons pop up in the discussion, do they just let another student speak up to correct the record, or do they stop and reteach that reason to mastery?

In short, the discovery is not the lesson. It’s just the set-up for the real lesson, which is when we rehearse the reasons for what we’ve concluded and how it’s connected to everything else we know.

**Dan’s Challenge: How Can Direct Instruction Teach Conjecture-Making?**

To go back to Dan’s comment,

But calculating LCMs is secondary to conjecturing and testing your conjectures. That’s the higher goal here…Can you tell me what help you see direct instruction offering me there?

My short answer is that since cognitive load is the issue, you’d want to instruct students directly on any techniques they could use to reduce their own cognitive load: making organized lists, searching through the problem space in an organized way, etc…the sort of things we encourage students to do anyway.

Secondly, I’d say that direct instruction of conceptual understanding (the sort I referred to as the “real lesson” above) probably helps students make and test conjectures. Better understanding leads them to ask better questions during the inquiry phase. I haven’t done a lot of reading on this, but one paper I remember from grad school shows that people with poor content knowledge tend to ask shallower or less relevant questions. Here’s the abstract:

Questions should emerge when a person studies a device (e.g., a lock) and encounters a breakdown scenario (“the key turns but the bolt doesn’t move”). Participants read illustrated texts and breakdown scenarios, with instructions to ask questions or think aloud. Participants subsequently completed a device-comprehension test, and tests of cognitive ability and personality. Deep comprehenders did not ask more questions, but did generate a higher proportion of good questions about plausible faults that explained the breakdowns. An excellent litmus test of deep comprehension is the quality of questions asked when confronted with breakdown scenarios.

I’m sure there’s better research out there on this topic, but the main idea would basically be that students at a low Van Hiele level would be inhibited from making good conjectures by being literally unable to perceive the deep features of the scenario. Wouldn’t those students develop their conjecture-making and testing abilities more if they had better conceptual understanding?

Please note that I’m not an advocate for direct instruction, just for attending to what students are really thinking, even when they appear to have made the right discovery. If I’m an advocate for anything, it’s for paying attention to cognitive factors in learning, not because they’re more important than motivational ones, but because I think the MTBoS community sometimes gives them short shrift. So cognitive load is my thing, much more than direct instruction.

**Have They Encoded the Wrong Rule?**

See Dan’s comment in my previous post for the context, but no, I don’t think they have. When you learn the rule that for numbers like 2 and 10 (in which one is a multiple of the other), the LCM is just the larger number, you’re actually learning two separate facts: the rule, and when the rule works. Learning the first and not yet knowing the second doesn’t mean you have the rule wrong, it means you’re ready to make your next discovery.

## Dan’s “Shipping Routes” and Cognitive Load

During Dan’s series of lesson makeovers last summer, I intended to write up a critique of the Least Common Multiple makeover he and Dave Major created. Dan’s write-up seemed to illustrate his view of the respective roles of inquiry and direct instruction, and I’ve always thought it showed a misinterpretation of the cognitive research that cautions against unguided inquiry. I didn’t bother chiming in until Grant Wiggins wrote this blog post, bringing up the issue of cognition, working memory, and inquiry vs. direct instruction again. So here are my thoughts.

For context, Shipping Routes shows students the clip below and asks whether the two boats will ever get back to port at the same time:

Students are then sent to a simulator programmed by Dave Major, allowing them to choose different round-trip times for the two boats and showing their cycles back and forth. Students can discover for themselves when the boats get back in sync, and what that has to do with least common multiples. For example, boats with round-trip times of 2 min and 5 min will be back in sync after 10 min. Sadly, Dave’s simulator seems to be offline right now, so you can’t try it for yourself.

**Dan’s approach**

In his write-up, Dan shies away from direct instruction right from the get-go:

I could

tellstudents what to look for here and how to approach the problem. I could show a few worked examples…Two problems there:

- Some students will need more than just three examples to determine a pattern.
- My
selectionof those particular examples – that is, my decomposition of the entire solution space into just three categories – did a lot of the intellectual heavy liftingformy students. They need to decide on those three categories and come up with a rule that takes them all into account.

Regular readers of Dan’s blog (cuz, yeah, there really aren’t regular readers of my blog) may know that worked examples are an interest of mine. Here’s how the stuff I’ve learned applies to Shipping Routes.

**Science of Memory and Learning**

Humans have 2 kinds of memory, working memory and long-term memory, and they function completely differently. Retrieving information from long-term memory is almost effortless, and we can process huge amounts of information from long-term memory simultaneously. In contrast, working memory can only hold on to 4-7 chunks of information, and only for about 30 seconds or so. The word “chunks” is important. For a demonstration, watch the first 1:30 minutes of this video:

There are various theories about how information gets encoded in long-term memory, but the main idea is that the information has to stay in working memory for long enough to get practiced/recalled several times before the working memory dumps it–each practice opportunity strengthens its foothold in long-term memory. Working memory will dump it when new information comes in, so if you keep throwing new information at someone who’s still encoding the previous information, you make it very hard for them to form long-term memories, even if they seem to be processing what you’re saying as you’re saying it. It could have made sense to them when you said it and then disappeared from their mind when you said the next thing. (Sound familiar to any teachers?)

For that reason, inquiry instruction can make it very hard to encode information into long-term memory. As Sweller et al (2006) say in paper that spawned much controversy, “Inquiry-based instruction requires the learner to search a problem space for problem-relevant information. All problem-based searching makes heavy demands on working memory. Furthermore, that working memory load does not contribute to the accumulation of knowledge in long-term memory because while working memory is being used to search for problem solutions, it is not available and cannot be used to learn.”

Grant Wiggins really does not like this paper–much of his most recent post (which inspired me to dust off my keyboard here) is spent picking it apart, particularly the “ludicrous” emphasis Sweller et al place on novice vs. expert learners. I think Wiggins misunderstands the authors on that point. Granted, I’m a layman, but I think the difference between novice and expert learners is pretty simple: isn’t it the difference between someone who’s encoded the information being studied into long-term memory and someone who’s still processing it from working memory?

The strong opinions (not limited to Wiggins by any means) reflect the fact that this theory of memory and cognition points out shortcomings of inquiry learning. As I’ll describe at the bottom of this post, it does not actually mean that you should never use inquiry, but some people on both sides of the debate say it does, particularly when the theory itself is misunderstood.

**Critiquing Shipping Routes**

How would this play out in Shipping Routes? In order to be successful with Dan’s lesson, students need to try different pairs of round-trip times on the simulator to discover what governs when the boats are back in sync. Students can set times to the tenth of a second, e.g, 3.2 min for the first boat and 4 min for the second boat. These boats would by in sync for the first time after 16 minutes. Would they notice how 16 arises from 3.2 and 4 as the least common multiple?

Let’s give the lesson the benefit of the doubt and say that a teacher would suggest (or have a student suggest) that everyone try whole-number times to start. So one group of students might, for example, try 2 min and 6 min. and discover that the answer, 6 min, is the larger number. At this point, Dan would have the teacher challenge this group to see if that rule always works, with an eye toward finding counterexamples. Dan says,

If a student just tries the first example and says, “It’s easy. It’s always the longer of the two times.” I can then say, “Great. But try that on several more examples and make sure it works.” (It won’t.) Or I can

suggestone of the other two categories. But I’d rather not offer those categories before the student has even considered why she mightneedthem, or even the fact that therearedifferent categories.

The other categories are coprime numbers (like 2 and 5) or numbers with a common factor (like 6 and 10). Students have to identify the three situations and find the rule for each, with minimal guidance.

Do you see how this might overtax working memory to the point of inhibiting long-term memory formation…how students might successfully find one or two of the three rules on Monday and then walk in on Tuesday having forgotten what they’d discovered? Or find all three rules, but be unable to remember the first one by the time they find the last one? If you don’t think this would tax their memories, keep in mind that many students have to stop and think or count on their hands just to remember that (3)(9) = 27.

**Modifying Shipping Routes**

How could Shipping Routes be adapted so it accounted for working memory limitations? Here’s what I would do. Whatever rule students discover first, before encouraging them to find cases where their new rule DOESN’T work and where they need to create a new rule, I’d have them fully digest the math behind the rule that they have found. They’ve just found this rule. It’s fragile, sitting in their memory buffer temporarily–this is our chance to have them encode it into their long-term memory and harden it by connecting it to other conceptual schema already in their brains. (Yes, I do believe strongly in developing conceptual understanding in addition to mere skill development). If we have them switch to a new search for a new rule, we throw away most of the benefit of the discovery they just made.

How would I help them practice it and connect it to other concepts? After students played around with the simulator for a while, I’d present them with text input boxes like this and ask students to fill in the blanks:

**Boat 1****:** 2 min

**Boat 2:** ______

**Time until they’re in sync:** _____

A computer could easily analyze student responses and figure out which rule they’ve discovered. Let’s take the example of a student who discovered that for coprime numbers like 2 and 5, you multiply the numbers. Now I’d want to have the students really dig into that rule.

For example, I could ask them to re-represent the combination they came up with on a double number line:

Then I’d ask them to self-explain the connection via drop-down menus, as in Martina Rau’s paper finding that multiple representations didn’t help students learn fractions unless students were prompted to self-explain the connections between the representations. (The picture shows drop-down menus for self-explaining adding fractions, but you could do something similar for explaining why 2 and 5 have a common multiple at 10):

Then I’d have students practice this rule a minimum of 5-10 times, perhaps following the approach of this other paper by Martina Rau.

And *then, *finally, I’d have the online lesson challenge these students to find a case that breaks their rule, and the learning cycle would begin again.

**So When & How Should We Use Inquiry?**

As I said above, understanding working memory limitations doesn’t mean you should never use inquiry. Rather than spelling out my views on this here, I’ll just point you all again to Dan’s posting of my thoughts on this topic over at his blog. Incidentally, I did try to convince some researchers in the learning sciences to use Dan’s makeover lessons as a test-bed for studying instructional principles, and I still think that would be a great idea (see my first blog post ever). I mean, researchers, if you want teachers to actually pay attention to your work, why not conduct your studies in the context of the lessons we’re all talking about online?

**Update: **Dan’s reply from the comments:

“Whatever rule students discover first, before encouraging them to find cases where their new rule DOESN’T work and where they need to create a new rule, I’d have them fully digest the math behind the rule that they have found. They’ve just found this rule. It’s fragile, sitting in their memory buffer temporarily–this is our chance to have them encode it into their long-term memory and harden it by connecting it to other conceptual schema already in their brains.”

But you have them encoding an incorrect rule!

If I have any recurring complaint about the literature around direct instruction it’s that it doesn’t account for the effect of direct instruction on student motivation. If I have a /second/ complaint it’s that it doesn’t adequately account for the difference between what teachers /say/ and what students /learn/. It’s like, “If the students didn’t learn what the teacher said, the teacher either needs to say it again or say it better.”)

“Do you see how this might overtax working memory to the point of inhibiting long-term memory formation…how students might successfully find one or two of the three rules on Monday and then walk in on Tuesday having forgotten what they’d discovered?”

I wouldn’t count on students being able to calculate LCMs based on this activity alone. They’d need plenty of fluency practice also, which would give me more opportunities for direct instruction. If calculating LCMs were my highest goal here, I would turn to other strategies, including lecture and definition. But calculating LCMs is secondary to conjecturing and testing your conjectures. That’s the higher goal here.

Can you tell me what help you see direct instruction offering me there?

**My initial reply (more to come later):**

@Dan, there’s lots to chew on in your reply. I’ll have to reply in pieces, because I need to pick up a certain little guy from daycare.

I agree that direct instruction literature often fails to account for student motivation–that’s important, and it’s what led me to slowly adopt lots of your techniques. I’m more skeptical of your second complaint. You say that the direct instruction literature “doesn’t adequately account for the difference between what teachers say and what students learn.” But this is a cognitive concern, and cognition is where direct instruction literature has is strongest results. That’s the area where we find numerous studies showing stronger or more efficient learning than you get from inquiry learning.

Perhaps you’re saying that direct instruction is efficient at teaching skills, but that it bombs when it comes to teaching what those skills mean and how they’re connected conceptually. If so, that’s pretty much how I interpret Grant Wiggins’ post, too.

Here’s why I disagree with that. The way I see the original Shipping Routes lesson playing out is that we’d do that activity, there’d be lots of good conjecturing and testing/discussing of conjectures, and then we’d start taking notes and doing fluency practice. For the reasons I described above, I think lots of students would be starting essentially from zero at that point. Lots of good conceptual wrestling would have been done during the investigation, but little of the understanding would have stuck in students’ long-term memory, so what would actually end up sticking would be the explanations delivered in note-taking and fluency practice. Too little guidance during the inquiry phase forces me as teacher to do too much “knowledge dumping” at the end, when I just try to pour the concepts into kids’ heads.

If you really want to attend to the differences between what teachers say and what students learn, you have to give them lots of feedback at the moment they’re constructing the concepts. This calls for inquiry that is more guided. We still want to achieve the Generation Effect and the euphoria of figuring things out for yourself, but we want it to stick, too.

I’ll have to respond later to your first objection, that they’d be encoding an incorrect rule, and to your closing question about how direct instruction could help with the metacognition you prioritize. Happy Friday, and thanks for taking the time to read the post and respond.

Anybody else feel like responding, or do I have to get my mom to comment on here 🙂

**Update^2: **For the rest of the reply, see my next blog post here: Teaching for Understanding vs. Teaching for Reasoning Skills

## Teaching Direct Variation Conceptually (as a foundation for slope)

OK, the following may rate as one of my best test questions ever. It might take you a second to see the point:

If you don’t get it yet, compare it to this alternate version:

By now you’ve probably seen what I’m getting at: the answer to #1 is “Not enough information”, because there’s a jar there, so the weight per candy is not 7 grams.

Let me share the struggles that led to this idea, and the way this approach will underpin my teaching of linear functions next year:

First of all, I want to continue to make my math real-world (even if nobody at Dan Meyer’s blog can articulate why), but too often that means students have to wade through lots of WORDS to understand a question or task. This is especially daunting for English Language Learners, and I’m working on meeting their needs better. So I need to find ways to capture the essence of concepts and story problems with mostly pictures and a minimum of text.

Secondly, my students often have a hard time seeing why rates are often calculated as (y_{2} – y_{1})/(x_{2} – x_{1}) instead of y/x. With a good context like Domino Effect by Mathalicious, students will discover the idea for themselves, and they’ll be able to memorize it and apply it in real-life situations that obviously give them 2 points. But I’ve always taught direct variation at the end of linear functions, as a special case after the main concept…and after that lesson, some students revert to calculating slope as y/x. They can’t distinguish cases when you need to subtract first from cases when you can just divide.

So instead of teaching direct variation last, next year I’ll teach it first, as the introduction to linear functions. Since they won’t know (y_{2} – y_{1})/(x_{2} – x_{1}) yet, we’ll just start by sorting situations into 2 categories: ones in which you can find the **rate** (not “slope”–that word won’t exist for us yet) by dividing y/x, and situations in which that makes no sense. We’ll make predictions for new situations in the first category by writing proportions, writing equations of the form y=kx, drawing graphs that pass through the origin, etc. And mixed in with all of that practice will be situations in which direct variation and proportionality don’t work:

(The bar itself weighs something here).

Once we’ve really hammered the idea that there are 2 categories of linear scenarios, ones in which the rate is just y/x and ones in which it’s not, we can finally do Domino Effect and figure out how to deal with the second category. As an aside, I do need to find more 3-Acts that are simple linear scenarios and are still enticing for students. If anybody knows of some, please tell me in the comments. I tried Dan Meyer’s Pencil Sharpener task, which he seems to have taken down from 101q’s, but it gave me a really poor prediction. That’s no good, because early in the year I want my students to feel *impressed* by their new ability to predict things.

As a final note, I think even before starting the direct variation lesson, we might use Simpsons Sunblock to get students thinking about proportions. In the past, they’ve tended to make their predictions by writing and solving a proportion (or at least informally by doubling the x- and y-values from a particular point in their data table to predict the needed values). Then when we formally introduce the concept of proportionality/direct variation, we can return to Simpson’s Sunblock and talk about how the method they already used there was an example of it. And we can translate the proportions they used into direct variation equations.

Yay! Why is it that March is always when I realize what I should have been doing in September/October?

## Cryptography and Horizontal Function Transformations

This lesson has gone well enough for me that I think it might be worth putting out there for others to use. I’ve been troubled for years by the lame explanations I had to give for why horizontal transformations are backwards. Why does y=(x – 3)^{2} get shifted to the *right? *Every explanation I’d give was met with the polite “Oh, okay” that’s student for “I have no idea what you just said.”

So here’s a taste of what how I taught the topic using the context of cryptography. I gave students the following function and asked them something to set a low barrier to entry: if someone sends you the message “8-9”, then what are they saying to you?

(The answer is “Hi”, of course). But that code is too obvious, so then I introduced the idea of transforming the code to something more secret. For example, you and a co-conspirator could agree that to read a message, you’d first add 3 to each number, and then look up the corresponding letter on the graph. So if I send you the number 1, what letter did I send? (The answer is that you first do 1 + 3 to get 4, so the correct letter is D). That’s simple enough, so we practiced it and built up to something more complicated:

The answer to question e) is 55. This is not easy to see without doing some calculations, and those calculations already get you thinking in the backwards logic of horizontal transformations. Since “S” is the number 19, you need (x+2)/3 to come out to 19. So you first multiply 19 by 3, and then subtract 2. Students should show work like this: f[(55+2)/3)] = f(57/3) = f(19) = S.

After that discussion, it’s time to let the students have some fun by sending a secret message to a classmate with the code f(2x-3).

By then, we’ve noticed that it’s annoying to have to recalculate each letter you want to send. Wouldn’t it be nicer to just notice the patterns, and adjust the entire code of f(x) at once? We look for patterns in how the points on the graph are transformed, like this:

(I’ve put the answers in the graph in red font). Here you can already see the backwards thinking in effect: although the fact that 0.25 < 1 might make you think that f(0.25x) is compressed, in fact it’s stretched. And you understand the logic–you need a larger value of x, because the 0.25 is going to shrink it, and you’re still trying to come out to the same letters of A, B, and C.

What I really like about this approach is that it also makes clear why the order of operations is reversed for horizontal function transformations. For example, in the function y=sin(2x+pi), you *first* shift everything to the left by pi, and *then* compress by a factor of 0.5. Why are these transformations applied in the reverse order of operations? Because in essence, you’re solving an equation, just like we solved (x+2)/3 = 19, above…and solving an equation always involves “undoing” the expression in the reverse order in which it was built up.

Students apply that reasoning, first by encoding the letters of the word “JAMES” using the function f(0.5x + 6), which has two transformations. Then, looking at the pattern of the steps they took in each case (first subtracting 6 from both sides of an equation, and then multiplying by 2 to cancel out the 0.5), they apply those transformations graphically to another word, “BOND”, in the same order. First subtracting 6 from each x-value means moving it to the left 6, and then multiplying by 2 means stretching it horizontally by a factor of 2. Students know that the order matters here, because of some exploration we did in the intro section of this activity.

Students then translate their understanding to actual mathematical graphs:

This should set my students up nicely to understand period and phase shift. Think again about the function y=sin(2x+pi). When transforming graphs, it’s usually easier to stretch/compress before you translate. Since order of operations is reversed, if we want to compress first, we have to rewrite it as y=sin[2(x + pi/2)]. This way, you compress by 0.5 first, and then just shift everything by pi/2 to the left.

And, if you’ve gotten all the way down to this part of the blog post, here’s a link to the activity: https://www.dropbox.com/s/gam9uzwqrnl91l8/Introduction%20to%20Functions%20and%20Cryptography.pdf

For some context, see Kate Nowak’s related musings: http://function-of-time.blogspot.com/2013/09/building-functions-clarified.html

## Teaching is hard

I’m in my 10th year as a teacher. I’m a much better teacher than I was in my first couple years, and a much better teacher than I was before I found the MTBoS. But I struggle with keeping a growth mindset–I keep wondering when I will finally be the teacher I want to be. Part of this is that my standards go up every year. Lessons that seem okay this year would have struck me as awesome 5 years ago. And part of it is that this is only the second year ever that I haven’t had either a brand new prep or a new school to adjust to. But still, teaching is hard. So for all of us out here busting our butts over summers, weekends, etc…keep it up, and remember that a growth mindset is the most important thing.

## Rough idea of how I want to use Penny Circle

Been trying to figure out how to use Penny Circle, by Dan Meyer and Desmos. I think the activity is terrific, but like L Hodge (in the comments on Dan’s blog), I think the activity does some of the intellectual work for students, so they won’t be challenged to think as hard as I’d like. (However, it’s a great first experience of the IDEA of modeling, so I’m not criticizing it…I just want to take it further).

I only have a second right now (will flesh this out later), but I think I’m going to have students do the activity as intended, and then ask them to create an equation for diameter of the penny circle after Dan has been putting down pennies for *t* seconds. The Desmos activity generates the function *n*(*d*), where *n* is the number of pennies and *d* is the diameter of the circle. My students will have to find the function *d*(*n*), and then use some video of Dan putting pennies down to find *n*(*t*). Then they can use composition of functions to get *d*(*t*). I think there is some video of Dan putting pennies down in the 3-Act. Anyways, back to work, but if anyone has comments or especially criticisms/warnings, I’d love to hear them.