With anything in life, the simpler the better. This type of motto is forgotten with the chain rule in most calculus texts, which should really be left as simply the following:
Teaching how to apply the chain rule is, in theory, simple: teach your students to identify when we’re dealing with a composition of two functions, find your outer function f and your inner function g, and go to town. Nevertheless, calculus texts, always fond of killing as many trees as possible, like to pad the section with “generalized” versions of other derivative rules, which are simply specialized versions of the chain rule. For example, the following take up most of Rogawski’s section on the chain rule (it has been modified to use prime notation instead of Leibniz notation):
So now, when approached with a problem, the typical student now wonders whether to use one of five different things, even though it’s always the chain rule.
Thoughts?
#1 by Jon Rogawski at March 16th, 2010
| Quote
Paul – I agree that simpler is better, so I thought about this quite a bit. In the end, I felt it was important for students to learn to apply the Chain Rule by pure reflex for those special cases that come up repeatedly – after all, developing the right pavlovian reflexes is also part of learning mathematics. But there are no absolutes in pedagogy, and I certainly acknowledge your point (BTW, for the 2nd Edition currently in progress, I did reduce this slightly).
#2 by Kellen Myers at October 24th, 2010
| Quote
I will point out that Strauss, Bradley, & Smith is the other Calc textbook at Rutgers, and this text also has several listings of rules that are no more than special cases of rules they’ve already learned.
In particular, p 140 features the “Extended Derivative Formulas” each of which is a combination of one of the power, trigonometric, exponential, or logarithmic rules plus the chain rule.
This semester alone, at least three or four students have asked me whether they should use these rules rather than the chain rule.
These are certainly useful examples, but I think I agree that presenting them as additional rules for derivatives is not helpful – it is, I would reckon, somewhat harmful.
#3 by Sam at January 5th, 2011
| Quote
Pavlovian? This is the sort of thing that kills mathematics for so many otherwise interested kids.
That said, I will admit that it’s fun for me when I get to the “Pavlovian” point, that is, I’ve mastered the use of a concept and now I don’t have to think about it.
But I have to get there myself, in this case, by seeing the chain rule and then doing a bunch of exercises. If you try to beat it in, you’re going to lose me.
#4 by Mark Timothy at February 2nd, 2011
| Quote
I always fear mathmatics and i think i have a phobia of it. I do not know how people understand these stuff.
#5 by Sophia at June 1st, 2011
| Quote
In standard calculus you will note that the derivative of a function is a multiplier for a linear term in the variable. The n-th derivative is the multiplier for an n-th degree term. Effectively then the first derivative times the variable is a linear function, and higher order n-th degree functions.
#6 by David at June 1st, 2011
| Quote
I really wonder why most teachers requires students to do the long method of computation when in fact, there are simpler calculations that will result to the same answers. I think this is where students start to be confused with the subject.
#7 by mustafa at July 14th, 2011
| Quote
i am student at hte college, so i have a problem to understand the concept of chain rule in Leibniz notation , it doesnt make sanse ??
#8 by Charlie at August 21st, 2011
| Quote
Brings back college days – Calculus was my favorite subject, but now I wonder how I actually made it through. I am now going to play around with the chain rule for a bit and see how it stacks up. Thank you for reigniting the old math spark!
Charlie
#9 by David at October 28th, 2011
| Quote
I agree with Paul’s sentiments. For some reason, text books ( and more critically lecturers) often teach multiple subset rules rather than one superset rule. In my opinion it is best to state and prove a superset theorem, and then demonstrate how each of the subset rules is an application of the initial theorem. This allows students (who are so inclined) to really understand the maths, rather than remembering rules. This builds flexible and robust knowledge, while wrote learning is brittle outside of the confines of the recognised examples. There is another subtle problem with teaching multiple subset rules. It implies they are necessary, i.e. it implies the student has missed something that distinguishes these cases from the more general case. That mightn’t be concerning for someone who knows they aren’t necessary. But for a student learning this for the first time he must think that they are necessary. This increases the burden on memory, and for all but the most diligent students (who will seek out the unifying principle independently) it gives them a sense that they are missing something diagnostic. This means they will lack confidence in recognition of the problem type at exam time. Unfortunately the problem is rife. During my finance days at university we were told to remember at least 5 different formulae for annuity valuations, when the sum of a geometric progression was the only tool required. As another example I have often wondered why y = e^(f(x)) dy/dx = f’(x)e^(f(x)) is taught in preference to the more general: y = a^(f(x)) dy/dx = f’(x)a^(f(x))logbase e (a) {my apologies for notation but I am no html expert}. As Paul notes, both cases should be shown to have a connection with the chain rule. My hope is simple: that writers of text books prove the superset rule first, and then when writing the subset rules, draw attention to how they are no more than specific cases of that superset rule.