With anything in life, the simpler the better. This type of motto is forgotten with the chain rule in most calculus texts, which should really be left as simply the following:

Teaching how to apply the chain rule is, in theory, simple: teach your students to identify when we’re dealing with a composition of two functions, find your outer function f and your inner function g, and go to town. Nevertheless, calculus texts, always fond of killing as many trees as possible, like to pad the section with “generalized” versions of other derivative rules, which are simply specialized versions of the chain rule. For example, the following take up most of Rogawski’s section on the chain rule (it has been modified to use prime notation instead of Leibniz notation):


So now, when approached with a problem, the typical student now wonders whether to use one of five different things, even though it’s always the chain rule.

Thoughts?