General Substitution

Reducing an integral

At the heart of integration is substitution. It is our most powerful and versatile weapon for tackling almost any kind of integral. Pretty much every integral you solve will almost certainly involve some form of substitution. In fact, you’ll come to see that most methods of integration are actually just abstractions upon substitution. Just as chain rule governs the forces of nature, substitution gives rise to the echelons of integration. It truly is the most vital skill for an integrator to nurture.

There are many, many kinds of substitution. Spotting which kind of substitution to use, and what exact substitution to make, is often nontrivial. It takes much practice to build up the intuition for spotting what works in a particular scenario, but with enough experience, it can become almost second nature.

What is Substitution?

A substitution rewrites an integral in terms of a different variable. For instance, if we have the integral

3(3x+1)2 dxint 3(3x+1)^2 dx

This is an integral with respect to the variable xx. We can choose to let a new variable tt denote the expression 3x+13x + 1.

3x+1=t3x+1 = t

Then we go through our expression, replacing each xx with the equivalent expression in terms of tt.

3(3x+1)2 dx=3t2 dxegin{align*} int 3( extcolor{#4d9dcd}{3x+1})^2 dx &= int 3 extcolor{#4d9dcd}{t}^2 dx end{align*}

What this has done is made the integral so much nicer to deal with. More importantly, it’s in a form we can now easily integrate. So substitution is a way of simplifying integrals that lets us tackle them in steps.

However, we cannot integrate this quite yet, as we can only integrate with respect to tt, not xx. So we also need to replace dxdx with the corresponding dtdt.

Substituting with Respect

There are many ways to think about this, but the end results are the same. To get our integral in terms of dtdt instead of dxdx, we’ll find a way to relate dtdt to dxdx.

Let’s take our substitution, and differentiate both sides with respect to xx:

t=3x+1ddx(t)=ddx(3x+1)dtdx=3egin{align*} t &= 3x+1 \ rac{d}{dx} left( t ight) &= rac{d}{dx} left( 3x+1 ight) \ rac{dt}{dx} &= 3 end{align*}

Now we can move dxdx over to the other side and divide by 33 to obtain our relationship between dtdt and dxdx:

dt=3 dxdx=13 dtegin{align*} dt &= 3 dx \ dx &= rac{1}{3} dt end{align*}

dxdx and dtdt here represent differentials, so writing the relationship between them in this way is known as differential form.

And algebraically, everything will work out fine if we simply replace dxdx in our integral with this expression of dtdt. We end up with an integral only in terms of tt, which we can then evaluate.

3t2 dx=3t213 dt=t2 dt=13t3+cegin{align*} int 3t^2 extcolor{#4d9dcd}{dx} &= int 3t^2 cdot extcolor{#4d9dcd}{ rac{1}{3} dt} \ &= int t^2 dt \ &= rac{1}{3} t^3 + c end{align*}

After carrying out the integration, remember to ‘undo’ your substitution by putting xx back in:

13t3+c=13(3x+1)+c rac{1}{3} extcolor{#4d9dcd}{t}^3 + c = rac{1}{3} left( 3x+1 ight) + c

Deriving Substitution

So why exactly does substitution work? It’s perhaps easy to see when the substitution is just a translation (t=x+at = x + a), but things can quickly become unintuitive once we start scaling (t=axt = ax) or nesting functions (t=g(x)t = g(x)).

Fundamentally, substitution in integration is just the equivalent of chain rule from differentiation (except in reverse of course). When we differentiate nested functions, the derivative ‘pops out’:

ddxf(g(x))=f(g(x))g(x) rac{d}{dx} f(g(x)) = f'(g(x)) cdot extcolor{#4d9dcd}{g'(x)}

The way to prove this is to consider the ’chain’ of variables. Let t=g(x)t = g(x) to denote the nested function, then y=f(g(x))=f(t)y = f(g(x)) = f(t). We ‘split’ the derivative to incorporate the intermediate variable tt:

dydx=dydtdtdx=f(t)g(x)=f(g(x))g(x)egin{align*} rac{dy}{dx} &= rac{dy}{dt} cdot rac{dt}{dx} \ &= f'(t) , g'(x) \ &= f'(g(x)), g'(x) end{align*}

Well, shouldn’t be too hard to go back the other way.

If we find dtdx\frac{dt}{dx}, multiplying the integral by it will cancel dxdx, leaving just dtdt.

dxdxdt=dtint dx cdot rac{dx}{dt} = int dt

Or, more simply, we can find an expression for dxdx in terms of dtdt, and replace it directly.

Residual Derivative

Multiple Substitution