Differentiation from first principles

Differentiation from first principles essentially involves finding the derivative ‘manually’, without using the power rule.

Normally, we could differentiate a function like by using the power rule:

But from first principles, we would do it like this:

So we get the same answer of , but we had to do a lot more work to get there! Unfortunately, in an exam, they might ask you to do it this way, so it’s worth learning :(

The meaning of

The notation means ‘the limit as approaches ’. In this context, it essentially means that we are considering values of that are incredibly close to , but not actually equal to . This is important because if we were to substitute directly into the expression , we would end up with a division by zero, which you may know that you can’t exactly do :)

Some examples of differentiation from first principles

Differentiate from first principles

Answer:

Differentiate from first principles

Answer:

Proof that the derivative of a constant is zero

One nice thing about differentiation from first principles is that it allows us to prove some stuff that otherwise we just have to accept as true! For example, you may have realised that the derivative of a constant function is always zero. But why is that true?

The power rule answer

One answer is that, with the power rule, we multiply the coefficient of by the power of , and then reduce the power by . For a constant function, it’s the same thing as having a coefficient of !

So when we differentiate it, we take the power of (0) to the front, and so multiply the coefficient by , which gives us (nothing).

That’s one explanation as to why the derivative of a constant is zero.

The first principles answer

Let’s try our first principles method on a constant function, say :

So we see that the derivative of a constant function is indeed zero, as we expected!

In fact, we don’t even need the condition that here, because the expression is always zero, no matter what is (as long as it’s not zero, because we can’t divide by zero!).

This is important, because it shows us that the gradient of a constant function is always zero, even if we consider values of that are not close to zero (measuring the gradient of a ‘larger triangle’ of the graph). That’s because a constant function is just a flat horizontal line, so the gradient is always zero!