Skip to content

Differentiation from first principles

Differentiation from first principles essentially involves finding the derivative ‘manually’, without using the power rule.

Normally, we could differentiate a function like by using the power rule:

But from first principles, we would do it like this:

So we get the same answer of , but we had to do a lot more work to get there! Unfortunately, in an exam, they might ask you to do it this way, so it’s worth learning :(

The notation means ‘the limit as approaches ’. In this context, it essentially means that we are considering values of that are incredibly close to , but not actually equal to . This is important because if we were to substitute directly into the expression , we would end up with a division by zero, which you may know that you can’t exactly do :)

Some examples of differentiation from first principles

Section titled “Some examples of differentiation from first principles”
  • Answer:
  • Answer:

Proof that the derivative of a constant is zero

Section titled “Proof that the derivative of a constant is zero”

One nice thing about differentiation from first principles is that it allows us to prove some stuff that otherwise we just have to accept as true! For example, you may have realised that the derivative of a constant function is always zero. But why is that true?

One answer is that, with the power rule, we multiply the coefficient of by the power of , and then reduce the power by . For a constant function, it’s the same thing as having a coefficient of !

So when we differentiate it, we take the power of (0) to the front, and so multiply the coefficient by , which gives us (nothing).

That’s one explanation as to why the derivative of a constant is zero.

Let’s try our first principles method on a constant function, say :

So we see that the derivative of a constant function is indeed zero, as we expected!

In fact, we don’t even need the condition that here, because the expression is always zero, no matter what is (as long as it’s not zero, because we can’t divide by zero!).

This is important, because it shows us that the gradient of a constant function is always zero, even if we consider values of that are not close to zero (measuring the gradient of a ‘larger triangle’ of the graph). That’s because a constant function is just a flat horizontal line, so the gradient is always zero!