How does the following theorem make sense? I understand that the greater the magnitude of $f''(a)$ and $∣x-a∣$ are, the greater the error should be, but how do we actually get a formula? What is the intuition?

Thanks.
How does the following theorem make sense? I understand that the greater the magnitude of $f''(a)$ and $∣x-a∣$ are, the greater the error should be, but how do we actually get a formula? What is the intuition?

Thanks.
They are comparing it a parabola with constant concavity M. They have said that if the function is less concave then a parabola with concavity M, the error must be less than that parabola. You can draw a few examples and convince yourself of that.