## Thursday, 19 June 2014

I was always kind of sad that I was never taught how to derive the quadratic formula in school. We were taught how to use it, but it felt wrong to use an equation when I didn't understand where it came from or why it worked. At any rate it seemed very mysterious to me. I could understand there being a square root in the equation, but what was the '4ac' term doing there?

I tried to derive the quadratic formula for myself. My formula ended up looking a bit different from the one that we all know, but as far as I could tell it worked. I later learned that it only worked for a = 1, which I then fixed. I now know that it was just another way of expressing the quadratic formula, albeit in a much less convenient form. My chemistry teacher, as it so happens, criticised it for having too many fractions after an associate of mine pointed it out to him.

But that was many years ago, and even if I still had the notes, they would be buried under thousands of pages of school work. I'll do my best to recreate the approach I took here, but you could probably find a similar derivation somewhere. It was based on a trick I was taught in class for solving quadratic equations.

For example, what is the value of x if x² + 4x + 3 = 0? We can easily solve the equation x² + 4x + 4 = 0, which we can factorise into (x + 2)² = 0, at which point it becomes obvious that x = -2. We can utilise this result by noting that x² + 4x + 3 = (x² + 4x + 4) - 1 = (x + 2)² - 1 = 0. Rearranging gives (x + 2)² = 1, so it becomes immediately obvious that x = -2 ± 1, that is, x = -1 or x = -3. Note that the former equation only had one possible solution (or the positive and negative solutions were the same, if you prefer).
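The trick can be sketched in a few lines of Python for the a = 1 case. This is only an illustration: the name `solve_monic` and the parameters `p` and `q` (for x² + px + q = 0) are my own, and it only handles real roots.

```python
import math

def solve_monic(p, q):
    """Solve x^2 + p*x + q = 0 by completing the square (real roots only)."""
    half = p / 2            # (x + half)^2 expands to x^2 + p*x + half^2
    shift = half**2 - q     # so x^2 + p*x + q = (x + half)^2 - shift
    if shift < 0:
        raise ValueError("no real roots")
    root = math.sqrt(shift)
    return (-half - root, -half + root)

print(solve_monic(4, 3))  # (-3.0, -1.0)
print(solve_monic(4, 4))  # (-2.0, -2.0)
```

The second call is the "immediately solvable" equation, where the shift is zero and both roots coincide.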

Basically, our approach is to relate the quadratic equation of our choosing to an equation that is immediately solvable, and then use the results of that equation to solve the one that we want. This is essentially what the quadratic formula is doing for us. Anyway, here's the equation that we have to unpack:

[1]

We need to generalise our trick if we wish to use it for any coefficients of [1] that might come our way. In each and every case, our quadratic equation will end up taking the form:

[2]

Being careful not to confuse A, B and C with the coefficients of [1], we can easily solve for 'x' as:

[3]

Already we see a striking similarity with the quadratic formula that we all know, but we want to express 'x' in terms of the coefficients presented in [1]. To do this, we need to establish the rules that relate the coefficients of [1] to A, B and C, something that we didn't have to do when we dealt with individual equations. We do this by expanding [2] and then equating the coefficients with [1]. So firstly:

[4]

Then, comparing the RHS with equation [1], we can establish that:

[5]

[6]

[7]

Finally, we can substitute [5], [6] and [7] into equation [3] to get

[8]

It was probably around [8] that I originally stopped and called it a day, but we can simplify this equation by taking the '4a' out of the square root, like so:

[9]

[10]
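For reference, here is one consistent way that steps [1] to [10] could be written out. It assumes that [2] takes the form (Ax + B)² - C = 0, which fits the description above; the sign convention for C in particular is an assumption. It also assumes a > 0 when the '4a' is pulled out of the square root.

```latex
\begin{align}
ax^2 + bx + c &= 0 \tag{1} \\
(Ax + B)^2 - C &= 0 \tag{2} \\
x &= \frac{-B \pm \sqrt{C}}{A} \tag{3} \\
(Ax + B)^2 - C &= A^2x^2 + 2ABx + B^2 - C \tag{4} \\
A &= \sqrt{a} \tag{5} \\
B &= \frac{b}{2A} = \frac{b}{2\sqrt{a}} \tag{6} \\
C &= B^2 - c = \frac{b^2}{4a} - c \tag{7} \\
x &= \frac{-\frac{b}{2\sqrt{a}} \pm \sqrt{\frac{b^2}{4a} - c}}{\sqrt{a}} \tag{8} \\
x &= \frac{-\frac{b}{2\sqrt{a}} \pm \frac{\sqrt{b^2 - 4ac}}{2\sqrt{a}}}{\sqrt{a}} \tag{9} \\
x &= \frac{-b \pm \sqrt{b^2 - 4ac}}{2a} \tag{10}
\end{align}
```

In this reading, the '4ac' term comes from putting b²/(4a) - c over the common denominator 4a.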

You might have noticed that although I had taken care to include both the positive and negative square root in [3], I used only the positive square root for [5]. Sadly, there are no insights to be drawn from this: the negative square root just leads to the same quadratic formula.
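That claim is easy to check numerically. The sketch below assumes the intermediate form is (Ax + B)² - C = 0 with A = ±√a, B = b/(2A) and C = B² - c; the function names and the `sign` parameter are hypothetical, chosen just for this illustration. It uses `cmath` so that a negative 'a' or a negative value under the square root still goes through.

```python
import cmath

def roots_via_square(a, b, c, sign=1):
    """Roots of a*x^2 + b*x + c = 0 via the assumed form (A*x + B)^2 - C = 0.

    sign=1 picks A = +sqrt(a); sign=-1 picks A = -sqrt(a).
    """
    A = sign * cmath.sqrt(a)
    B = b / (2 * A)
    C = B * B - c
    r = cmath.sqrt(C)
    return ((-B + r) / A, (-B - r) / A)

def roots_standard(a, b, c):
    """Roots from the familiar formula (-b +/- sqrt(b^2 - 4ac)) / (2a)."""
    d = cmath.sqrt(b * b - 4 * a * c)
    return ((-b + d) / (2 * a), (-b - d) / (2 * a))

def same_roots(r1, r2, tol=1e-9):
    """True if two pairs contain the same roots, in either order."""
    direct = abs(r1[0] - r2[0]) < tol and abs(r1[1] - r2[1]) < tol
    swapped = abs(r1[0] - r2[1]) < tol and abs(r1[1] - r2[0]) < tol
    return direct or swapped

# Both choices of sign should agree with the standard formula.
for coeffs in [(1, 4, 3), (2, -7, 3), (-1, 0, 1), (1, 2, 5)]:
    expected = roots_standard(*coeffs)
    assert same_roots(roots_via_square(*coeffs, sign=1), expected)
    assert same_roots(roots_via_square(*coeffs, sign=-1), expected)
```

Flipping the sign of A also flips the sign of B, so the two changes cancel in (-B ± √C)/A and only the order of the two roots is swapped.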