I found an algorithm that used an approximation of $\log_2(x + 1)$ from $0$ to $1$ which simply followed the line $y = x + k$ where $k$ was some constant they discovered to be something like $0.043$. I wondered where this constant came from, so I made a Desmos page to experiment, and I graphically figured out what it represents.
My idea was that I'd have to take the integral of the absolute value of the difference between the two curves and then minimize the average height with respect to the value $k$. This did not yield the desired $0.043$ that the algorithm used. However, the $0.043$ seems to be the number where the integral is spread out the most "evenly" (another way I could see it is that the maximum error is minimized. Notice how the maxima all match up). Is there some sort of statistical or actual algebraic way of expressing this? I'm just a high school student, I'd love to know how to express my findings better. Cheers.

Let me explain my comments.
Let’s call $\alpha:= \log_2(1/\ln(2))-1/\ln(2)+1$. In this answer, we want to find $k\in\mathbb R$ so that the following quantity is minimized: $$ \max_{[0,1]} |\log_2(x+1)-(x+k)|=:err(k) $$ (the second quantity in RobPratt’s answer), which I called the maximum error, and which I believe is the quantity that is being minimized in the algorithm mentioned by the OP.
By definition of absolute value, and by some properties of $\max$ and $\min$, we have that $$ err(k)=\max\left\{\;\max_{[0,1]} \log_2(x+1)-(x+k)\;\;,\;\; \max_{[0,1]} -\left(\log_2(x+1)-(x+k)\right)\;\right\}=$$ $$ =\max\left\{M-k,-m+k\right\}, $$ where $M$ and $m$ are respectively the maximum and minimum values of the function $f(x)=\log_2(x+1)-x$.
Now one can use some Analysis, computing the derivative of the above function and studying its graph (I will skip this as I think it should not be difficult), to conclude that $$ M=\alpha\qquad\text{and}\qquad m=0. $$ It follows that $$ err(k)=\max\{\alpha-k,k\}=|k-\alpha/2|+\alpha/2 $$ achieves its minimum value when $k=\alpha/2$, meaning $$err(\alpha/2)=\alpha/2.$$ As I mentioned, this value is approximately $\alpha/2= 0.04303566$, pretty close to the number $0.043$ in the question.
Hence the division by $2$. It’s essentially because of the absolute value. If one looks at the graph I think can figure out why we have to choose $k$ to be half of $err(0)$.