Problem:
Consider fitting the following data (y-values):
$$ 16, 60, 180, 400, 700, 1200, 2000, 3000, 4300, 6000, 8000, 10500$$
where the x-values range from 1 to 12
into the following model:
$$
y = \alpha + \beta x^\gamma
$$
where $\alpha,\beta \in \mathbb{N}$ and $\gamma \in \mathbb{R}$.
The integral constraints may be relaxed. Exponential model may be considered.
Attempts and Questions:
I've tried:
- Building the model in Geogebra and fiddling around with the parameters.
- Using Wolfram Alpha, applying power fit, and exponential fit directly on the data. Wolfram uses least-squares best fit. The integral constraints were not satisfied.
Linearise the model based on Ian Miller's answer using Excel with $\beta = 11.05077$ and $\gamma = 2.6908$.
- What would be a general way to approach this sort of problem?
Ignoring $\alpha$, is my model equivalent to the one used in wolfram?(confusion on my part, addressed)- Should I minimise the error using percentage (relative error) instead of traditional squared difference? $10500 \pm 200$ shold be more tolerable compared to $16 \pm 200$, for instance.
Here are the results: $$ \begin{array}{cc|cccc} & & \text{Geogebra} & \text{Wolfram (power)} & \text{Wolfram (exponential)} & \text{Linearisation} \\ x & y & 4x^{3.17} & 4.88906x^{3.08717} & 198.887e^{0.33343x} & 11.05077x^{2.6908}\\ \hline 1 & 16 & 6 & 4.88906 & 277.596 & 11.05077\\ 2 & 60 & 48 & 41.5486 & 387.4539 & 71.35166\\ 3 & 180 & 162 & 145.271 & 540.7877 & 212.4371\\ 4 & 400 & 384 & 353.091 & 754.803 & 460.6971915\\ 5 & 700 & 750 & 703.177 & 1053.514 & 839.81\\ 6 & 1200 & 1296 & 1234.56 & 1470.44 & 1371.646\\ 7 & 2000 & 2058 & 1986.95 & 2052.362 & 2076.741\\ 8 & 3000 & 3072 & 3000.67 & 2864.579 & 2974.59\\ 9 & 4300 & 4374 & 4316.52 & 3998.229 & 4083.836\\ 10 & 6000 & 6000 & 5975.79 & 5580.518 & 5422.412\\ 11 & 8000 & 7986 & 8020.13 & 7788.993 & 7007.643\\ 12 & 10500 & 10368 & 10491.6 & 10871.47 & 8856.323\\ \end{array} $$
I have been doing almost the same as Julián Aguirre did. Using $\alpha,\beta \in \mathbb{N}$ and $\gamma \in \mathbb{R}$ as yo you wished, I varied parameters $\alpha,\beta$ (running the calculation over a quite large grid o values) until I get a minimum value of the sum of squares.
It appears that, whatever $\alpha$ could be, $\beta=5$ always leads to the minimum value of the sum of squares. For this value of $\beta$, the following table gives the sum of squares as a function of $\alpha$ in the area of the minimum of $SSQ$. $$\left( \begin{array}{cc} \alpha & SSQ \\ 0 & 6988 \\ 1 & 6908 \\ 2 & 6842 \\ 3 & 6790 \\ 4 & 6752 \\ 5 & 6729 \\ 6 & 6719 \\ 7 & 6724 \\ 8 & 6743 \\ 9 & 6776 \end{array} \right)$$
while $\alpha=23$ would lead to $SSQ=8723$.
For the optimum value $(\alpha=6)$, rounded to next integer, the predicted values are $$\{11,48,153,362,714,1247,2000,3013,4327,5982,8019,10479\}$$ the residuals being $$\{5,12,27,38,-14,-47,0,-13,-27,18,-19,21\}$$ and the model is $$y=6+5 \,x^{3.07743}$$