I am trying to better understand related rates problems. Prior to being exposed to related rates problems, the derivatives I solved were always with respect to the independent variable of the equation, e.g. $y=x^2, \frac{dy}{dx}=2x$. With related rates, the problems are solved by taking the derivatives with respect to time, i.e.$\frac{dy}{dt}=2x\frac{dx}{dt}$. Although I am able to solve the problems, I am not comfortable with the switch from taking a derivative with respect to the independent variable x to taking the derivative with respect to a third variable time. I’ve thought long and hard about my issue and I think I have a solution. Would it be correct to say that the instantaneous rate of change of a function with respect to its independent variable, i.e. its derivative, gives the ratio of related rates? In other words, I can determine the ratio of related rates at a specific point by finding the instantaneous rate of change of the function at that point.
For example, if I wanted to know the rate of change with respect to time of the area of a circle given the rate of change in radius with respect to time, I would start by taking the derivative of the area equation $A=πr^2$ with respect to r to find the instantaneous rate of change of A with respect to r; then to find the rate of change of area with respect to time at a specified radius, I would find the instantaneous rate of change at that radius and multiply it by the rate of change of radius with time, i.e. $\frac{dA}{dr}=\frac{d}{dr}[\pi r^2]=2\pi r$, $\frac{dA}{dt}=\frac{dA}{dr} \frac{dr}{dt}$, $\frac{dA}{dt}=2\pi r\frac{dr}{dt}$.
If the problem involved more than one independent variable, I could find the contribution from each independent variable separately and sum them together. For example, if I wanted to know the rate of change in the volume of a cylinder when both the height and radius are changing with time. I could find the instantaneous rate of change of the volume with respect to the cylinder height and use it to find the rate of change of volume with time due to the change of height with time, i.e. $\frac{dV}{dh}=\frac{d}{dh}[\pi r^2h]=\pi r^2$, $\frac{dV}{dt}=\frac{dV}{dh} \frac{dh}{dt}$, $\frac{dV}{dt}=\pi r^2\frac{dh}{dt}$; and then find the instantaneous rate of change of the volume with respect to the cylinder radius and use it to find the rate of change of volume with time due to the change of radius with time, i.e. $\frac{dV}{dr}=\frac{d}{dr}[\pi r^2h]=2\pi rh$, $\frac{dV}{dt}=\frac{dV}{dr} \frac{dr}{dt}$, $\frac{dV}{dt}=2\pi rh\frac{dr}{dt}$. The total rate of change of volume with time would be the sum of the change due to the height changing with time plus the change due to the radius changing with time, i.e. $\frac{dV}{dt}=\pi r^2\frac{dh}{dt}+2\pi rh\frac{dr}{dt}$.
Am I thinking about the math correctly? Thank you for your insight.
While you get the correct results, you don't offer any reasoning that explains your motivation for doing this, which means we cannot tell if you are thinking about the math correctly.
Quite frankly, it makes me suspicious that you are looking at the chain rule (single and multi-variable) - which can be found in many places - and just tossing operations together that will get you to them. If so, then you are most certainly not thinking about the math correctly. These are not pulled out of thin air. They are developed from principles.
First of all, it is not "dependent $y$, independent $x$, third variable $t$". Instead, $t$ is independent, $x$ is dependent on $t$, and $y$ is dependent on $x$ (which makes it dependent on $t$ as well).
(What follows is "heuristic", which means it gives only the broad outline, skipping over fiddly details that are important to serious mathematics, but get in the way of following the overall logic.)
So what happens if we make a change $\Delta t$ in the value of $t$? Then $x$ changes by some amount $\Delta x$, and in turn, this causes the value of $y$ to change by some amount $\Delta y$. The ratio of the change in $y$ with respect to the change in $t$ is $$\frac{\Delta y}{\Delta t}$$ Assuming that $\Delta x \ne 0$, we can multiply and divide by it without changing the overall value, and do some rearranging: $$\begin{align}\frac{\Delta y}{\Delta t} &= \frac{\Delta y\Delta x}{\Delta t\Delta x}\\&= \frac{\Delta y\Delta x}{\Delta x\Delta t} \\&= \frac{\Delta y}{\Delta x}\frac{\Delta x}{\Delta t}\end{align}$$ If we let $\Delta t \to 0$, then $\frac{\Delta y}{\Delta t} \to \frac {dy}{dt}$ and $\frac{\Delta x}{\Delta t} \to \frac {dx}{dt}$. This only converges, though, if $\Delta x \to 0$. But when that holds then, $\frac{\Delta y}{\Delta x} \to \frac {dy}{dx}$. Thus the previous calculation becomes the chain-rule: $$\frac {dy}{dt} = \frac{dy}{dx}\frac{dx}{dt}$$ Note that even though math teachers will warn you not to think of derivatives as a simple division, the chain rule really does come from a cancellation of the "dx" parts (but before the limit is taken).
In the multivariate case, functional notation is useful: $z = f(x, y)$ (two variables is easier to follow, but it scales up to any number of variables). Now we suppose that $x$ and $y$ are themselves dependent on some variable $t$ (and so $z$ will be dependent on $t$ as well). Then a change in $t$ induces a change in each of the other variables:
$$\Delta z = f(x + \Delta x, y + \Delta y) - f(x,y)$$ By adding and subtracting $f(x, y + \Delta y)$ we can get changes in one variable at a time: $$\Delta z = [f(x + \Delta x, y + \Delta y) - f(x, y + \Delta y)] + [f(x, y + \Delta y) - f(x,y)]$$ Similarly treating each of the two differences as before: $$\frac{\Delta z}{\Delta t} = \frac{f(x + \Delta x, y + \Delta y) - f(x, y + \Delta y)}{\Delta t} + \frac{f(x, y + \Delta y) - f(x,y)}{\Delta t}\\=\frac{f(x + \Delta x, y + \Delta y) - f(x, y + \Delta y)}{\Delta x}\frac{\Delta x}{\Delta t} + \frac{f(x, y + \Delta y) - f(x,y)}{\Delta y}\frac{\Delta y}{\Delta t}$$
Now as $\Delta y \to 0$, $$\frac{f(x, y + \Delta y) - f(x,y)}{\Delta y} \to \frac{\partial z}{\partial y}$$ where we use the "partial derivative" $\partial$ to remind you that the value of this derivative depends not only on how $z$ changes with $y$, but also on the particular variable $x$ that was the "partner" of $y$. I.e., if we had another variable $u$ dependent on $t$ such that $z(t) = f(x(t), y(t)) = g(u(t), y(t))$ for the same $y$, we would find in general that $$\frac{\partial f}{\partial y} \ne \frac{\partial g}{\partial y}$$ even though both can be expressed as $\frac{\partial z}{\partial y}$. Even though it isn't explicitly mentioned, $\frac{\partial z}{\partial y}$ depends on whether $y$ is being paired with $x$ or with $u$.
Similarly as $\Delta x \to 0$, $$\frac{f(x + \Delta x, y + \Delta y) - f(x,y + \Delta y)}{\Delta y} \to \frac{\partial z}{\partial x}(x,y + \Delta y)$$ which, when $\Delta y \to 0$ as well, goes to $\frac{\partial z}{\partial x}(x,y)$.
So letting $\Delta t \to 0$, which also sends $\Delta x \to 0$ and $\Delta z \to 0$, we get:
$$\frac{dz}{dt} = \frac{\partial z}{\partial x}\frac{dx}{dt} + \frac{\partial z}{\partial y}\frac{dy}{dt}$$
If you think of a rectangle in the $xy$ plane with sides of $dx$ and $dy$, then $dz$ jumps from the value at the corner $(x, y)$ to the diagonally opposite corner $(x+dx, y+dy)$. But $\frac{\partial z}{\partial x}dx + \frac{\partial z}{\partial y}dy$ goes around the side, breaking the path into two parts, when first only $x$ changes, then only $y$ changes. In the end, you get to the same place, but by a different path.