$\exists-$ states that: if $\Sigma, A(u) \vdash B$ where u occurs nowhere else, then $\Sigma, \exists x A(x) \vdash B$.
Why does this translate to $\exists$ and not $\forall$? Intuitively, $A(u)$ where $u$ occurs nowhere else should mean that $u$ can be any arbitrary thing, so shouldn't $\Sigma, A(u) \vdash B$ be valid for any conceivable $u$ (a.k.a, all $x$) meaning $\Sigma, \exists xA(x) \vdash B$?
This is giving me trouble mostly because of the below theorem I've already learned:
Why is it that in this case $A(u)$ can translate in to a "for all" while in $\exists-$ it only gives us "there exists"? In both cases $u$ is arbitrary, so it should refer to anything$-$"for all"$-$in my head.

The term $u$ is arbitrary yes, but what isn't arbitrary is the fact that we have a derivation $\Sigma, A(u) \vdash B$.
What does it mean to have $\Sigma, A(u) \vdash B$? It means that, assuming $\Sigma$ and $A(u)$ (for some particular $u$), we can prove $B$. Moreover, since $u$ doesn't appear anywhere else, it can't be a "proper" part of the proof: all that is necessary to prove $B$ is that $A(x)$ is true for some x. The $\exists- $ says precisely this, that we don't care about the specific $u$ that satisfies $A(x)$, only that there is something which satisfies it.
On the other hand, suppose we can prove $A(u)$ from $\Sigma$ without assuming $u$. This means that the proof of $A(u)$ can't really "depend" on $u$, so we should be able to prove $A(x)$ for any $x$. And this is what the $\forall +$ rule says.